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Preface 


When I was asked by K. K. Phua to do a book for World Scientific based on my 
work, he suggested a volume of essays or a reprint volume. I have decided to combine 
these two suggestions into one, by preparing a reprint volume with commentaries. 
Some of the commentaries are drawn from historical articles that I have written 
for publication, others are drawn from unpublished historical accounts written for 
institutional archives, and yet others have been written expressly for this volume. In 
the commentaries, I try to relate the reprinted articles to the time-line of my career, 
and at the same time to analyze their relations with the work of other physicists 
whose work influenced mine and vice versa. 

In keeping with these dual aims, I have arranged the articles and the commen¬ 
taries in approximately chronological order, but occasionally deviate from strict 
chronology in order to group topically related articles together. In choosing which 
articles to include, I have been guided by two generally coinciding measures, my own 
estimate of significance, and the citation count. However, in occasional cases I have 
included infrequently cited articles where I felt that there was an interesting related 
story to tell. Often, when finishing a line of work, I have written a long summarizing 
article or review; some of these are too long to be included in their entirety, and so I 
have included in the reprints only the sections most relevant to the narrative in the 
commentaries. Similarly, I have not included among the reprints the summer school 
lectures I have given on current algebras, anomalies, and neutrino physics, but refer¬ 
ences to them appear in the commentaries. In the last decade, I have published two 
books related to my work on generalized forms of quantum mechanics, and included 
many research results directly in these books in lieu of first writing papers. It is 
feasible to give only brief descriptions of these projects in the commentaries; I have 
included just a few papers from this period, all in the nature of follow-ons to the 
first book. 

In both the texts of the commentaries and the reference lists that follow them, 
reprinted articles are identified by a sans serif R, so that for example, R1 designates 
the first reprinted article. Numbers in square brackets following each reference in 
the reference lists give the pages in the commentaries where that reference is cited. 
There is also an index of names following the commentaries, and a list of detailed 
chapter subheadings in the Table of Contents. 

I wish to thank Tian-Yu Cao for a critical reading of the commentaries and much 
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helpful advice, Alfred Mueller for a helpful conversation on renormalon ambiguities, 
Richard Haymaker for a clarifying email on dual superconductivity parameters, and 
William Marciano, Robert Oakes, and Alberto Sirlin for calling my attention to 
relevant references. I also wish to thank the following people for sending me help¬ 
ful comments on the initial draft of the commentaries after it was posted on the 
archive as hep-ph/0505177: Nikolay Achasov, Dimi Chakalov, Christopher Hill, Ro¬ 
man Jackiw, Andrei Kataev, Peter Minkowski, Herbert Neuberger, and Lalit Sehgal. 
I am grateful to Antonino Zichichi for permission to use the quote from Gilberto 
Bernardini in Chapter 2, to Mary Bell for permission to use the quote from John 
Bell in Chapter 3, to James Bjorken for permission use his quote in Chapter 3, and 
to Clifford Taubes for helpful email correspondence and permission to use his quotes 
in Chapter 7. 

My editor at World Scientific, Kim Tan, has given valuable assistance throughout 
this project. Miriam Peterson and Margaret Best have patiently assisted in the 
conversion of my TeX drafts to camera-ready copy and with indexing, the latter a 
task that was shared with Lisa Fleischer and Michelle Sage. I am also indebted to 
Momota Ganguli and Judy Wilson-Smith for bibliographic searches, to Christopher 
McCafferty and James Stephens for help with computer problems, and to Marcia 
Tucker and Herman Joachim for assistance, respectively, in scanning and duplicating 
certain of the papers to be reprinted. Finally, I wish to express my appreciation to the 
Institute for Advanced Study (abbreviated throughout the commentaries as IAS) for 
its support of my work, first from 1966 to 1969, when I was a Long Term Member, 
and then from 1969 onwards, when I have been a member of the Faculty, in the 
School of Natural Sciences. My work has also been supported by the Department of 
Energy under Grant No. DE-FG02-90ER40542. 

In addition to the publishers acknowledged on each individual reprint, I also 
wish to thank World Scientific for the use of material originally prepared for their 
volumes commemorating the 50th anniversary of Yang-Mills theory. Chapter 3 on 
anomalies is largely based on an essay I contributed to 50 Years of Yang-Mills The¬ 
ory , edited by G. ’t Hooft, and the parts of Chapters 7 and 9 dealing respectively 
with monopoles and projective group representations are based on an essay I wrote 
for a projected companion volume on the influence of Yang-Mills theory on math¬ 
ematics. Also, some material in Chapters 2 and 3 overlaps with the contents of a 
letter on antecedents of asymptotic freedom that I wrote to Physics Today , which 
appears in the September, 2006 issue. 
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1. Early Years, and Condensed Matter Physics 


A brief synopsis of my career appears in an article that I wrote recently for the Abdus 
Salam International Centre for Theoretical Physics (Adler, 2004, Rl), which includes 
a description of events when I was young that led to my becoming a theoretical 
physicist. The focus of this article is on the career path that led to my work in high 
energy physics. However, before I published anything in high energy theory, I spent 
several summers working in industrial research laboratory jobs in condensed matter 
physics, and it was this work that led to my first scientific publications. 

By the end of my junior year at Harvard, I had taken courses in quantum me¬ 
chanics and also in condensed matter physics (then called solid state physics). With 
this background, during the summer of 1960, I got a job working for Joseph Birman, 
who at that time (before going on to Professorships at New York University and then 
City College of the City University of New York) headed a section studying electrolu¬ 
minescence at the General Telephone and Electronics (GT&E) Research Laboratory. 
This industrial research laboratory, formerly the Sylvania Research Laboratory, was 
conveniently located a few miles from where my family lived in Bayside, Queens. I 
had a desk in an office looking out over the entrance to the Long Island Sound, from 
which I could see sections of roadway being hoisted into place on the Throgs Neck 
Bridge, then under construction. 

During my first weeks at GT&E, Joe got me started learning some basic group 
theory as applied to crystal structures, and then suggested the problem of using these 
group theory methods to check a formula that Hopfield (1960) had given relating 
band theory structures in hexagonal and cubic variants of zinc sulfide (ZnS) and 
related compounds, substances that Joe had been studying (Birman, 1959) with an 
eye to electroluminescence applications. This turned out to be basically a technical 
exercise and confirmed Hopfield’s results. In the course of this work, which I finally 
wrote up a year later (Adler, 1962a, R2), I also attempted an a priori estimate 
of a parameter determined by experimental fits to the Hopfield formula. This got 
me interested in the Ewald sum method for doing crystal lattice sums, on which 
I wrote a paper (Adler, 1961) giving generalized results for sums over lattices of 
functions /(r)Yg m (0, 0), with Y^ m a spherical harmonic and f(r) a radial function 
representable as a transform by f(r) = r £ exp(— r 2 t)g(t)dt. These two pieces of 
work stemming from my summer at GT&E were my first scientific publications. 
With Joe’s encouragement, I also gave a 10 minute contributed paper (Adler and 
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Birman, 1961) on the ZnS work at the New York meeting of the American Physical 
Society the following winter, while I was back home on inter-term break from college. 
Since this was my first conference talk, I typed out a text and went over it so many 
times that I knew it by heart. After my talk, Joe said words to the effect, “That 
was fine, but next time you give a talk don’t sound like it was memorized”, wisdom 
that I have taken to heart on many subsequent occasions! 

When I returned to Harvard for my senior year I was told by some of the fac¬ 
ulty that Henry Ehrenreich from the General Electric (GE) Research Laboratory 
was on leave at Harvard that year, and was giving the graduate course on solid 
state physics, covering substantially different material from what I had heard the 
year before. I attended Henry’s lectures, which included a calculation of the en¬ 
ergy and wave-number dependent dielectric constant in isotropic solids, using the 
self-consistent field or energy-band approximation, along the lines of the treatment 
given in Ehrenreich and Cohen (1959). I got to know Henry outside the classroom 
as well, and he invited me to work at the GE Research Laboratory in Schenectady, 
NY the following summer, after my graduation from college in June 1961. This was 
appealing in a number of ways, since my family had moved to Bennington, VT the 
year before, about an hour’s drive away from Schenectady, and so I was able to 
drive home for a visit on weekends. At GE, Henry suggested that I generalize the 
treatment of the dielectric constant that he and Cohen had given so as to include 
various effects of interest in real solids. In the paper that resulted (Adler, 1962b, 
R3), I calculated the full frequency and wave-number dependent dielectric tensor in 
the energy-band approximation, including tensor components that couple longitu¬ 
dinal and transverse electromagnetic disturbances, which are absent in the isotropic 
approximation but are present even in solids with cubic symmetry. The longitudi¬ 
nal to longitudinal component of the general dielectric tensor reduces to the result 
obtained by Ehrenreich and Cohen when various identities (reflecting charge conser¬ 
vation and gauge invariance, as well as symmetries) are used. I also gave a method, 
based on an analysis of “Umklapp” processes that couple wave numbers differing by 
a reciprocal lattice vector, together with use of a multipole expansion, for calculating 
local field corrections to the dielectric constant, giving a modified Lorenz-Lorentz 
formula. (Local field corrections were also studied by Cohen’s student Nathan Wiser 
(1963) by a different method.) My paper on the dielectric constant in real solids has 
been widely cited in the subsequent condensed matter literature, reflecting its rele¬ 
vance for spectroscopic studies of solids, as well as its generalizations to nonlinear 
dielectric behavior. 

Although I had decided to focus on elementary particle theory for my graduate 
study in Princeton, I retained an interest in solid state physics, and returned to 
GE for half of the summer of 1962 to work again with Henry Ehrenreich, this time 
publishing a paper (Adler, 1963) in which I applied the dielectric constant results of 
the previous summer to the theory of hot electron energy loss in solids. Not long after 
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this visit, Henry left GE to take a Professorship at Harvard, where our paths crossed 
again during my postdoctoral years. After finishing my PhD at Princeton in 1964, I 
spent the summer working at Bell Telephone Laboratories in Murray Hill, under the 
supervision of Phil Anderson and Dick Werthamer. However, aside from informal 
notes on the application of raising and lowering operators to the vortex structure in 
type II superconductors, my principal publication resulting from this final industrial 
summer job was a writeup of my work on PCAC consistency conditions, which I 
will discuss in the next chapter. 

References for Chapter 1 

Adler, S. (1961). A Generalized Ewald Method for Lattice Sums. Physica 27, 1193-1201. [1] 

Adler, S. L. (1962a) R2. Theory of the Valence Band Splittings at k = 0 in Zinc-Blende and 
Wurtzite Structures. Phys. Rev. 126, 118-122. [1] 

Adler, S. L. (1962b) R3. Quantum Theory of the Dielectric Constant in Real Solids. Phys. 
Rev. 126, 413-420. [2] 

Adler, S. L. (1963). Theory of the Range of Hot Electrons in Real Metals. Phys. Rev. 130, 
1654-1666. [2] 

Adler, S. L. (2004) Rl. From Elements of Radio to Elementary Particle Physics, in One Hun¬ 
dred Reasons to be a Scientist (The Abdus Salam International Centre for Theoretical 
Physics, Trieste), pp. 25-26. [1] 

Adler, S. and J. L. Birman (1961). An LCAO Theory of the k = 0,0,0 Valence Band 
Splittings in Zinc Blende and Wurtzite Structures. Bull. Am. Phys. Society Series II, 
Vol. 6, No.l, Part 1, p. 22. [1,2] 

Birman, J. L. (1959). Simplified LCAO Method for Zincblende, Wurtzite, and Mixed Crystal 
Structures. Phys. Rev. 115, 1493-1505. [1] 

Ehrenreich, H. and M. H. Cohen (1959). Self-Consistent Field Approach to the Many- 
Electron Problem. Phys . Rev. 115, 786-790. [2] 

Hopfield, J. J. (1960). Fine Structure in the Optical Absorption Edge of Anisotropic Crystals. 
J. Phys. Chem. Solids 15, 97-107. [1] 

Wiser, N. (1963). Dielectric Constant with Local Field Effects Included. Phys. Rev. 129, 
62-69. [2] 
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2. High Energy Neutrino Reactions, PCAC Relations, 

and Sum Rules 


Introduction 

By the end of my undergraduate years at Harvard (1957-1961), I had gone through 
most of the graduate course curriculum, as well as a senior year reading course orga¬ 
nized by Paul Martin for my classmate Fred Goldhaber and me. This course gave me 
an introduction to quantum field theory, or more precisely, to quantum electrody¬ 
namics, through some of the seminal papers appearing in the reprint volume edited 
by Schwinger (1958). Although as a result of my summer research jobs I could have 
gone on relatively easily to a PhD in solid state physics, I wanted to enter particle 
physics, and moreover wanted exposure to styles of theoretical physics different from 
those I had seen already at Harvard. Hence I decided on Princeton for my graduate 
work (with strong encouragement from Harvard faculty member Frank Pipkin, who 
was an enthusiastic Princeton graduate alumnus), and enrolled there in the fall of 
1961. 

My first year there was spent preparing for general exams, mostly by reading. I 
also participated in a seminar organized by the graduate students, which surveyed 
many aspects of dispersion relations and covered some topics in Feynman diagram 
calculations as well. The only formal course I took was one given by Sam Treiman, 
which gave an introductory survey to elementary particle physics. I was impressed by 
the clarity of his approach, and both because of this and because Murph Goldberger 
was planning a sabbatical leave the following year, I asked Treiman to take me on 
as a thesis student. 

This turned out to be a fortunate choice. Treiman proposed that I do a thesis in 
the general area of high energy neutrino reactions, which was just then emerging as 
an area of phenomenological interest. After doing a survey of the literature in the 
field, I first did a “preliminary problem” of calculating the final lepton and nucleon 
polarization effects in the quasielastic neutrino reaction U£ + N —> £ + N y with all 
induced form factors retained in the vector and axial-vector vertices (Adler, 1964a). 

I did this calculation in two ways, first by using the covariant form of the matrix 
element and Dirac 7 matrix algebra, then by using the center of mass form and 
Pauli matrix algebra, and directly checked the equivalence of the two forms of the 
answer. This convinced Treiman that I could calculate, and incidentally introduced 
me to the axial-vector current and coupling which were to be central to my work 
for many years. 
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After this calculation was completed, I decided to make the main focus of my the¬ 
sis a calculation of the simplest inelastic high energy neutrino reaction, that of pion 
production in the (3,3) or A(1232) resonance region. This problem had the appeal 
of having as a paradigm the beautiful dispersion relations calculation of pion pho¬ 
toproduction of Chew, Goldberger, Low, and Nambu (1957), which was one of the 
classics of the dispersion relations program. An extension to electroproduction had 
already been carried out by Fubini, Nambu, and Wataghin (1958), but they had 
done no numerical work, and on closer examination their matrix element turned 
out to be divergent at zero hadronic momentum transfer vq when the lepton four- 
momentum transfer squared denoted by q 2 (or k 2 ) is nonzero. There were similar 
problems (surveyed in my thesis) with the other papers then available dealing with 
pion electroproduction or weak production, so doing a complete and careful calcu¬ 
lation, including numerical evaluation of the cross sections, seemed a good choice of 
thesis topic. It was also a demanding one; although I wrote my thesis and got my 
degree in 1964, my goal of a complete calculation, including the necessary computer 
work, was not achieved until 1968. 

Much of the delay though, was a result of the fact that weak pion produc¬ 
tion turned out to be a marvelous theoretical laboratory for studying the impli¬ 
cations of conservation hypotheses for the weak vector and axial-vector currents, 
and this became a parallel part of my research program, as reflected in the title of 
my thesis “High Energy Neutrino Reactions and Conservation Hypotheses” (Adler, 
1964b). From Treiman and from my reading, I had learned about the Feynman- 
Gell-Mann (1958) proposal of a hadronic conserved vector current (CVC), and I 
had also learned about the Goldberger-Treiman (1958) relation for the charged 
pion decay constant, which they had discovered through a pioneering dispersion 
theoretic calculation of the weak vertex. A simplified derivation of this relation had 
already been achieved through the suggestion of Nambu (1960), Bernstein, Fubini, 
Gell-Mann, and Thirring (1960), Gell-Mann and Levy (1960), and Bernstein, Gell- 
Mann, and Michel (1960), that the axial-vector current is partially conserved, in the 
sense that the divergence of the axial-vector current behaves at small squared mo¬ 
mentum transfer as a good approximation to the pion field, or equivalently, is pion 
pole dominated. (Much later on, after contacts with China resumed, I learned that 
Chou (1960) had given a similar simplified derivation of the Goldberger-Treiman 
relation, as well as further applications to decay processes.) The partial conservation 
hypothesis was an appealing one, but as Treiman kept emphasizing, it was supported 
by “only one number” and therefore had to be regarded with caution. So a second 
goal of my thesis work ended up being to keep an eye out for other possible tests of 
the conservation hypotheses for the weak vector and axial-vector currents. 

Before going on to discuss how these emerged from my weak pion production 
calculation, let me first recall what I knew when I started the thesis work. The first 
chapter of the thesis (written in the spring of 1964) was a theoretical survey; in 
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the section headed “Partially Conserved Axial Vector Current (PCAC)” I referred 
only to the papers of Goldberger and Treiman, of Nambu, of Bernstein et ah, and 
of Gell-Mann and Levy cited in the preceding paragraph. In the final section of 
the first chapter, entitled “Survey of Computations Relating to Specific Reactions” 
there is the following reference to the paper of Nambu and Shrauner (1962), which 
was my reference 37: “An entirely different approach to weak pion production in 
the low pion-energy region has been pursued by Nambu and Shrauner . 37 These 
authors assume that the weak interactions are approximately 75 invariant (“chirality 
conservation”). They then obtain formulas for production of low energy pions, in the 
approximation in which the pion mass is neglected, in analogy with the treatment 
of low energy bremsstrahling (sic) in electron scattering.” At the time I started my 
calculations, neither Treiman nor I understood the relation between the Nambu- 
Shrauner work and the issue of partial conservation of the axial-vector current. This 
was partly because we were suspicious of the assumption of zero pion mass, and 
partly because the Nambu-Shrauner paper makes no reference to the axial-vector 
coupling so it was not clear whether their “chirality” was related to the weak 
currents I was studying in my thesis. This second point is particularly significant, 
and I will return to it in considerable detail below. I was not able to determine from 
my files (by finding either a reference in my notes or a Xerox copy) when I first read 
the Nambu-Lurie (1962) paper on which the Nambu-Shrauner paper was based, 
but it was probably a year later, in early 1965. 


Forward Lepton Theorem 

Roughly the first year and a half of my thesis work on weak pion production was 
spent mastering the formal apparatus of Lorentz invariant amplitudes (used for writ¬ 
ing dispersion relations) and center of mass multipole expansions (used for imple¬ 
menting unitarity) and the transformations between them, the Born approximation 
structure, cross section calculations, etc. Then in the winter of 1963-1964 or the 
spring of 1964 (I can only establish dates approximately by the sequence of folders, 
since I did not date them), I began noticing things that transformed a hard and 
often dull calculation into a very interesting one (just in the nick of time, since I 
was due to finish in June of 1964 and had already accepted a postdoctoral position 
at the Harvard Society of Fellows starting in the fall semester.) 

The first thing I noticed was that at zero squared leptonic four momentum trans¬ 
fer, my expression for the weak pion production matrix element reduced to just the 
hadronic matrix element of the divergence of the axial-vector current, which by the 
partial conservation hypothesis is proportional to the amplitude for pion-nucleon 
scattering. I then tried to abstract something more general from this specific ob¬ 
servation, and soon had a neat theorem showing that in a general inelastic high 
energy neutrino reaction, when the lepton emerges forward and the lepton mass is 
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neglected, the leptonic matrix element is proportional to the four momentum trans¬ 
fer; hence when the leptonic matrix element is contracted with the hadronic part, 
the vector current contribution vanishes by CVC, and the axial-vector current con¬ 
tribution reduces by partial conservation (for which I coined the parallel acronym 
PCAC, which has become standard terminology) to the corresponding matrix ele¬ 
ment for an incident pion. Thus inelastic neutrino reactions with forward leptons 
can be used as potential tests of CVC and PCAC; this became a chapter of my 
thesis and was written up as a paper (Adler, 1964c, R4) as soon as my thesis was 
completed. The paper on CVC and PCAC tests was the first of three papers in 
which I found connections between high energy neutrino scattering reactions and 
properties of the weak currents; the other two were my long paper on the qa sum 
rule, and a paper on neutrino reaction tests of the local current algebra, both of 
which are reprinted in this volume and will be discussed shortly. 

To determine whether the CVC/PCAC test could be implemented experimen¬ 
tally, I wrote a letter to the neutrino experimentalists at CERN. After a few months 
I received a charming reply from Gilberto Bernardini, who commented “The delay 
of this answer, for which I apologize very much, is due to two facts. The first is the 
known time diagram of the ‘modern physicist 5 . In case you do not know it yet, I plot 
it here: (Diagram with a vertical time axis and an upwards pointing arrow; ‘work 5 
at the bottom, ‘travel & meetings 5 in the middle, and ‘dinners & ceremonies 5 at the 
top.) Unfortunately, according to my age, I am already very much in the central 
region and even higher. 55 Bernardini then went on to say that Antonino Zichichi had 
brought my paper to his attention a couple of weeks before, and then continued with 
an analysis of technical problems in executing my proposal. There followed a further 
exchange of letters with Bernardini, with theorist John Bell, and with experimen¬ 
talists Guy von Dardel and Carlo Franzinetti. Of particular note, von Dardel wrote 
me a long letter after he read my paper, remarking that the care with which he 
read it was partly due to a skiing accident that had kept him in bed with a broken 
leg and nothing better to do, and giving a formula that he had worked out, during 
his enforced time away from experimental activities, for corrections to my theorem 
when the lepton emerges at a small angle to the forward direction. This formula 
turned out to be not quite right (there was an incorrect energy factor), but started 
me thinking about the issue, which I discussed with John Bell when I attended an 
Informal Conference on Experimental Neutrino Physics at CERN, January 20-22, 
1965. Bell had redone the calculation of the pion exchange contribution to the small 
angle correction by splitting the amplitude into spin-flip and non-spin-flip parts, 
getting a result that turned out also to be not quite right (there was a factor of 2 off 
in one term). When I got back to Harvard I repeated the calculation, according to 
my notes, by the “Bell method 55 , and also by a covariant method, and got a formula 
that I never published, but conveyed in letter of Feb. 10, 1965 to Bell (with copies 
to Bernardini, Block, von Dardel, Faissner, Franzinetti, and Veltman, most of whom 
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I had talked with when I was at CERN). The corrected small angle formula states 
that the first factor on the second line of Eq. (16) of R4 should be replaced by 
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with k 2 = m|/co/A: 2 o + & 1 O& 2 O 0 2 the leptonic four-momentum transfer squared and 
with 6 the lepton-neutrino polar angle, assuming that the lepton-neutrino azimuthal 
angle has been averaged over. 

Even before my visit to CERN, Bell (1964) had noted that when one considers 
my forward lepton formula in the context of nuclei, “the following difficulty presents 
itself: Because of absorption, pion cross sections depend on the size of large nuclei 
roughly as A 2 / 3 . But neutrinos penetrate to all parts of nuclei; for them cross sec¬ 
tions should contain at least a part proportional to A. This indicates for large nuclei 
a critical dependence of —q 2 ) on g 2 .” Bell proceeded to use optical model meth¬ 
ods to discuss this “shadowing effect”, which has continued to be of interest over 
the years. It took many years for my forward lepton formula, and Bell’s shadowing 
observation, to be experimentally verified; for a survey of the status of both, and 
further references, see the recent conference talk by Kopeliovich (2004). An earlier 
review of Mangano et ah ( 2001 ) also discusses the experimental status of shadow¬ 
ing, and a good exposition of the theory is given in the review of Llewellyn Smith 
(1972). For specific applications of the forward lepton formula to exclusive chan¬ 
nels, see Ravndal (1973) and Rein and Sehgal (1981) for A(1232) production, and 
Faissner et ah (1983) for coherent 7 r° production (which was used to determine the 
coupling strength of the isovector neutral axial-vector current). Also, Sehgal (1988) 
and Weber and Sehgal (1991) discuss an interesting analog of the forward lepton 
theorem for purely leptonic neutrino-induced reactions. 


Soft Pion Theorems 

Returning now to my thesis work in the spring of 1964, the second thing that I 
noticed, again working from my explicit expression for the weak pion production 
amplitude, was that when I imposed the PC AC condition at zero values of the 
hadronic energy variable v and the hadronic momentum transfer variable 1 / 3 , only 
the Born approximation pole term coming from the nucleon intermediate state con¬ 
tributed; all of the model dependent parts of the weak amplitudes dropped out. 
Thus I got what I called a “consistency condition” on the pion-nucleon scattering 
amplitude implied by PCAC, taking the form 

g 2 /M = A nN M(u = 0 ,v B = 0, k 2 = o )/K NNn (k 2 = 0), 

with g r the pion-nucleon coupling constant, M the nucleon mass, —k 2 the squared 
mass of the initial pion (the final pion is still on mass shell), and with K NNn (0) the 
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pionic form factor of the nucleon, normalized so that K NNn (—M%) = L This seemed 
absolutely remarkable, and I immediately proceeded to do a dispersion relation 
evaluation of the pion-nucleon amplitude on the right, using the Roper (1964) phase 
shift analysis as input, and assuming that the effects of off-shell continuation in 
A n iV(+) ^ we n as j n j^NNn^ were sma p j n setting up this calculation, I used several 
theoretically equivalent ways of writing the subtracted dispersion relation to get an 
estimate of the errors in the analysis. The Christenson-Cronin-Fitch-Turlay (1964) 
experiment on CP violation had a substantial block of computer time reserved for 
analysis, and courtesy of them I was able to use a small amount of their time to 
run my programs, a few days before I was scheduled to give a talk at Columbia. I 
recall staying up all night to get the job done, and at one point, in the wee hours 
of the morning, dropping my deck of cards and then having to spend precious time 
getting them back in the proper order. But I did get my calculation done by morning 
(and never again attempted an “all-nighter”.) The relation worked very well, and 
as Treiman later said, “now there is a second number”; PCAC was starting to look 
interesting. This work became the final chapter of my thesis. 

Immediately after finishing my thesis I took a summer job at Bell Laborato¬ 
ries at Murray Hill, nominally working for Phil Anderson. I wanted to learn about 
superconductivity, and Phil assigned me to work for Dick Werthamer. I did learn 
about the BCS and Ginzburg-Landau theories, and Abrikosov vortices in type-II 
superconductors, but I did not succeed in my project with Dick, which was to try 
to understand the resistance to vortex line motion using thermal Green’s functions. 
A few weeks before the end of the summer, I asked for and got Phil’s permission to 
spend some time writing a paper on the pion-nucleon consistency condition (Adler, 
1965a, R5), which I also then extended to pion-pion and pion-lambda scattering. In 
the pion-pion case, since there are no pole terms, the consistency condition takes the 
form that the pion-pion scattering amplitude with one zero mass pion, evaluated at 
the symmetric point s = t = u = M^, is zero. This was the first example of a soft 
pion zero or, as termed in the literature, “Adler zero”, in non-baryonic amplitudes, 
that I will return to shortly. Knowing that I was planning to go on in particle theory, 
Phil told me one day that he had an interesting paper to show me, which had just 
been submitted to the journal Physics which he was editing. It was Gell-Mann’s 
(1964) paper on current algebra; Phil let me read it, but not Xerox it. This was to 
prove decisive for my work on sum rules nine months later. My interactions with 
Phil however were brief, and never touched on the subject of symmetry breaking in 
superconductivity and particle physics, on which Phil had written a paper (Ander¬ 
son, 1963) that I learned of only many years later, that was a forerunner of work on 
the “Higgs mechanism” for giving masses to vector bosons. 

In the fall of 1964 I moved to Harvard as Junior Fellow in the Society of Fel¬ 
lows, sharing a postdoc office next to the office occupied by Henry Ehrenreich in 
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the Applied Physics division. (Henry had recently left General Electric to accept 
a Professorship at Harvard.) In principle I was going to do solid state physics as 
well as particle theory, but that never happened. I spent the fall term working on 
numerical aspects of my weak pion production calculation, and also reading papers 
on attempts to calculate the axial-vector renormalization constant g a, including the 
papers of Gell-Mann and Levy (1960) and Bernstein, Gell-Mann and Michel (1960). 
I had a hunch that the fact that gA is near one was somehow connected with PC AC, 
but I did not see a concrete way of exploiting PCAC in a calculation. I also was 
starting to think about how to make the PCAC consistency condition calculations 
independent of the cumbersome Lorentz invariant amplitude apparatus that I had 
used to get them. I soon found that the relevant terms could be isolated directly from 
the Feynman diagrams without invoking all the formal kinematic apparatus of my 
thesis, and this approach extended to a general matrix element as well; the strategy 
was the same one that I had used in the paper on CVC and PCAC tests, of going 
from a particular observation in the context of my weak pion production calculation 
to something more general. The result was a formula for soft pion production, in 
terms of external line insertions on the hadronic amplitude for the same process 
in the absence of the pion (Adler, 1965b, R6). For baryons of nonzero isospin, the 
insertion factors are nonzero, while for isospin zero baryons, and mesons such as the 
pion or kaon, the insertion factor vanishes. This latter result generalizes the soft-pion 
zero or “Adler zero” to the emission of a soft pion in any reaction involving only 
incoming and outgoing mesons, but no external baryons. These zeros continue to 
play a role in the analysis of experimental results on mesonic resonances; for recent 
discussions, see Bugg (2003, 2004) and Rupp, Kleefeld, and van Beveren (2004). 

The soft pion zeros are an indication that according to PCAC, the pion cou¬ 
pling to other hadrons is effectively pseudovector, and not pseudoscalar. When I 
visited CERN in late January of 1965, while in the midst of work on the Feynman 
diagram approach to the PCAC consistency conditions, I found that Veltman had 
been thinking in a similar direction, but had not reached the point of writing down 
external line insertion rules. Veltman gave me a one page memo to file that he had 
written, which pointed out that my PCAC consistency conditions are equivalent to 
pseudovector coupling, which implies the vanishing of invariant amplitudes for soft 
pion emission after singular terms are split off. Veltman also noted that Feynman 
had briefly remarked on the relation between the Goldberger-Treiman relation and 
pseudovector coupling in his conference summary talk at Aix-en-Provence in 1962, 
and gave me a copy of the relevant page. Feynman did not, however, report agree¬ 
ment with experiments on pion-nucleon scattering, apparently because he did not 
recognize the necessity of splitting off the singular Born terms before concluding 
that pion emission amplitudes vanish in the soft pion limit. 

In the course of my work on the insertion rules I remembered the paper of Nambu 
and Shrauner (1962) which I had briefly mentioned in the Introduction to my thesis; 
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I now looked this up, as well as the paper of Nambu and Lurie (1962) on which it 
was based, and saw that my final formula, when specialized to the case of an ingoing 
and outgoing nucleon line, was substantially the same as the pion bremsstrahlung 
formula of Nambu and Lurie. I noted this in my paper, and consistently referred to 
the Nambu papers from this point on. In recognition of Nambu’s work, I used his 
notation x and term “chirality” to refer to the integrated axial-vector charge in my 
next two papers, which dealt with the sum rule; however, in modern terms this 
is a misnomer, since chirality is now used to mean the left- or right-handed sums of 
vector and axial-vector charges. Gell-Mann’s notation for the axial-vector charges 
has become the standard one, and after these two papers I followed the Gell-Mann 
notation. 

The comparison with Nambu’s approach also raised the issue of the role of the 
pion mass: do the PCAC results limit smoothly to the zero pion mass ones, for which 
the soft pion theorem derivations appear quite different? This point was dealt with in 
footnote 6 of my paper R6, where I showed that the limits, (1) pion mass approaches 
zero, and (2) pion four momentum transfer squared approaches zero, can be taken 
in either order; the same soft pion theorem results, although the contribution which 
comes from the massless pion pole when the limit (1) is taken first, comes instead 
from the axial-vector divergence when the limit (2) is taken first. This point is now 
taken for granted, but in the early years it caused me (and others) considerable 
confusion. After this paper I almost immediately got involved with sum rules, and 
so I did not publish the detailed connection between my second PCAC paper and 
the Nambu-Lurie approach until a few years later, when I included it as “Appendix 
A” of Chapter 2 of the book on Current Algebras which I put together with Roger 
Dashen (Adler and Dashen, 1968). This appendix is reprinted here as R7. At the 
end of Appendix A, I again discussed the relationship between the zero pion mass 
and nonzero pion mass calculations. The analysis of Appendix A also shows how the 
PCAC approach to soft pion theorems that I had developed fixes the undetermined 
renormalization constant appearing in the chirality approach of Nambu-Lurie. In 
the formulas of Appendix A, there are factors of gA that are missing in the formulas 
of the papers of Nambu, Lurie, and Shrauner. Correspondingly, in the paper of 
Nambu and Lurie, in the discussion associated with their Eq. (2.7), they noted that 
a renormalization constant Z appears, but didn’t observe that this can be precisely 
identified as gA- Instead, they redefined their chirality as Z~ l x-> that is as (gA)'~ 1 X- 
They then made a compensating adjustment in the pion decay constant in their Eq. 
(4.5), where they dropped the gA factor which appears in the Goldberger-Treiman 
relation. Nambu and Lurie say there, “1/A is more or less the conventional pion 
coupling constant 1/A = / = g/2m. (4.5) It is not proven, however, that this agrees 
with the coupling constant defined in the dispersion theory. For the time being, we 
assume it to be the case.” In the subsequent paper of Nambu and Shrauner (1962), 
an identification of / with the standard pion-nucleon coupling was established, but 
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the issue of where to include factors of gA was not addressed. My impression from 
this was that there was some uncertainty in the minds of Nambu and his students 
about how the chirality is to be normalized, and this impression was reinforced 
by a conversation I later had with Nambu about their work and my Appendix A 
derivation of their result. 

In the low energy theorem for one soft pion which Nambu and Lurie had derived, 
and which I had obtained from PCAC and the Feynman rules in my second PCAC 
paper, the gA factor drops out, and so the normalization of the axial-vector charge or 
“chirality” is irrelevant. The applications discussed in the papers of Nambu, Lurie, 
and Shrauner all involved only one soft pion; Nambu and Lurie looked, for example, 
at7r+jV —> n + N + n, with the final pion soft but with the other pions “hard”; in 
fact, what they actually did was to calculate single soft pion emission in the reaction 
7r + N —> A(1232). Similarly, Nambu and Shrauner (1962) analyzed single soft pion 
electroproduction and weak production, relating them to the form factors of the 
vector and axial-vector currents. In this paper they included current commutator 
terms by analogy with the classic Low (1958) paper on bremsstrahlung; their answer 
for electroproduction is correct because the gA factor drops out there anyway, but 
their answer for weak axial-vector production lacks gA factors in places, for reasons 
explained in the next paragraph. A follow-up paper of Shrauner (1963) dealt with 
single soft pion production in pion-nucleon scattering, with the scattering pions 
“hard”. My “PCAC consistency condition” was likewise a single soft pion theorem 
which gives a relation between the amplitude for n + N —> N + 7r, with the final 
pion soft, and the amplitude n + N —> N, which is just the pion-nucleon coupling 
constant, and involves no factors of gA- 

The factors of gA and the explicit identification of the “chirality” with the charge 
associated with the axial-vector current become important, however, if one wants to 
discuss multiple soft pion production, and also weak axial-vector pion production, 
since one then encounters commutators of an axial-vector charge with an axial-vector 
charge or current, which are evaluated by the Gell-Mann current algebra. If one 
defines the relevant chirality as (<7 a) - 1 times the axial-vector charge, as is implicit 
in the Nambu-Lurie paper when one identifies their Z with gA-, then the relevant 
commutator is (#a)~ 2 times a vector charge, which at zero momentum transfer just 
gives (#a)~ 2 - This is in fact the origin of the (#a)~ 2 term in the gA sum rule, where 
the difference between (#a)~ 2 and 1 is highly significant. The point, then, is that 
while Nambu and Lurie gave a correct formula for single soft pion production, it in 
fact cannot be generalized to multiple soft pion production (or soft pion production 
by the weak axial-vector current) without first dealing carefully with the question 
of normalization, as I did in my second PCAC paper R6 and in Appendix A of the 
book on current algebras R7. 

Another difference between the work of Nambu and his students, and what I 
did in my first PCAC consistency condition paper R5, related to the method of 
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comparison with experiment, and the level of accuracy claimed for soft pion pre¬ 
dictions. The Goldberger-Treiman relation is good to about 7% accuracy, and my 
comparison of the PCAC consistency condition with experiment also indicated that 
the relation was satisfied to about 10%, thus reinforcing the idea that PCAC could 
be used as a quantitative tool for studying the strong interactions, with the residual 
errors arising from the extrapolation of the pion four-momentum squared k 2 from 
to 0. Given that the pion mass is much smaller than all other hadron masses, an 
extrapolation error 0.1 is reasonable. The success of the gA sum 

rule shortly afterwards gave further support to the idea that PCAC gives quanti¬ 
tatively accurate predictions. Nambu, Lurie, and Shrauner, however, argued only 
for qualitative agreement between their soft pion results and experiment based on 
comparisons of rescaled angular distributions, but did not find anything close to 
~ 10% agreement for absolute cross sections. For example, for the relation between 
the cross sections for pion-nucleon scattering with production of an additional pion, 
and pion-nucleon scattering, Nambu and Lurie (1962) showed agreement with their 
predictions to within roughly a factor of three (giving a predicted cross section of 
0.2 mb versus experimental values in the range 0.6 to 0.7 mb). Similarly, for the 
same reaction Shrauner (1963) found that “the magnitudes of the cross sections 
seem to be significantly underestimated by a factor of about 7”. The source of these 
discrepancies is not clear. They may be due, in part, to the fact that, instead of 
testing the soft pion predictions at the kinematic point of zero pion four momentum 
(such as the point u = ug = 0 used in my PCAC consistency condition work), 
Nambu, Lurie, and Shrauner did the comparisons in energy intervals above scat¬ 
tering threshold. (However, Shrauner argues, on the basis of branching ratios,that 
the discrepancy is probably not attributable to an overlap of the A(1232) resonance 
with the comparison region.) I think that a combination of lack of clarity about how 
their chiral current was related to the physical axial-vector current, as reflected in 
the normalization problems noted above, together with the lack of striking quanti¬ 
tative comparisons with experiment, were responsible for the work of Nambu and 
his students being largely unnoticed by the community. It was only after the quan¬ 
titative successes of PCAC in my consistency condition paper and in the gA sum 
rule that followed shortly afterwards, and my demonstration of the equivalence be¬ 
tween the PCAC insertion rules and the chirality conservation approach, that the 
significance of the work of the Nambu group became clear. 

Finally, as an historical footnote to this discussion of soft pion theorems, Tou- 
schek (1957) appears to have been the first to introduce continuous 75 symmetry 
transformations, as applied to the neutrino field, and to observe that invariance un¬ 
der these transformations requires that the neutrino mass be zero. Nishijima (1959) 
(in work submitted for publication in late 1958) considered continuous 75 symmetry 
transformations in theories of massive fermions; to preserve 75 invariance he gauged 
the transformations with a massless pseudoscalar boson, transforming as B —> B + A 
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under a 75 transformation with parameter A. The action written in Nishijima’s pa¬ 
per is just the effective action one would now write for a singlet Nambu-Goldstone 
boson (such as an axion) coupled to a massive fermion. Nambu (1959), in remarks 
at the Kiev Conference, noted the analogy between 75 symmetry in particle physics 
and gauge invariance in superconductivity, and related this to his suggestion that 
a nucleon-antinucleon pair in a pseudoscalar state could be the pion. This idea was 
further developed in the well-known paper Nambu and Jona-Lasinio (1961), that 
laid the basis for the modern theory of Nambu-Goldstone bosons associated with 
spontaneous symmetry breaking, and for the fact that most of the mass of the nu¬ 
cleon comes from chiral symmetry breaking. In the meantime, Gursey (1960) had 
introduced isovector 75 transformations, as an extension of the similar isoscalar 
transformations used by Nishijima, and constructed a precursor to nonlinear pion 
effective Lagrangians. These papers all contained important seeds of our present-day 
understanding of chiral symmetries. 


Sum Rules 

I have now gotten ahead of the chronological story; a lot of things happened very 
fast in 1965. In the fall of 1964 I started thinking about the question of the renor¬ 
malization of the nucleon axial-vector coupling and accumulated a file of papers 
on the subject. However, my attempts at a calculation were based on the commu¬ 
tator of the nucleon field with the weak axial-vector charge, giving results identical 
to those already obtained by Bernstein, Gell-Mann, and Michel (1960), which ex¬ 
pressed gA in terms of unmeasurable off-shell form factors, but achieving no further 
progress. In early 1965 I saw a preprint of Fubini and Furlan (published as Fubini 
and Furlan, 1965) which applied the commutator of vector current charges, together 
with the ingenious idea of going to an infinite momentum frame, to calculate the 
radiatively induced renormalization of the vector current. (Harvard did not have a 
preprint library in those days, but Schwinger’s secretary Shirley would let me into 
his office from time to time to look through the unread preprints that were stacked 
on his desk. This presented no difficulty since Schwinger was a night-owl who mainly 
worked at home, and used his office only a few hours a week, when he came in to 
lecture and to see students. That is how I became aware of the Fubini-Furlan pa¬ 
per. As a result of this experience, one of the first things I did when I arrived at 
the Institute for Advanced Study eighteen months later was to start a preprint li¬ 
brary for the particle physicists.) I immediately thought about applying this to the 
axial-vector current, using the Gell-Mann current algebra that Pd seen the previous 
summer at Bell Labs. However, because of other things I was working on I didn’t get 
around to it until a few months later, when in a chance encounter Arthur Jaffe told 
me that he had heard a talk by Roger Dashen about work he and Gell-Mann had 
been doing on sum rules. I decided I had better stop delaying (although it turned 
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out that Dashen and Gell-Mann were working on fixed momentum transfer sum 
rules), dropped my weak pion production computer work, and spent spring break 
working out the consequences of combining the Gell-Mann current algebra, PCAC, 
and the Fubini-Furlan method. It turned out to be surprisingly easy, with the infi¬ 
nite momentum frame solving a problem I had encountered in earlier attempts to 
calculate g a, which is that the axial-vector charge matrix element is proportional to 
the nucleon velocity, and vanishes for nucleons at rest. I soon had a formula relating 
the difference between 1 and (#a )~ 2 to a convergent integral over a difference of 
pion-nucleon cross sections, 



4 M 2 N 1 

g 2 K NN *( 0) 2 t r 


^OO 

Mjv + Mtt 


WdW 
W 2 -Ml 






with M n and M/v the pion and nucleon masses, o-^(W) the total cross section for 
scattering of a zero-mass on a proton at center-of-mass energy W t and again with 
K NNn (0) the pionic form factor of the nucleon, normalized so that K NNn (—M%) = 
1. I first tried to saturate the integral in the narrow A(1232) approximation, and 
the result was a disappointing = 3. I then pulled out the computer deck I 
had used for the consistency condition numerical work the previous year, did the 
integral carefully, and got g& = 1-24. I also observed that the relation for gA could 
be equivalently recast as a two-soft pion low energy theorem, 



-2 M 2 n 
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Here and B nN ^ are the isospin-odd pion-nucleon scattering amplitudes, 

v and uq are again the energy and momentum transfer variables, and M^ are 
the initial and final pion masses, which are now both off shell. A few days after I 
submitted a letter to Physical Review Letters , Sidney Coleman returned from a trip 
to SLAC and when I described my results to him, he told me that he had just heard 
about a similar calculation being done there by Bill Weisberger, whose points of 
departure were the same as mine: the Gell-Mann current algebra, the Fubini-Furlan 
paper, and my paper on PCAC consistency conditions. I talked to Weisberger by 
phone, and then called PRL and asked them to delay publication of my letter until 
they received the manuscript Weisberger was preparing. My paper (Adler, 1965c, 
R8) and Weisberger’s (Weisberger, 1965) appeared as back-to-back letters in the 
June 21 issue. They give substantially identical derivations; Weisberger’s numerical 
result of 1.16 differed from mine of 1.24 because I had included a correction for the 
off-pion-mass-shell extrapolation of the threshold phase space factor associated with 
the A(1232) resonance, which I knew from my work on weak pion production could 
be reliably estimated. At the time, this correction made agreement with experiment 
worse (the experimental value for gA was then 1.18), but the best value now has 
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settled down to = 1.257 d= .003, in gratifyingly good agreement with the value I 
got when I included the kinematic extrapolation correction. Weisberger and I both 
submitted longer papers to Physical Review describing our work (Adler, 1965d, 
R9); Weisberger (1966). These emphasized the low energy theorem approach to 
the relation for g a, giving historically the first two-soft pion low energy theorem. 
In my paper I also gave an analog for pion-pion scattering, and then in the final 
section (Adler, 1965, R9, Section V), I returned to the observation that I had made 
a year earlier about forward lepton scattering, and showed that the g& sum rule 
could be converted to an exact relation, involving no off-shell PCAC extrapolation, 
for forward inelastic high energy neutrino reactions. This relation, which provided 
a test of the Gell-Mann current algebra of axial-vector charge commutators, was 
another indication of a deep connection between the structure of currents on the 
one hand, and inelastic lepton scattering on the other. 

The <7 a sum rule provided yet a third result supporting the use of PCAC as a 
method for calculating soft pion processes. Simultaneously, it was a stunning success 
for Gell-Mann’s brilliant idea of abstracting the current algebra from the naive quark 
model, with the hope that it would prove to be a feature that would also be valid 
in the then unknown theory of the strong interactions. At this point the whole 
community took notice, and a string of current algebra/PCAC applications appeared 
in rapid succession. To mention just a few, Weinberg (1966a) and Tomozawa (1966) 
reexpressed the soft pion theorems for pion-nucleon scattering, coming from my 
consistency condition papers and the g& sum rule papers, in the form of formulas 
for the pion-nucleon scattering lengths, and Weinberg in the same paper also used my 
result of a PCAC zero in pion-pion scattering, plus a symmetry argument, as inputs 
for a derivation of pion-pion scattering lengths. Weinberg (1966b) also generalized 
the two-soft pion low energy form of the g& sum rule to a general formula for multiple 
soft pion production. Finally, in another striking application of soft pion theorems, 
Callan and Treiman (1966) gave a series of important results for K meson decays, 
in which the role of rapidly varying pole terms was clarified in Weinberg (1966c). 

In connection with the g& sum rule, I have an interesting Feynman anecdote 
to relate. I spent the spring term of 1966 as a member of Murray Gell-Mann’s 
postdoctoral group at Cal Tech. A few weeks after I arrived, Feynman asked me to 
stop by his office to look at some pages in his notebook, in which he had almost 
derived the g& sum rule, before Weisberger and I did it. The whole expression was 
there (including the kinematic correction that I had included for the off-mass-shell 
extrapolation), except that, where the Gell-Mann algebra had dictated a 1 coming 
from the commutator of two axial-vector charges giving an unrenormalized vector 
charge, Feynman had put 0! So numerically the relation did not work, and Feynman 
had given up on it and gone on to other things. He evidently had not paid attention 
to Gell-Mann’s current algebra, or at least not realized, from his heuristic way of 
doing things, that it was essential for this calculation. 
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Returning again to events in 1965, as soon as the long paper on g a was com¬ 
pleted, I departed to be a summer visitor at CERN. There I met Murray Gell-Mann 
for the first time, and had long conversations with him. Murray was particularly 
interested in the Section V relation between the current algebra of vector and axial- 
vector charges and forward high energy neutrino reactions, and urged me to try to 
extend it to a test of the local current algebra which he had given in his Physics 
paper (Gell-Mann, 1964). I spent the summer working on this, and found that I 
could do it; as I recall, the crucial bits came together when I spent a day work¬ 
ing at a kitchen table during a week off for holiday at Lake Garda. The results 
were written up in the late summer of 1965 at CERN and/or Harvard, and ap¬ 
peared in Adler (1966), RIO. This article gave the first detailed working out of the 
structure of deep inelastic high energy neutrino scattering (the electroproduction 
case was given independently in the review of de Forest and Walecka (1966)), with 
both the electroproduction and neutrino cases specific examples of general local 
lepton coupling theorems given by Lee and Yang and by Pais, as referenced in my 
article RIO. However, the a,/3, 7 notation that I used for the structure functions 
did not become the standard one; the now standard structure functions, 

which follow the notation of de Forest and Walecka and were further popularized 
by Bjorken, are linearly related to the ones I used. [Specifically, I separated the 
cross section into strangeness-conserving and strangeness-changing pieces, whereas 
the current convention is to define the structure functions as the sum of both. At 
zero Cabibbo angle, the relation between my a,/3, 7 and the conventional is 

a = Wi, /3 = H 2 , 2 M/V 7 — W 3 , with M/v the nucleon mass. For general Cabibbo 
angle 6c , one has cos 2 + sin 2 Oc/3 |as|"=i = w ^h similar relations for 

the other two structure functions.] The article actually gave three sum rules; two for 
the a and 7 structure functions which subsequent analysis by Dashen showed to be 
divergent and hence useless, and one for the /3 deep inelastic amplitude which is a 
convergent and useful relation. The beta sum rule divides into axial-vector and vec¬ 
tor parts, which are separately given as Eqs. (53a) and (53b) respectively of Adler 
(1966), RIO, and which when added to give the total A S = 0 cross section yield 

roc TT7 

2 = g A (q 2 ) 2 + F^(q 2 ) 2 +q 2 F^(q 2 ) 2 + / — dW^~\q 2 , W) -/?(+> (q 2 , WQ] . 
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This sum rule (and the ones for the separate vector and axial-vector contributions) 
has the notable feature that the left-hand side is independent of g 2 , even though the 
Born term contributions and the continuum integrand on the right are g 2 -dependent. 
At zero squared momentum transfer g 2 , the axial-vector part of the /3 sum rule 
reduces to the relation I gave in my long paper on g a, which had prompted Gell- 
Mann^ question about a generalization; the first derivative of the vector part with 
respect to q 2 at q 2 = 0 gives the sum rule also derived by Cabibbo and Radicati 
(1966) using moments of currents. Because the neutrino and antineutrino differential 
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cross sections d 2 a/d(q 2 )dW are dominated by the /3 structure function in the limit 
of large neutrino energy, by integrating over W one gets the limiting cross section 
relation (at zero Cabibbo angle) 

I - da(V + p) da(u + p)~i _ G 2 

E [ ™oc [ d{q 2 ) d{q 2 ) \ ~ tt ’ 

with G the Fermi constant. Similar relations at non-zero Cabibbo angle are given 
in Eq. (27) of RIO, and it is easy to obtain analogous relations for the vector and 
axial-vector contributions to the cross sections taken separately. 

In late October of 1965 I spoke on “High Energy Semileptonic Reactions 11 at the 
International Conference on Weak Interactions held at Argonne National Labora¬ 
tory (Adler, 1965e), in which I gave the first public presentation of the local current 
algebra sum rules for the /3 deep inelastic neutrino structure functions, and the lim¬ 
iting relations for the differential cross sections that they imply. In the published 
discussion following this talk, in answer to a question by Fubini, I noted that the /3 
sum rule had been rederived by Callan (unpublished) using the infinite momentum 
frame limiting method, but that the a and 7 sum rules could not be derived this 
way, reinforcing suspicions that “the integral for /3 is convergent, while the other 
two relations (for a and 7 ) really need sub tractions . 11 Bjorken was in the audience 
and was intrigued by the /3 sum rule results, and soon afterwards converted them 
into a differential cross section inequality (Bjorken, 1966, 1967) for deep inelastic 
electron-nucleon scattering, for which there was the prospect of experimental tests 
relatively soon. To see why the neutrino cross section relation given above implies 
an inequality for electron scattering, one notes that since the v + p differential cross 
section is positive, the right-hand side G 2 /n gives a lower bound for the V + p dif¬ 
ferential cross section, with a similar lower bound holding for the vector current 
contribution alone. But noting that according to CVC, the vector weak current is 
in the same isospin multiplet as the isovector part of the electromagnetic current, 
and using the Wigner-Eckart theorem, one gets a corresponding lower bound for the 
inelastic differential cross section induced by an isovector virtual photon scattering 
on a nucleon. One then notes that in the scattering of a virtual photon on a target 
containing equal numbers of neutrons and protons, the isovector and isoscalar cur¬ 
rents add incoherently, and so the isovector current contribution alone gives a lower 
bound. Combining the two bounds, and including an extra l/(k 2 ) 2 for the virtual 
photon propagator, replacing G by the fine structure constant a, and keeping track 
of numerical factors, one gets Bjorken’s electron scattering result 

d[a(e + p) + cr(e + n)] 2na 2 

£ ™oo d(W) > (Fj 2 ’ 

which was testable in the experiments soon to begin at SLAC. Verification of my 
neutrino sum rule, on the other hand, took two decades and more; see Allasia et 
al. (1985) for the first reported test, and Conrad, Shaevitz, and Bolton (1998) for 
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more recent high precision results. For a recent study of my neutrino sum rule, in 
comparison with the Gottfried (1967) sum rule for electron-proton scattering, within 
the framework of the large N c expansion of QCD with N c colors, see Broadhurst, 
Kataev, and Maxwell (2004) and Kataev (2004). 

Although not directly tested until many years after it was derived in 1965, my 
neutrino sum rule had important conceptual implications that figured prominently 
in developments over the next few years. To begin with, it gave the first indica¬ 
tions that deep inelastic lepton scattering would give information about the local 
properties of currents, a fact that at first seemed astonishing, but which turned out 
to have important extensions. Secondly, as noted by Chew in remarks at the 1967 
Solvay Conference (Solvay, 1968), the closure property tested in the sum rules, if 
verified experimentally, would suggest the presence of elementary constituents in¬ 
side hadrons. In a Letter (Chew, 1967) published shortly after this conference, Chew 
argued that my sum rule, if verified, would rule out the then popular “bootstrap” 
models of hadrons, in which all strongly interacting particles were asserted to be 
equivalent (“nuclear democracy”). In his words, “such sum rules may allow con¬ 
frontation between an underlying local spacetime structure for strong interactions 
and a true bootstrap. The pure bootstrap idea, we suggest, may be incompatible 
with closure.” In a similar vein, Bjorken, in his 1967 Varenna lectures (Bjorken, 
1968), argued that the neutrino sum rule was strongly suggestive of the presence 
of hadronic constituents, and this was also noted in the review of Llewellyn Smith 
(1972). 

These conceptual developments still left undetermined the mechanism by which 
the neutrino sum rule, and Bjorken’s electron scattering inequality, could be sat¬ 
urated at large q 2 . During my visit to Cal Tech in 1966, I renewed my graduate 
school acquaintance with Fred Gilman and worked with him on two projects. One 
was an analysis of the saturation of the neutrino sum rule for small q 2 (Adler and 
Gilman, 1967, Rll), in which we concluded that SLAC (soon to start operating) 
would have enough energy to confront the saturation of the nonzero q 2 sum rules 
in a meaningful way. In this paper, we noted that the /3 sum rule posed what at 
the time was a puzzle: the left-hand side of the sum rule is a constant, while the 
Born terms on the right are squares of nucleon form factors, which vanish rapidly as 
the momentum transfer q 2 becomes large. The low lying nucleon resonance contri¬ 
butions on the right were expected to behave like the A(1232) contribution, which 
is form factor dominated and also falls off rapidly with q 2 . Hence it was clear that 
something new and interesting must happen in the deep inelastic region if the sum 
rule were to be satisfied for large q 2 : “to maintain a constant sum at large q 2 , the 
high W states, which require a large E to be excited, must make a much more 
important contribution to the sum rules than they do at q 2 = 0”. We were cau¬ 
tious, however (too cautious, as it turned out!), and did not attempt to model the 
structure of the deep inelastic component needed to saturate the sum rule at large 
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q 2 . Bjorken became interested in the issue of how the sum rule could be saturated, 
and formulated several preliminary models that (in retrospect) already had hints of 
the dominance of a regime where the energy transfer v grows proportionately to the 
value of q 2 . I summarized these pre-scaling proposals of Bjorken in the discussion 
period of the 1967 Solvay Conference (Solvay, 1968), which Bjorken did not attend, 
in response to questions from Chew and others as to how the neutrino sum rule 
could be saturated. The precise saturation mechanism was clarified (to a very good 
first approximation) some months after the Solvay conference with the proposal by 
Bjorken (Bjorken, 1969) of scaling, and soon afterwards, with the experimental work 
at SLAC on deep inelastic electron scattering, that confirmed Bjorken’s intuition. 
For a very clear exposition of the relation between scaling and the neutrino sum 
rule, see Sec. 3.6B of Llewellyn Smith (1972), who notes that when the sum rule is 
rewritten in terms of Bjorken’s scaling variable a;, “The simplest way to ensure the 
Q 2 [my q 2 ] independence of the left-hand side as Q 2 —> oo is to assume that the 
limit in eq.(3.71) [in my notation, limQ 2 _, oo wfixec i f3( ± )(u; > Q 2 /Mx)\ exists”. 


More Low Energy Theorems; Weak Pion Production Redux 

In the fall of 1965 I received an invitation from Oppenheimer, which I accepted, to 
come to the Institute for Advanced Study as a long term member with a five year 
appointment, starting in the fall of 1966. Roger Dashen, whom I had met briefly 
when he visited Harvard earlier in 1965, received a similar invitation. The intent 
behind our appointments was that we would reinvigorate high energy theory at the 
Institute, which had fallen into a decline with the departures of Lee, Yang, and Pais 
to professorships elsewhere, and with a turn of Dyson’s research interests towards 
astrophysics. 

Before going to Princeton, as mentioned above, I spent the spring term of 1966 
as a postdoc in Murray Gell-Mann’s group at Cal Tech. By this time the successes of 
PC AC and current algebra had attracted a lot of attention and stimulated an out¬ 
pouring of papers, the more important ones of which appear in the volume which 
Dashen and I put together a year later. My own work in the spring of 1966 was 
focused on two issues. The first involved using PCAC to get small momentum ex¬ 
pansions of matrix elements of the axial-vector current, in analogy with the paper of 
Low (1958) on soft photon bremsstrahlung. With Joe Dothan, I wrote a long paper 
(Adler and Dothan, 1966, R12) applying these ideas to the weak pion production 
amplitude and to radiative muon capture. The weak pion results figured in my later 
comprehensive paper on the subject (see below), while the radiative muon capture 
work was incorporated into later chiral perturbation theory treatments of radiative 
muon capture; for a review of the current theoretical and experimental status of 
muon capture, including a discussion of discrepancies between theory and exper¬ 
iment in the radiative capture case, see Gorringe and Fearing (2004). The other 
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direction of work involved two phenomenological studies done with Fred Gilman. 
One of these dealt with saturation of the neutrino sum rule, as described in the pre¬ 
ceding section. The other dealt with a detailed phenomenological study of the PCAC 
predictions for pion photo- and electro-production (Adler and Gilman, 1966, R13), 
including a saturation analysis for the Fubini-Furlan-Rossetti (1965) sum rule; for 
a recent update on this, see Pasquini, Drechsel, and Tiator (2005). 

My first year at the Institute was largely devoted to writing the book on Current 
Algebras with Roger Dashen (Adler and Dashen, 1968). The book consisted of se¬ 
lected reprints grouped by categories with commentaries that we supplied, plus some 
general introductory material. I was responsible for writing the introductory sections 
and the commentaries for Chapters 1-3, which included Appendix A, reprinted here 
as R7. Roger was responsible for the commentaries for Chapters 4-7, which included 
an original and very detailed analysis of precisely which sum rules could be derived 
by the infinite momentum frame method, or in different language, when a naive 
assumption of unsubtracted dispersion relations would (and would not) give correct 
results. This analysis confirmed earlier suspicions that my /3 neutrino sum rule was 
correct, but that the a and 7 sum rules should have subtractions, and so were not 
useful. The book on Current Algebras was completed, and sent off to the publisher, 
in the fall of 1967. 

During this period I also worked with Bill Weisberger, who was then at Prince¬ 
ton, on sorting out the tricky pion pole structure in two pion photo- and electro¬ 
production, which had to be handled carefully to get a fully gauge-invariant ex¬ 
pression (Adler and Weisberger, 1968, R14). Our interest in this process, as noted 
in the title of the paper, was motivated by the fact that it gives an alterna¬ 
tive, indirect method of measuring the nucleon axial-vector form factor #a(& 2 ). 
An experiment to measure £a(& 2 ) by this method was carried out by Joos et ah 
(1976) giving a value nriA = 1.18 ± 0.07 GeV for the mass in the dipole formula 
9A(k 2 ) = 5a( 0)(1 + ^ 2 / m A)~ 2 * This value is in good agreement with the value 
nriA = 1-07 ± 0.06 GeV given in the quasielastic scattering + n —>//“ + p exper¬ 
iment of Baker et al. (1981), and also in reasonable agreement with values oi m a 
obtained from single pion electroproduction at threshold using the low energy the¬ 
orem of Nambu and Shrauner (1962) (for which experimental references are given 
in both the Joos et al. and Baker et al. articles). At the 1968 Nobel Symposium 
on Elementary Particle Theory, I gave a brief talk (Adler, 1968a) reviewing various 
methods that had been proposed to measure the nucleon axial-vector form factor: 
quasielastic neutrino scattering, neutrino production of the A(1232), electroproduc¬ 
tion of a single soft pion (the Nambu-Shrauner proposal), and electroproduction 
of the A(1232) plus an additional soft pion (the proposal of my paper R14 with 
Weisberger). Over the years since then, all of these methods have been carried out. 

I also returned, after completion of the book on Current Algebras , to the repeat¬ 
edly delayed project of completing the numerical work associated with my thesis 
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calculation of weak pion production, and this kept me busy until the spring of 1968, 
when I finished a comprehensive article on photo-, electro-, and weak single-pion 
production in the A(1232), or as it was then termed, the (3,3) resonance region 
(Adler, 1968b, R15). This paper is so long (123 pages) that it is not feasible to 
reprint it all here, so I have included only the introduction (Sec. 1) and part of the 
discussion of implications of PC AC (Secs. 5A and 5B). The basic approximation used 
in this paper consisted of using the Born approximation for all nonresonant multi¬ 
poles, augmented by terms coming from the PCAC low energy theorems, together 
with a unitarized Born approximation for the dominant resonant (3,3) multipoles, 
giving predictions for weak pion production in the (3,3) region in terms of the vec¬ 
tor and axial-vector form factors of the nucleon. By 1968 there were experimental 
results on pion electroproduction which were in satisfactory agreement with my the¬ 
ory, except for values of the momentum transfer k 2 significantly larger than roughly 
0.6(GeV/c) 2 , where in retrospect one can see effects from the scaling regime showing 
up. For neutrino pion production, preliminary comparison of my results with CERN 
data showed an axial-vector form factor #a(& 2 ) that falls off more slowly with k 2 than 
the vector form factors, with a dipole mass of tua ~ 1.2GeV. A subsequent compar¬ 
ison of my model with high-statistics neutrino data from Brookhaven by Kitagaki 
et ah (1986) gave good fits with a dipole mass of tua = 1.28 ±0.11 GeV, somewhat 
high compared to values obtained by other methods described above. Reasonable 
fits of my model to the A cross section and density matrix elements measured in 
the hydrogen bubble chamber at Argonne were also reported in papers of Schreiner 
and von Hippel (1973a,b), and a comparison with other models and data was given 
by Rein and Sehgal (1981). (For a recent alternative approach to A(1232) weak 
production, and extensive references to earlier theoretical and experimental studies 
of this reaction, see Paschos et ah (2004).) After 1968 I did not work again on weak 
pion production until 1974-75, when the subject became important because it was 
an avenue for exploring weak neutral currents, as discussed in Chapter 5 below. 

To conclude this section on low energy theorems, let me address the question of 
the extent to which the modern viewpoint, of pions as Nambu-Goldstone bosons, 
entered into my work. The earliest reference that I could find in my research notes 
to the “Goldstone theorem” (and specifically to the derivations given in the paper 
of Goldstone, Salam, and Weinberg, 1962) dates from the spring of 1967, in other 
words, after nearly all the work on soft pion theorems was completed. (This reference 
was in the context of calculations on the axial-vector vertex in QED that were the 
starting point of my work on the axial anomaly, to be discussed in the next chapter.) 

I fully appreciated the role of pions as Nambu-Goldstone bosons only after hearing 
seminars that referred to Nambu-Goldstone versus Wigner-Weyl representations of 
75 symmetry, which were connected (as best I recall) with the work of Gell-Mann, 
Oakes, and Renner (1968) and Dashen (1969) on chiral SU( 3) x SU(3) as a strong 
interaction symmetry. This may at first seem surprising, but now that the tapestry 
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of the standard model is completed, we see clearly the interrelations of its many 
threads; at the time when these threads were being laid down, those working from 
one direction were often unaware or only dimly aware of progress from another. 

Perhaps this is also a good point to say that the elucidation of the chiral struc¬ 
ture of the strong interactions was only one of the results flowing from the successes 
of current algebra methods and PCAC; something that was perhaps even more sig¬ 
nificant at the time was the demonstration that quantum field theory methods were 
really valid, after all, in dealing with hadronic interactions. When I entered gradu¬ 
ate school, the prevailing view was that the strong interactions would be understood 
through some kind of dispersion theoretic “reciprocal bootstrap”, and nearly every 
particle physics talk I heard began with a Mandelstam diagram on the blackboard. 
By 1967, this view had changed; it was clear that field theory could produce re¬ 
sults which could not be obtained from the dispersion relations program, and this 
strongly influenced subsequent developments. 
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3. Anomalies: Chiral Anomalies and Their Nonrenormalization, 
Perturbative Corrections to Scaling, and Trace Anomalies 

to All Orders 


Chiral Anomalies and 7 r° —> 77 Decay 

I got into the subject of anomalies in an indirect way, through exploration during 
1967-1968 of the speculative idea that the muon-electron mass difference could be 
accounted for by giving the muon an additional magnetic monopole electromagnetic 
coupling through an axial-vector current, which somehow was nonperturbatively 
renormalized to zero. After much fruitless study of the integral equations for the 
axial-vector vertex part, I decided in the spring of 1968 to first try to answer a 
well-defined question, which was whether the axial-vector vertex in QED was renor¬ 
malized by multiplication by Z 2 , as I had been implicitly assuming. At the time 
when I turned to this question, I had just started a 6-week visit to the Cavendish 
Laboratory in Cambridge, England after flying to London with my family on April 
21, 1968 (as recorded by my ex-wife Judith in my oldest daughter Jessica’s “baby 
book”). In the Cavendish I shared an office with my former adviser Sam Treiman, 
and was enjoying the opportunity to try a new project not requiring extensive com¬ 
puter analysis; I had only a month before finished my Annals of Physics paper R15 
on weak pion production (see Chapter 2), which had required extensive computa¬ 
tion, not easy to do in those days when one had to wait hours or even a day for the 
results of a computer run. 

My interest in the multiplicative renormalization question had been piqued by 
work of van Nieuwenhuizen, in which he had attempted to demonstrate the finite¬ 
ness to all orders of radiative corrections to /i decay, using an argument based on 
subtraction of renormalization constants that I knew to be incorrect beyond lead¬ 
ing order. I had learned about this work during the previous summer, when I was 
a lecturer at the Varenna summer school held by Lake Como from July 17-29, 
1967, at which van Nieuwenhuizen had given a seminar on this topic that was cri¬ 
tiqued by Bjorken, another lecturer. (For further historical details about this, see my 
review article Adler (2004a) on anomalies and anomaly nonrenormalization, from 
which much of this commentary has been adapted.) Working in the old Cavendish, I 
rather rapidly found an inductive multiplicative renormalizability proof, paralleling 
the one in Bjorken and Drell (1965) for finiteness of Z 2 times the vector vertex. 
I prepared a detailed outline for a paper describing the proof, but before writing 
things up, I decided as a check to test whether the formal argument for the closed 
loop part of the Ward identity worked in the case of the smallest loop diagram. This 
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is a triangle diagram with one axial and two vector vertices (the AVV triangle; 
see Fig. 1(a)), which because of Furry’s theorem (C invariance) has no analog in 
the vector vertex case. I knew from a student seminar that I had attended during 
my graduate study at Princeton that this diagram had been explicitly calculated 
using a gauge-invariant regularization by Rosenberg (1963), who was interested in 
the astrophysical process jy + v —> 7 + v, with 7 y a virtual photon emitted by a 
nucleus. I got Rosenberg’s paper, tested the Ward identity, and to my astonishment 
(and Treiman’s when I told him the result) found that it failed! I soon found that 
the problem was that my formal proof used a shift of integration variables inside 
a linearly divergent integral, which (as I again recalled from student reading) had 
been analyzed in an Appendix to the classic text of Jauch and Rohrlich (1955), with 
a calculable constant remainder. For all closed loop contributions to the axial vertex 
in Abelian electrodynamics with larger numbers of vector vertices (the AVVVV , 
AVVVVVV ,... loops; see Fig. 1(b)), the fermion loop integrals for fixed photon 
momenta are highly convergent and the shift of integration variables needed in the 
Ward identity is valid, so proceeding in this fermion loop-wise fashion there were 
apparently no further additional or “anomalous” contributions to the axial-vector 
Ward identity. With this fact in the back of my mind I was convinced from the outset 
that the anomalous contribution to the axial Ward identity would come just from 
the triangle diagram, with no renormalizations of the anomaly coefficient arising 
from higher order AVV diagrams with virtual photon insertions. 





(a) (b) 


Fig. 1. Fermion loop diagam contributions to the axial-vector vertex part. Solid lines are 
fermions, and dashed lines are photons, (a) The smallest loop, the AVV triangle diagram, (b) 
Larger loops with four or more vector vertices, which (when summed over vertex orderings) 
obey normal Ward identities. 


In early June, at the end of my 6 weeks in Cambridge, I returned to the US and 
then went to Aspen, where I spent the summer working out a manuscript on the 
properties of the axial anomaly, which became the body (pages 2426-2434) of the 
final published version (Adler, 1969, R16). Several of the things done there deserve 
mention, since they were important in later applications. The first was a calculation 
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of the field theoretic form of the anomaly, giving the now well-known result 

= 2tmo/(x) + ^F^(x)F^(x)e iaTp , 

with jp — the axial-vector current (referred to above as A), j 5 — *075 the 

pseudoscalar current, and with mo and oq the (unrenormalized) fermion mass and 
coupling constant. The second was a demonstration that because of the anomaly, Z 2 
is no longer the multiplicative renormalization constant for the axial-vector vertex, 
as a result of the diagram drawn in Fig. 1(a) in which the AVV triangle is joined to 
an electron line with two virtual photons. Instead, the axial-vector vertex is made 
finite by multiplication by the renormalization constant 

Z A = Z 2 [l + ^(a 0 A) 2 log(A 2 / m2 ) + •••] , 

thus giving an answer to the question with which I started my investigation. Thirdly, 
as an application of this result, I showed that the anomaly leads, in fourth or¬ 
der of perturbation theory, to infinite radiative corrections to the current-current 
theory of z/^/i and u e e scattering, but that this infinity can be cancelled between 
different fermion species by adding appropriate u^e and v e \i scattering terms to 
the Lagrangian. This result is a forerunner of anomaly cancellation mechanisms in 
modern gauge theories. It is related to the fact, also discussed in my paper, that the 
asymptotic behavior of the AVV triangle diagram saturates the bound given by the 
Weinberg power counting rules, rather than being one power better as is the case 
for the AVVVV and higher loop diagrams, and has a leading asymptotic term that 
is a function solely of the external momenta. Finally, I also showed that a gauge 
invariant chiral generator still exists in the presence of the anomaly. Although not 
figuring in our subsequent discussion here, in its non-Abelian generalization this was 
relevant (as reviewed in Coleman, 1989) to later discussions of the U( 1 ) problem 
in quantum chromodynamics (QCD), leading up to the solution given by ’t Hooft 
(1976). 

No sooner was this part of my paper completed than Sidney Coleman arrived 
in Aspen from Europe, and told me that Bell and Jackiw (published as Bell and 
Jackiw, 1969) had independently discovered the anomalous behavior of the AVV 
triangle graph, in the context of a sigma model investigation of the Veltman (1967)- 
Sutherland (1967) theorem stating that 7 r° —> 77 decay is forbidden in a PCAC 
calculation. The Sutherland-Veltman theorem is a kinematic statement about the 
AVV three-point function, which asserts that if the momenta associated with the 
currents A, V, V are respectively q , Aq, A^, then the requirement of gauge invariance 
on the vector currents forces the AVV vertex to be of order qk\k 2 in the external 
momenta. Hence when one applies a divergence to the axial-vector vertex and uses 
the standard PCAC relation (with the quark current T f the analog of ^ j £ ) 

d^i fl (x) = (f rr M>/V2)Mx) , 
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with M n the pion mass, cf> n the pion field, and f n the charged pion decay constant, 
one finds that the 7r° — > 77 matrix element is of order q 2 k\k2 , and hence vanishes 
in the soft pion limit q 2 —> 0. Bell and Jackiw analyzed this result by a perturbative 
calculation in the cr-model, in which PCAC is formally built in from the outset, and 
found a non-vanishing result for the 7r° —> 77 amplitude, which they traced back 
to the fact that the regularized AVV triangle diagram cannot be defined to satisfy 
the requirements of both PCAC and gauge invariance. This constituted the “PCAC 
Puzzle” referred to in the title of their paper. They then proposed to modify the 
original cr-model by adding further regulator fields with mass-dependent coupling 
constants in such a manner as to simultaneously enforce gauge invariance and PCAC, 
thus enforcing the Sutherland-Veltman prediction of a vanishing 7r° —> 77 decay 
amplitude. In the words of Bell and Jackiw in their paper, “It has to be insisted that 
the introduction of this mass dependence of coupling constants is not an arbitrary 
step in the PCAC context. If a regularization is introduced to define the theory, 
it must respect any formal properties which are to be appealed to.” And again in 
concluding their paper, they stated “To the complaint that we have changed the 
theory, we answer that only the revised version embodies simultaneously the ideas 
of PCAC and gauge invariance.” 

It was immediately clear to me, in the course of the conversation with Sidney 
Coleman, that introducing additional regulators to eliminate the anomaly would 
entail renormalizability problems in a meson scattering, and was not the correct way 
to proceed. However, it was also clear that Bell and Jackiw had made an important 
observation in tying the anomaly to the Sutherland-Veltman theorem for 7r° —> 77 
decay, and that I could use the sigma-model version of the anomaly equation to get 
a nonzero prediction for the 7r° —> 77 amplitude, with the whole decay amplitude 
arising from the anomaly term. I then wrote an Appendix to my paper (pages 2434- 
2438), clearly delineated from the manuscript that I had finished before Sidney’s 
arrival, in which I gave a detailed rebuttal of the regulator construction, by showing 
that the anomaly could not be eliminated without spoiling either gauge-invariance 
or renormalizability. (In later discussions I added unitarity to this list, to exclude 
the possibility of canceling the anomaly by adding a term to the axial current with 
a d^/idx) 2 singularity.) In this Appendix I also used an anomaly-modified PCAC 
equation 

= (umZ/V2)Mx) + s2^(x)F^( x )e^ Tp , 

with S a constant determined by the constituent fermion charges and axial-vector 
couplings, to obtain a PCAC formula for the n° —> 77 amplitude F n 

F* = -(a/7r)2SV2/f n . 

Although the axial anomaly, in the context of breakdown of the “pseudoscalar- 
pseudovector equivalence theorem”, had in fact been observed much earlier, start- 
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ing with Fukuda and Miyamoto (1949) and Steinberger (1949) and continuing to 
Schwinger (1951), my paper broke new ground by treating the anomaly neither as 
a baffling calculational result, nor as a field theoretic artifact to be eliminated by 
a suitable regularization scheme, but instead as a real physical effect (breaking of 
classical symmetries by the quantization process) with observable physical conse¬ 
quences. 

This point of view was not immediately embraced by everyone else. After com¬ 
pleting my Appendix I sent Bell and Jackiw copies of my longhand manuscript, and 
an interesting correspondence ensued. In a letter dated August 25, 1968, Jackiw was 
skeptical whether one could extract concrete physical predictions from the anomaly, 
and whether one could augment the divergence of the axial-vector current by a def¬ 
inite extra electromagnetic contribution, as in the modified PCAC equation above. 
Bell, who was traveling, wrote me on Sept. 2, 1968, and was more appreciative of 
the possibility of using a modified PCAC to get a formula for the neutral pion de¬ 
cay amplitude, writing “The general idea of adding some quadratic electromagnetic 
terms to PCAC has been in our minds since Sutherland’s rj problem. We did not 
see what to do with it.” He also defended the approach he and Jackiw had taken, 
writing “The reader may be left with the impression that your development is con¬ 
tradictory to ours, rather than complementary. Our first observation is that the a 
model interpreted in a conventional way just does not have PCAC. This is already 
a resolution of the puzzle, and the one which you develop in a very nice way. We, 
interested in the cr-model only as exemplifying PCAC, choose to modify the con¬ 
ventional procedures, in order to exhibit a model in which general PCAC reasoning 
could be illustrated in explicit calculation.” In recognition of this letter from John 
Bell, whom I revered, I added a footnote 15 to my manuscript saying “Our results 
do not contradict those of Bell and Jackiw, but rather complement them. The main 
point of Bell and Jackiw is that the a model interpreted in the conventional way, 
does not satisfy the requirements of PCAC. Bell and Jackiw modify the a model in 
such a way as to restore PCAC. We, on the other hand, stay within the conventional 
a model, and try to systematize and exploit the PCAC breakdown.” This footnote, 
which contradicts statements made in the text of my paper, has puzzled a number 
of people; in retrospect, rather than writing it as a paraphrase of Bell’s words, I 
should have quoted directly from Bell’s letter. 

Following this correspondence, my paper was typed on my return to Princeton 
and was received by the Physical Review on Sept. 24, 1968. (Bell and Jackiw’s 
paper, a CERN preprint dated July 16, 1968, was submitted to II Nuovo Cimento , 
and received by that journal on Sept. 11, 1968.) My paper was accepted along with 
a signed referee’s report from Bjorken, stating “This paper opens a topic similar 
to the old controversies on photon mass and nature of vacuum polarization. The 
lesson there, as I (no doubt foolishly) predict will happen here, is that infinities 
in diagrams are really troublesome, and that if the cutoff which is used violates a 
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cherished symmetry of the theory, the results do not respect the symmetry. I will 
also predict a long chain of papers devoted to the question the author has raised, 
culminating in a clever renormalizable cutoff which respects chiral symmetry and 
which, therefore, removes Adler’s extra term.” Thus, acceptance of the point of 
view that I had advocated was not immediate, but only followed over time. In 1999, 
Bjorken was a speaker at my 60th birthday conference at the Institute for Advanced 
Study, and amused the audience by reading from his report, and then very graciously 
gave me his file copy, with an appreciative inscription, as a souvenir. 

The viewpoint that the anomaly determined the 7r° —> 77 decay amplitude had 
significant physical consequences. In the Appendix to my paper, I showed that the 
value S — | implied by the fractionally charged quark model gave a decay amplitude 
that was roughly a factor of 3 too small. More generally, I showed that a triplet 
constituent model with charges (Q,Q — 1,Q — 1) gave S — Q — and so with 
integrally charged constituents (Q = 0 or Q — 1) one gets an amplitude that agrees 
in absolute value, to within the expected accuracy of PCAC, with experiment. I 
noted in my paper that Q — 0, or S = — ^ corresponded to the case in which 
radiative corrections to weak interactions had been shown to be finite, but this 
choice for the sign of the 7r° —> 77 amplitude was soon to be ruled out. Over the 
next few months Okubo (1969) and Gilman (1969) wrote me letters accompanying 
preprints which demonstrated, by different methods, that the sign corresponding 
to a single positive integrally charged constituent going around the triangle loop 
agrees with experiment. Okubo also analyzed various alternative models for proton 
constituents, and pointed out that while some are excluded by the experimentally 
determined value of S', the integrally charged Maki (1964)-Hara (1964) single triplet 
model (the model that I had considered in my Appendix, but now with Q = 1), and 
the corresponding integrally charged three triplet model of Han and Nambu (1965) 
(see also Tavkhelidze (1965), Miyamoto (1965), and Nambu (1965)), are both in 
accord with the empirical value S ~ In a conference talk a year later, in September 
1969 (Adler, 1970a, R17) I reviewed the subject of the anomaly calculation of neutral 
pion decay, as developed in the papers that had appeared during the preceding year. 

The work just described gave the first indications that neutral pion decay pro¬ 
vides empirical evidence that can discriminate between different models for hadronic 
constituents. The correct interpretation of the fact that S ~ | came only later, when 
what we now call the “color” degree of freedom was introduced in the seminal pa¬ 
pers of Bardeen, Fritzsch, and Gell-Mann (1972; reprinted as hep-ph/0211388) and 
Fritzsch and Gell-Mann (1971/1972; reprinted as hep-ph/0301127). These papers 
used my calculation of 7r° —> 77 decay as supporting justification for the tripling 
of the number of fractionally charged quark degrees of freedom, thus increasing the 
theoretical value of S for fractionally charged quarks from | to The paper of 
Bardeen, Fritzsch, and Gell-Mann also pointed out that this tripling would show up 
in a measurement of R, the ratio of hadronic to muon pair production in electron 
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positron collisions, while noting that “Experiments at present are too low in energy 
and not accurate enough to test this prediction, but in the next year or two the 
situation should change.”, as indeed it did. 

Before leaving the subject of the early history of the anomaly and its antecedents, 
perhaps this is the appropriate place to mention the paper of Johnson and Low 
(1966), which showed that the Bjorken (1966)-Johnson-Low (1966) (BJL) method 
of identifying formal commutators with an infinite energy limit of Feynman diagrams 
gives, in significant cases, results that differ from the naive field-theoretic evaluation 
of these commutators. This method was later used by Jackiw and Johnson (1969) 
and by Boulware and myself (Adler and Boulware, 1969, R18) to show that the AVV 
axial anomaly can be reinterpreted in terms of anomalous commutators. This line 
of investigation, however, did not readily lend itself to a determination of anomaly 
effects beyond leading order. For example, I still have in my files an unpublished 
manuscript (circa 1966) attempting to use the BJL method to tackle a simpler prob¬ 
lem, that of proving that the Schwinger term in quantum electrodynamics (QED) is 
a c-number to all orders of perturbation theory. I believe that this result is true (and 
it may well have been proved by now using operator product expansion methods), 
but I was not able at that time to achieve sufficient control of the BJL limits of high 
order diagrams with general external legs to give a proof. (See also remarks on this 
in Chapter 4.) 


Anomaly Nonrenormalization 

We are now ready to address the issue of the determination of anomalies beyond 
leading order in perturbation theory. Before the neutral pion low energy theorem 
could be used as evidence for the charge structure of quarks, one needed to be 
sure that there were no perturbative corrections to the anomaly and the low energy 
theorem following from it. As I noted above, the fermion loop-wise argument that 
I used in my original treatment left me convinced that only the lowest order AVV 
diagram would contribute to the anomaly, but this was not a proof. This point of 
view was challenged in the article by Jackiw and Johnson (1969), received by the 
Physical Review on Nov. 25, 1968, who stated “Adler has given an argument to the 
end that there exist no higher-order effects. He introduced a cutoff, calculated the 
divergence, and then let the cutoff go to infinity. This is seen in the present context 
to be equivalent to the second method above. However, we believe that this method 
may not be reliable because of the dependence on the order of limits.” And in their 
conclusion, they stated “In a definite model the nature of the modification (to the 
axial-vector current divergence equation) can be determined, but in general only 
to lowest order in interactions.” This controversy with Jackiw and Johnson was the 
motivation for a more thorough analysis of the nonrenormalization issue undertaken 
by Bill Bardeen and myself in the fall and winter of 1968-1969 (Adler and Bardeen, 
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1969, R19) and was cited in the “Acknowledgments” section of our paper, where we 
thanked “R. Jackiw and K. Johnson for a stimulating controversy which led to the 
writing of this paper.” 

The paper with Bardeen approached the problem of nonrenormalization by two 
different methods. We first gave a general constructive argument for nonrenormal¬ 
ization of the anomaly to all orders, in both quantum electrodynamics and in the 
cr-model in which PCAC is canonically realized, and we then backed this argu¬ 
ment up with an explicit calculation of the leading order radiative corrections to 
the anomaly, showing that they cancelled among the various contributing Feynman 
diagrams. The strategy of the general argument was to note that since the anomaly 
equations written above involve unrenormalized fields, masses, and coupling con¬ 
stants, these equations are well defined only in a cutoff field theory. Thus, for both 
electrodynamics and the cr-model, we constructed cutoff versions by introducing 
photon or cr-meson regulator fields with mass A. (This was simple for the case of 
electrodynamics, but more difficult, relying heavily on Bill Bardeen’s prior experi¬ 
ence with meson field theories, in the case of the cr-model.) In both cases, the cutoff 
prescription allows the usual renormalization program to be carried out, expressing 
the unrenormalized quantities in terms of renormalized ones and the cutoff A. In 
the cutoff theories, the fermion loop-wise argument I used in my original anomaly 
paper is still valid, because regulating boson propagators does not alter the chiral 
symmetry properties of the theory, and thus it is straightforward to prove the va¬ 
lidity of the anomaly equations involving unrenormalized quantities to all orders of 
perturbation theory. 

Taking the vacuum to two 7 matrix element of the anomaly equations, and 
applying the Sutherland-Veltman theorem, which asserts the vanishing of the matrix 
element of at the special kinematic point q 2 — 0, Bardeen and I then got exact 
low energy theorems for the matrix elements ( 27 | 2 imoj 5 | 0 ) (in electrodynamics) 
and (2 7 |(/ 7 r M2/ v / 2)0 7 r |O) (in the cr-model) of the “naive” axial-vector divergence 
at this kinematic point, which were given by the negative of the corresponding 
matrix element of the anomaly term. However, since we could prove that these 
matrix elements are finite in the limit as the cutoff A approaches infinity, this in 
turn gave exact low energy theorems for the renormalized, physical matrix elements 
in both cases. One subtlety that entered into the all orders calculation was the 
role of photon rescattering diagrams connected to the anomaly term, but using 
gauge invariance arguments analogous to those involved in the Sutherland-Veltman 
theorem, we were able to show that these diagrams made a vanishing contribution to 
the low energy theorem at the special kinematic point q 2 — 0. Thus, my paper with 
Bardeen provided a rigorous underpinning for the use of the 7 r° —> 77 low energy 
theorem to study the charge structure of quarks. 

In our explicit second order calculation, we calculated the leading order radiative 
corrections to this low energy theorem, arising from addition of a single virtual pho- 
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ton or virtual cr-meson to the lowest order diagram. We did this by two methods, 
one involving a direct calculation of the integrals, and the other (devised by Bill 
Bardeen) using a clever integration by parts argument to bypass the direct calcula¬ 
tion. Both methods gave the same answer: the sum of all the radiative corrections is 
zero, as expected from our general nonrenormalization argument. We also traced the 
contradictory results obtained in the paper of Jackiw and Johnson to the fact that 
these authors had studied an axial-vector current (such as "07^ 75 ^ i n the cr-model) 
that is not made finite by the usual renormalizations in the absence of electromag¬ 
netism; as a consequence, the naive divergence of this current is not multiplicatively 
renormalizable. As we noted in our paper, “In other words, the axial-vector current 
considered by Jackiw and Johnson and its naive divergence are not well-defined ob¬ 
jects in the usual renormalized perturbation theory; hence the ambiguous results 
which these authors have obtained are not too surprising.” Our result of a defi¬ 
nite, unrenormalized low energy theorem, we noted, came about because “In each 
model we have studied a particular axial-vector current: in spinor electrodynamics, 
the usual axial-vector current ... and in the a model the Polkinghorne (1958a,b) 
axial-vector current ... which, in the absence of electromagnetism, obeys the PCAC 
condition.” It is these axial-vector currents that obey a simple anomaly equation to 
all orders in perturbation theory, and which give an exact, physically relevant low 
energy theorem for the naive axial-vector divergence. 

This paper with Bill Bardeen should have ended the controversy over whether 
the anomaly was renormalized, but it didn’t. Johnson pointed out in an unpublished 
report that since the anomaly is mass-independent, it should be possible to calculate 
it in massless electrodynamics, for which the naive divergence 2 imoj 5 vanishes and 
the divergence of the axial-vector current directly gives the anomaly. Moreover, in 
massless electrodynamics there is no need for mass renormalization, and so if one 
chooses Landau gauge for the virtual photon propagator, the second order radiative 
correction calculation becomes entirely ultraviolet finite, with no renormalization 
counter terms needed. Such a second order calculation was reported by Sen (1970), 
a Johnson student, who claimed to find nonvanishing second order radiative correc¬ 
tions to the anomaly. However, the calculational scheme proposed by Johnson and 
used by Sen has the problem that, while ultraviolet finite, there are severe infrared 
divergences, which if not handled carefully can lead to spurious results. After a long 
and arduous calculation (Adler, Brown, Wong, and Young, 1971) my collaborators 
and I were able to show that the zero mass calculation, when properly done, also 
gives a vanishing second order radiative correction to the anomaly. This confirmed 
the result I had found with Bardeen, which had by then also been confirmed by dif¬ 
ferent methods in the mo 7 ^ 0 theory in papers of Abers, Dicus, and Teplitz (1971) 
and Young, Wong, Gounaris, and Brown (1971). 

Even this was not the end of controversies over the nonrenormalization theo¬ 
rem, as discussed in detail in my review Adler (2004a) that focuses specifically on 
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anomaly nonrenormalization. Suffice it to say here that no objections raised have 
withstood careful analysis, and there is now a detailed understanding of anomaly 
nonrenormalization both by perturbative methods, and by non-perturbative meth¬ 
ods proceeding from the Callan-Symanzik equations. There is also a detailed under¬ 
standing of anomaly nonrenormalization in the context of supersymmetric theories, 
where initial apparent puzzles are now resolved. 


Point Splitting Calculations of the Anomaly 

At this point let me backtrack, and discuss the role of point-splitting methods 
in the study of the Abelian electrodynamics anomaly. In the present context, 
point-splitting was first used in the discussion given by Schwinger (1951) of the 
pseudoscalar-pseudovector equivalence theorem, to be described in more detail 
shortly. Almost immediately following circulation of the seminal anomaly preprints 
in the fall of 1968, Hagen (1969, received Sept, 24, 1968, and a letter to me dated 
Oct. 16, 1968), Zumino (1969, and a letter to me dated Oct. 7, 1968), and Brandt 
(1969, received Dec, 17, 1968, and a letter to me dated Oct. 16, 1968) all rederived 
the anomaly formula by a point-splitting method. Independently, a point-splitting 
derivation of the anomaly was given by Jackiw and Johnson (1969, received 25 
November, 1968), who explicitly made the connection to Schwinger’s earlier work 
(Johnson was a Schwinger student, and was well acquainted with Schwinger’s body 
of work). The point of all of these calculations is that the anomaly can be derived 
by formal algebraic use of the equations of motion, provided one redefines the sin¬ 
gular product '0 ( x )7m7sV j ( x ) appearing in the axial-vector current by the point-split 
expression 

lim '0 (x / ) 7 m 75 exp[—ie / dx X B^{x) , 

x—>x' J x l 

and takes the limit x f —> x at the end of the calculation. 

Responding to these developments, I appended a “Note added in proof” to my 
anomaly paper, mentioning the four field-theoretic, point-splitting derivations that 
had subsequently been given, and adding “Jackiw and Johnson point out that the 
essential features of the field-theoretic derivation, in the case of external electro¬ 
magnetic fields, are contained in J. Schwinger, Phys. Rev. 82 , 664 (1951)”. What 
to me was an interesting irony emerged from learning of the connection between 
anomalies and the famous Schwinger (1951) paper on vacuum polarization. I had in 
fact read Section II and the Appendices of the 1951 paper, when Alfred Goldhaber 
and I, during our senior year at Harvard (1960-61), did a reading course on quantum 
electrodynamics with Paul Martin, which focused on papers in Schwinger’s reprint 
volume (Schwinger, 1958). Paul had told us to read the parts of the Schwinger pa¬ 
per that were needed to calculate the VV vacuum polarization loop, but to skip the 
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rest as being too technical. Reading Section V of Schwinger’s paper brought back 
to mind a brief, forgotten conversation I had had with Jack Steinberger, who was 
Director of the Varenna Summer School in 1967. Steinberger had told me that he 
had done a calculation on the pseudovector-pseudoscalar equivalence theorem for 
7 T° —> 77 , but had gotten different answers in the two cases; also that Schwinger had 
claimed to reconcile the answers, but that he (Steinberger) couldn’t make sense out 
of Schwinger’s argument. Jack had urged me to look at it, which I never did until 
getting the Jackiw-Johnson preprint, but in retrospect everything fell into place, 
and the connection to Schwinger’s work was apparent. 

This now brings me to the question, did Schwinger’s paper constitute the dis¬ 
covery of the anomaly? Both Jackiw, in his paper with Johnson, and I were careful 
to note the connection between Schwinger’s (1951) paper and the point-splitting 
derivations of the anomaly, once it was called to our attention. However, recently 
some of Schwinger’s former students have gone further, arguing that Schwinger was 
the discoverer of the anomaly and that my paper and that of Bell and Jackiw were 
merely a “rediscovery” of a previously known result. I believe that this claim goes 
beyond the published record of what is in Schwinger’s paper, as analyzed in detail in 
Sec. 2.3 and Appendix A of my review Adler (2004a). Stated briefly, Schwinger’s cal¬ 
culation was devoted to making the pseudovector calculation give the same non-zero 
answer as the pseudoscalar one, and what Schwinger calls the redefined axial-vector 
divergence is in fact not the divergence of the gauge-invariant axial-vector current, 
but rather the axial-vector current divergence minus the anomaly. In other words, 
Schwinger’s calculation effectively transposes the anomaly term to the left-hand 
side of the anomaly equation, so that what he evaluates is the effective Lagrangian 
arising from the left-hand side of the equation 

Pjfa) - ^F^(x)F^(x)e^ r/> = 2im 0 j\x) , 

which then necessarily gives the same result as calculation of an effective Lagrangian 
from the right-hand side, which is pseudoscalar coupling. There is no gauge-invariant 
axial-vector current for which the combination on the left-hand side is the diver¬ 
gence, but as shown in Eqs. (58) and (59) of R16, there is a gauge-non-invariant 
axial-vector current which has this divergence. 

The use of a point-splitting method was of course important and fruitful, and 
in retrospect, the axial anomaly is hidden within Schwinger’s calculation. But 
Schwinger never took the crucial step of observing that the axial-vector current 
matrix elements cannot, in a renormalizable quantum theory, be made to satisfy 
all of the expected classical symmetries. And more specifically, he never took the 
step of defining a gauge-invariant axial-vector current by point splitting, which has 
a well-defined anomaly term in its divergence, with the anomaly term completely 
accounting for the disagreement between the pseudoscalar and pseudovector calcu¬ 
lations of neutral pion decay. So I would say that although Schwinger took steps in 
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the right direction, particularly in noting the utility of point-splitting in defining the 
axial-vector current, his 1951 paper obscured the true physics and does not mark 
the discovery of the anomaly. This happened only much later, in 1968, and led to a 
flurry of activity by many people. My view is supported, I believe, by the fact that 
Schwinger’s calculation seemed arcane, even to people (like Steinberger) with whom 
he had talked about it and to colleagues familiar with his work, and exerted no in¬ 
fluence on the field until after preprints on the seminal work of 1968 had appeared. 


The Non-Abelian Anomaly, Its Nonrenormalization and Geometric 
Interpretation 

Since in the chiral limit the AVV triangle is identical to an AAA triangle (as is 
easily seen by an argument involving anticommutation of a 75 around the loop), I 
knew already in unpublished notes dating from the late summer of 1968 that the 
AAA triangle would also have an anomaly; a similar observation was also made 
by Gerstein and Jackiw (1969). From fragmentary calculations begun in Aspen I 
suspected that higher loop diagrams might have anomalies as well, so after the 
nonrenormalization work with Bill Bardeen was finished I suggested to Bill that he 
work out the general anomaly for larger diagrams. (I was at that point involved 
in other calculations with Wu-Ki Tung, on the perturbative breakdown of scaling 
formulas such as the Callan-Gross relation, to be discussed shortly.) I showed Bill 
my notes, which turned out to be of little use, but which contained a very pertinent 
remark by Roger Dashen that including charge structure (which I had not) would 
allow a larger class of potentially anomalous diagrams. Within a few weeks, Bill 
carried out a brilliant calculation, by point-splitting methods, of the general anomaly 
in both the Abelian and the non-Abelian cases (Bardeen, 1969). Expressed in terms 
of vector and axial-vector Yang-Mills field strengths 

Fp(x)=dPV*(x)-d*V*(x)-i[V*(x),V*(x)]-i[A*(x),A*(xy , 

F^(x) - &*A v {x) - ^#(r) - i[V^(x),A I/ (x)\ - i[A*{x), V u {x )] , 


Bardeen’s result takes the form 

= normal divergence term 

+ (l/47r 2 ) £M „ orr tr r [AS [(l/4)i^" (a:)i^(a:) + (1/1 2)F^(x)F^(x) 

+ (2/3)iA^(x)A u (x)F^ T (x) + (2/3 )iF^(x)A a (x)A T (x) 

+ (2/3 )iA^(x)F^(x)A T (x) - (8/3 )A»{x)A u {x)A a {x)A T {x)} , 

with tr/ denoting a trace over internal degrees of freedom, and the internal 
symmetry matrix associated with the axial-vector external field. In the Abelian case, 
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with trivial internal symmetry structure, the terms involving two or three factors of 
A fJ, ' v '" m vanish by antisymmetry of e Ml/crr , and there are only AVV and AAA triangle 
anomalies. When there is non-trivial internal symmetry or charge structure, there 
are anomalies associated with the box and pentagon diagrams as well, confirming 
Dashen’s intuition mentioned earlier. Bardeen notes that whereas the triangle and 
box anomalies result from linear divergences associated with these diagrams, the 
pentagon anomalies arise not from linear divergences, but rather from the definition 
of the box diagrams to have the correct vector current Ward identities. Bardeen also 
notes, in his conclusion, another prophetic remark of Dashen, to the effect that the 
pentagon anomalies should add anomalous terms to the PCAC low energy theorems 
for five pion scattering; I shall return to this shortly. 

There are two distinct lines of argument leading to the conclusion that the non- 
Abelian chiral anomaly also has a nonrenormalization theorem, and is given exactly 
by Bardeen’s leading order calculation. The first route parallels that used in the 
Abelian case, involving variously a loop-wise regulator construction, explicit fourth 
order calculation, and an argument using the Callan-Symanzik equations; for de¬ 
tailed references, see Adler (2004a). The conclusion in all cases is that the Adler- 
Bardeen theorem extends to the non-Abelian case. Heuristically, what is happening 
is that except for a few small one-fermion loop diagrams, non-Abelian theories, just 
like Abelian ones, are made finite by gauge invariant regularization of the gluon 
propagators. But this regularization has no effect on the chiral properties of the 
theory, and therefore does not change its anomaly structure, which can thus be de¬ 
duced from the structure of the few small fermion loop diagrams for which naive 
classical manipulations break down. 

The second route leading to the conclusion that the non-Abelian anomaly is non- 
renormalized might be termed “algebraic/geometrical”, and consists of two steps. 
The first step consists of a demonstration that the higher order terms in Bardeen’s 
non-Abelian formula are completely determined by the leading, Abelian anomaly. 
During a summer visit to Fermilab in 1971, I collaborated with Ben Lee, Sam 
Treiman, and Tony Zee (Adler, Lee, Treiman, and Zee 1971, R20) in a calcula¬ 
tion of a low energy theorem for the reaction 7 + 7 n + n + n in both the neutral 
and charged pion cases. This was motivated in part by discrepancies in calculations 
that had just appeared in the literature, and in part by its relevance to theoretical 
unitarity calculations of a lower bound on the K £ —> /i+/i _ decay rate. Using PCAC, 
we showed that the fact that the 7 + 7 —> matrix elements vanish in the limit 
when a final 7r° becomes soft, together with photon gauge invariance, relates these 
amplitudes to the matrix elements F n for 7 + 7 —> tt° and for 7 —> 7 r° + 7 r + + 7 r _ , 

and moreover, gives a relation between the latter two matrix elements, 
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Thus all of the matrix elements in question are uniquely determined by F n , which 
itself is determined by the AVV anomaly calculation. An identical result for the same 
reactions was independently given by Terent’ev (Terentiev) (1971). In the meantime, 
in a beautiful formal analysis, Wess and Zumino (1971) showed that the current 
algebra satisfied by the flavor SU(3) octet of vector and axial-vector currents implies 
integrability or “consistency” conditions on the non-Abelian axial-vector anomaly, 
which are satisfied by the Bardeen formula, and conversely, that these constraints 
uniquely imply the Bardeen structure up to an overall factor, which is determined 
by the Abelian AVV anomaly. By introducing an auxiliary pseudoscalar field, Wess 
and Zumino were able to write down a local action obeying the anomalous Ward 
identities and the consistency conditions. (There is no corresponding local action 
involving just the vector and axial-vector currents, since if there were, the anomalies 
could be eliminated by a local counterterm.) Wess and Zumino also gave expressions 
for the processes 7 —> 37r and 2-y —> 37r discussed by Adler et al. and Terentiev, as 
well as giving the anomaly contribution to the five pseudoscalar vertex. The net 
result of these three simultaneous pieces of work was to show that the Bardeen 
formula has a rigidly constrained structure, up to an overall factor given by the 
7 r° —> 77 decay amplitude. 

The second step in the “algebraic/geometric” route to anomaly renormaliza¬ 
tion is a celebrated paper of Witten (1983), which shows that the Wess-Zumino 
action has a representation as the integral of a fifth rank antisymmetric tensor (con¬ 
structed from the auxiliary pseudoscalar field) over a five-dimensional disk of which 
four-dimensional space is the boundary. In addition to giving a new interpretation of 
the Wess-Zumino action T, Witten’s argument also gave a constraint on the overall 
factor in T that was not determined by the Wess-Zumino consistency argument. 
Witten observed that his construction is not unique, because a closed five-sphere 
intersecting a hyperplane gives two ways of bounding the four-sphere along the equa¬ 
tor with a five dimensional hemispherical disk. Requiring these two constructions 
to give the same value for exp (iT), which is the way the anomaly enters into a 
Feynman path integral, requires integer quantization of the overall coefficient in the 
Wess-Zumino-Witten action. This integer can be read off from the AVV triangle 
diagram, and for the case of an underlying color SU(N C ) gauge theory turns out to 
be just Ac, the number of colors. 

To summarize, the “algebraic/geometric” approach shows that the Bardeen 
anomaly has a unique structure, up to an overall constant, and moreover that this 
overall constant is constrained by an integer quantization condition. Hence once the 
overall constant is fixed by comparison with leading order perturbation theory (say 
in QCD), it is clear that this result must be exact to all orders, since the presence 
of renormalizations in higher orders of the strong coupling constant would lead to 
violations of the quantization condition. 

The fact that non-Abelian anomalies are given by an overall rigid structure 
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has important implications for quantum field theory. For example, the presence of 
anomalies spoils the renormalizability of non-Abelian gauge theories and requires 
the cancellation of gauged anomalies between different fermion species (see Gross 
and Jackiw (1972), Bouchiat, Iliopoulos, and Meyer (1972), and Weinberg (1973)), 
through imposition of the condition tr{T a ,T/ 3 }T 7 — 0 for all a,/3, 7 , with T a the 
coupling matrices of gauge bosons to left-handed fermions. The fact that anomalies 
have a rigid structure then implies that once these anomaly cancellation conditions 
are imposed for the lowest order anomalous triangle diagrams, no further conditions 
arise from anomalous square or pentagon diagrams, or from radiative corrections 
to these leading fermion loop diagrams. Other places where the one-loop geometric 
structure of non-Abelian anomalies enters are in instanton physics, and in the ’t 
Hooft anomaly matching conditions. These and other chiral anomaly applications 
are discussed in more detail in my review Adler (2004a), and also in my Encyclopedia 
of Mathematical Physics article Adler (2004b). Both of these sources give extensive 
references to recent review articles and books on anomalies, which update the 1970 
reviews given in my Brandeis lectures (Adler, 1970b) and in Jackiw’s Brookhaven 
lectures (Jackiw, 1970). 


Perturbative Corrections to Scaling 

While finishing the paper with Bardeen on anomaly nonrenormalization, I had em¬ 
barked on a different set of perturbative calculations with Wu-Ki Tung; these be¬ 
came a forerunner of a different kind of “anomaly”, the anomalous scaling observed 
in deep inelastic electron and neutrino scattering. Our starting point was the ques¬ 
tion of whether applications of the Bjorken (1966) limit technique, which assumed 
that the asymptotic behavior of time-ordered products is given by the “naive” or 
free field theory equal time commutator, would be modified in perturbation theory. 
Strong hints in this direction had been given in a paper of Johnson and Low (1966), 
which showed that the “Bjorken-Johnson-Low” limit can produce anomalous com¬ 
mutators, and related results were also obtained in an earlier paper of Vainshtein 
and Ioffe (1967); our aim was to do calculations focusing on several physically impor¬ 
tant applications not covered in this previous work. These were the calculation by 
Bjorken (1966) of the radiative corrections to /3-decay, the Bjorken (1967) backward- 
neutrino-scattering asymptotic sum rule, and the Callan-Gross (1969) relation re¬ 
lating the ratio of the longitudinal to transverse deep inelastic electron scattering 
cross sections to the constitution of the electric current, with the latter an applica¬ 
tion both of the Bjorken-Johnson-Low limit method, and of the later proposal by 
Bjorken (1969) of scaling of the deep inelastic structure functions. 

For our test model, we considered an SU(3) triplet of spin-1/2 particles bound 
by exchange of a massive singlet gluon, which we took as either a vector, scalar, 
or pseudoscalar. The results of the vector exchange calculation, to leading order of 
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perturbation theory, were reported in Adler and Tung (1969), R21, while additional 
leading order results in the scalar and pseudoscalar gluon cases, and some fourth 
order results, were given in the follow-up paper Adler and Tung (1970), R22. We 
concluded that the Callan-Gross relation for spin-1/2 quarks, which asserts the van¬ 
ishing of q 2 o-L(q 2 ,cu) for large q 2 with fixed scaling variable breaks down in leading 
order of perturbation theory. A similar conclusion was also reached by Jackiw and 
Preparata (1969a,b), whose first paper appears in the same issue of Physical Review 
Letters as our paper R21. Tung and I related the breakdown of the Callan-Gross 
relation to a corresponding breakdown of Bjorken’s backward neutrino sum rule. 
We also showed that the certain current commutators receive a systematic pattern 
of logarithmic asymptotic corrections, and calculated the leading perturbative cor¬ 
rection to the logarithmically divergent part of the radiative corrections to /3 decay. 
Tung (1969), while still at the Institute, and Jackiw and Preparata (1969c), went 
on to carry out general analyses of the range of validity and breakdown of the 
Bjorken-Johnson-Low limit in perturbation theory. 

These papers had a number of implications for subsequent developments. The 
logarithmic deviations from the Callan-Gross relation were soon understood in a 
more systematic way through the Wilson (1969) operator product expansion and 
the Callan (1970)-Symanzik (1970) equations, which gave anomalous dimensions in 
accord with the leading order results obtained by Tung and me and by Jackiw and 
Preparata, and with the fourth order results obtained by Tung and me in R22; for a 
discussion of this, see Beg (1975). The fact that perturbative field theory gives strong 
violations of scaling led to a skepticism as to whether field theory could describe 
the strong interactions at all. For example, Fritzsch and Gell-Mann (1971/1972), in 
their long paper on “Light Cone Current Algebra”, remarked that “The renormalized 
perturbation theory, taken term by term, reveals various pathologies in commutators 
of currents. Not only are there in each order logarithmic singularities on the light 
cone, which destroy scaling, and violations of the rule that cfl/o't 0 in the Bjorken 
limit, but also a careful perturbation theory treatment show the existence of higher 
singularities on the light cone...” This was one of their motivations for introducing 
the light cone algebra, which abstracted from field theory algebraic relations that 
led to scaling and par ton model results, with the field theory itself being discarded. 

At the same time, there were also thoughts that a renormalization group fixed 
point in field theory might provide a remedy. In the same article, Fritzsch and 
Gell-Mann noted that in the context of a singlet vector gluon theory, “we must 
imagine that the sum of perturbation theory yields the special case of a ‘finite 
vector theory’ 27 [reference to Gell-Mann and Low, and Baker and Johnson] if we 
are to bring the vector gluon theory and the basic algebra into harmony.” Quite 
independently, in a conference talk at Princeton that I gave in October of 1971 
(published considerably later as Adler (1974), R23), in Section 2.4, on “Questions 
raised by the breakdown of the BJL limit”, I made the remark “Can one make a 
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consistent calculational scheme in which Bjorken limits, the Callan-Gross relation 
and scaling are all valid? This is a real challenge to theorists...Perhaps a successful 
approach would involve summation of perturbation theory graphs plus use of the 
Gell-Mann-Low eigenvalue condition (see sect. 3).” (I made these comments at 
just the time when I was working on a possible eigenvalue condition in quantum 
electrodynamics, growing out of the work of Gell-Mann and Low, and Johnson, 
Baker, and Willey, as described below in Chapter 4. The relevance of an eigenvalue 
to power law behavior was also pointed out in the papers of Callan (1972) and of 
Christ, Hasslacher, and Mueller (1972), which I included as references when I edited 
my 1971 conference talk in the fall of 1972.) However, in the field theories then 
under consideration, there was an obstacle to realizing this idea. As I noted in Sec. 
3 of my Princeton talk, for singlet gluon theories the renormalization group methods 
suggested either no simple scaling behavior (if there were no renormalization group 
fixed point at which the /3 function had a zero), or power law deviations from scaling 
of the form ( q 2 )~ 7 (if there were a fixed point at a nonzero coupling value Ao where 
/3 vanished, with 7 the value of the anomalous dimension at the fixed point). Since 
in a strong coupling theory 7 would be expected to be large at the fixed point, power 
law deviations from scaling looked to be too large to agree with experiment. 

It took another eighteen months for this obstacle to be overcome. Three de¬ 
velopments were involved: the introduction of the modern form of “color” as a 
tripling of the fractionally charged quark degrees of freedom by Bardeen, Fritzsch, 
and Gell-Mann (1972), the non-Abelian gauging of this form of color by Fritzsch 
and Gell-Mann (1972), and finally, in line with Gell-Mann’s dictum “Nature reads 
the books of free field theory”, a search for field theories that would have almost 
free behavior in the scaling limit. The conclusion of this search, the discovery of the 
asymptotic freedom of non-Abelian gauge theories and its implications by Gross, 
Politzer, and Wilczek, in the end proved a realization of the field-theoretic route 
that been contemplated by various people in 1971. In asymptotically free theories, 
because the renormalization group fixed point (the Gell-Mann-Low eigenvalue) is 
at zero coupling, where the anomalous dimension 7 vanishes, the deviations from 
scaling are not powers of g 2 , but rather only powers of logg 2 , with exponents that 
can be calculated in leading order of perturbation theory. Thus the deviations from 
scaling predicted by non-Abelian gauge theories, and specifically by quantum chro¬ 
modynamics (QCD) as the theory of the strong interactions, are much weaker than 
would be expected for singlet gluon theories, and are compatible with experiment. 

Returning briefly to the calculations that Tung and I did, our results for the 
radiative corrections to /3-decay in the singlet vector gluon model turned out later 
to have applications in the QCD context. They can be converted to the realistic case 
of the octet gluon of QCD by multiplication by a color factor, as discussed in the 
review of Sirlin (1978), and so have become part of the technology for calculating 
radiative corrections to weak processes. 
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Trace Anomalies to All Orders 

In an influential paper Wilson (1969) proposed the operator product expansion, 
incorporating ideas on the approximate scale invariance of the strong interactions 
suggested by Mack (1968). As one of the applications of his technique, Wilson dis¬ 
cussed 7T° —> 27 decay and the axial-vector anomaly from the viewpoint of the short 
distance singularity of the coordinate space AVV three-point function. Using these 
methods, Crewther (1972) and Chanowitz and Ellis (1972) investigated the short 
distance structure of the three-point function dVyVv, with 0 = 0 ^ the trace of the 
energy-momentum tensor, and concluded that this is also anomalous, thus confirm¬ 
ing earlier indications of a perturbative trace anomaly obtained in a study of broken 
scale invariance by Coleman and Jackiw (1971). Letting A ^(p) be the momentum 
space QV^jVy three point function, and II ^ be the corresponding VyV v two-point 
function, the naive Ward identity A ^(p) = (2 — p a d/dp^^l^p) is modified to 

= ^2 — PvQp ^ n^,(p) — ~ ) > 

with the trace anomaly coefficient R given by 

R = E ^ + \ E • 

i,spin| z,spin 0 

Thus, for QED, with a single fermion of charge e, the anomaly term is 
— [ 2 a/( 37 r)](p M Pi/-? 7 M i/p 2 ). In a subsequent paper, Chanowitz and Ellis (1973) showed 
that the fourth order trace anomaly can be read off directly from the coefficient of 
the leading logarithm in the asymptotic behavior of II Ml/ (p), giving to next order an 
anomaly coefficient — 2 a/( 37 r) — a 2 /(2n 2 ). Thus, their fourth order argument indi¬ 
cated a direct connection between the trace anomaly and the renormalization group 
/3 function. 

My involvement with trace anomalies began roughly five years later, when Phys¬ 
ical Review sent me for refereeing a paper by Iwasaki (1977). In this paper, which 
noted the relevance to trace anomalies, Iwasaki proved a kinematic theorem on the 
vacuum to two photon matrix element of the trace of the energy-momentum ten¬ 
sor, that is an analog of the Sutherland-Veltman theorem for the vacuum to two 
photon matrix element of the divergence of the axial-vector current. Just as the 
latter has a kinematic zero at q 2 — 0, Iwasaki showed that the kinematic struc¬ 
ture of the vacuum to two photon matrix element of the energy-momentum tensor 
implies, when one takes the trace, that there is also a kinematic zero at q 2 — 0, 
irrespective of the presence of anomalies (just as the Sutherland-Veltman result 
holds in the presence of anomalies). Reading this article suggested the idea that just 
as the Sutherland-Veltman theorem can be used as part of an argument to prove 
nonrenormalization of the axial-vector anomaly, Iwasaki’s theorem could be used 
to analogously calculate the trace anomaly to all orders. (In addition to writing a 
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favorable report on Iwasaki’s paper, I invited him to spend a year at the IAS, which 
he did during the 1977-78 academic year.) During the spring of 1976 I wrote an 
initial preprint attempting an all orders calculation of the trace anomaly in quan¬ 
tum electrodynamics, but this had an error pointed out to me by Baqi Beg. Over 
the summer of 1976 I then collaborated with two local postdocs, John Collins (at 
Princeton) and Anthony Duncan (at the Institute), to work out a corrected ver¬ 
sion (Adler, Collins, and Duncan, 1977, R24). Collins and Duncan simultaneously 
teamed up with another Institute postdoc, Satish Joglekar, to apply similar ideas 
to quantum chromodynamics, published as Collins, Duncan, and Joglekar (1977), 
and independently the same result for QCD was obtained by N. K. Nielsen (1977). 
Similar results were given in a preprint of Minkowski (1976), which grew out of 
discussions in the Gell-Mann group at Cal Tech in which the role of the /3 function 
in the trace anomaly formula, and its implications for generating the scale of the 
strong interactions, were appreciated (C. T. Hill, private communication, 2005, and 
P. Minkowski, private communication, 2005). 

In the simpler case of QED, the argument based on Iwasaki’s theorem is given 
in Section II of R24. The basic idea is to use Iwasaki’s result for the vacuum to 
two photon matrix element of the trace of the energy momentum tensor, together 
with expressions for the electron to electron and the vacuum to two photon matrix 
elements of the “naive” trace moV’V’ given by application of the Callan-Symanzik 
equations. The final result for the trace is given by 


= [1 + S(a)]m 0 ^ + 3(a)N[F\ a F Xcr ] + ... 


with iV[ ] an explicitly defined subtracted operator, with ... indicating terms that 
vanish by the equations of motion, and with 5(a) and /3(a) the renormalization 
group functions defined by 1 + 5(a) = (m/mo)5mo/9m and /3(a) = (m/a)da/dm. 
The first two terms in the power series expansion of the coefficient of the F\ a F^ a 
term in the trace agree with the fourth-order calculation of Chanowitz and Ellis. The 
trace equation in QCD has a similar structure, again with the /3 function appearing 
as the anomaly coefficient. The fact that the trace anomaly coefficient is given by the 
appropriate /3 function extends to the supersymmetric case, and leads to interesting 
issues that are reviewed in the final section of Adler (2004a). The appearance of 
the /3 function in the anomaly coefficient has also played a role in the inference of 
the structure of effective Lagrangians from the form of the trace anomaly; see, for 
example, Pagels and Tomboulis (1978) for an application to QCD, and Veneziano 
and Yankielowicz (1982) for an application to supersymmetric Yang-Mills theory. 
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Introduction 

My interest in a detailed study of quantum electrodynamics (QED) began during 
my visit to Cambridge, U.K. in the spring of 1968, when I found the anomalous 
properties of the axial-vector triangle diagram discussed in Chapter 3. This started 
me thinking more generally about the properties of fermion loop diagrams, and 
in particular I wondered whether such diagrams in quantum electrodynamics could 
lead to an eigenvalue condition for the electric charge, possibly giving an explanation 
of why the charges of different particle species (such as the electron and proton) are 
the same in magnitude. This speculation ultimately proved to be wrong, and I look 
back on the investigations that it inspired with mixed feelings, as being somewhat of 
a misadventure. On the one hand, my work on aspects of quantum electrodynamics 
led to a number of important papers with useful results, but on the other hand, my 
preoccupation with this program kept me from jumping into the emerging area of 
Yang-Mills unification at the point when much of the interesting theoretical work 
on non-Abelian theories was being done. 

My work on QED divided into three distinct phases, described in the following 
sections. The first part dealt with a calculation of the process of photon splitting 
in strong magnetic fields, which served as a warm-up for getting into the study 
of fermion loop diagrams. After this calculation was completed, I turned to an 
investigation of the renormalization group properties of QED, using as a tool the 
newly discovered Callan-Symanzik equations. Finally, in an attempt to get a better 
formalism for calculating the renormalization group /3 function contribution from 
closed loop diagrams, I worked out a compactification of massless QED on the 4- 
sphere, and applied this formalism to a number of theoretical issues. By the end 
of this phase, it was clear that developments in non-Abelian gauge theories were 
the future of the field of particle physics and, through grand unification, offered 
a compelling way to understand charge quantization, which had been the starting 
motivation for my interest in electrodynamics. So at this point I set my QED work 
aside and moved on to some of the phenomenological investigations described in 
Chapter 5. 
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Strong Magnetic Field Electrodynamics: 

Photon Splitting and Vacuum Dielectric Constant 

The discovery of pulsars with ultra-strong trapped magnetic fields led to a surge of 
interest in strong field QED processes, that are unobservably small for attainable 
laboratory magnetic fields. One of the processes of interest is photon splitting in 
a constant magnetic field, which is described by a closed electron loop Feynman 
diagram. When conversations at the Institute turned to whether this reaction could 
be of relevance in the dynamics of pulsar magnetospheres, my interest in getting 
into a general study of fermion loop processes in QED made it natural for me to 
get involved. The initial phase of this study led to a paper (Adler, Bahcall, Callan, 
and Rosenbluth, 1970, R25), that surveyed the basic features of the photon splitting 
process. Briefly, the lowest order box diagram makes a vanishing contribution, by 
an argument using Lorentz invariance and gauge invariance, and so the leading 
contribution comes from the hexagon diagram, with three insertions of the external 
magnetic field. (Earlier calculations had overlooked this fact, and so led to the 
wrong dependence on magnetic field strength.) Using the Heisenberg-Euler effective 
Lagrangian, we calculated the photon splitting absorption coefficients for the various 
photon polarization states relative to the magnetic field vector, to leading order 
in the external magnetic field, for photon energies small relative to the electron 
mass. We also gave the selection rules that result from the fact that the dielectric 
constant for the vacuum permeated by a strong magnetic field is different for the 
different photon polarizations (this was where Marshall Rosenbluth’s expertise as a 
plasma physicist entered in), and made numerical estimates. Some of our results were 
independently obtained around the same time by Bialynicka-Birula and Bialynicki- 
Birula (1970). 

Again with the aim of getting more experience with QED calculations, I decided 
to embark on an exact calculation of photon splitting, for arbitrary magnetic fields 
and for arbitrary photon energies below the pair production threshold. This involved 
a very lengthy calculation using the proper time method, that Schwinger had first 
used (Schwinger, 1951) to give an elegant rederivation of the Heisenberg-Euler effec¬ 
tive Lagrangian. I derived general formulas for both the photon splitting amplitudes, 
and the refractive indices needed for the selection rules (in the latter case correcting 
an earlier result of Minguzzi (1956,1958a,1958b)). I wrote a computer program to 
numerically evaluate the photon splitting absorption rates, and computed sample 
results, as well as giving a detailed discussion of possible plasma physics corrections 
to the selection rules. These results were all reported in a comprehensive article 
(Adler, 1971, R26) on photon splitting and dispersion in a strong magnetic field. 

My overall conclusion was that the leading order calculation from the hexagon 
diagram gives good order of magnitude estimates, as graphed in Fig. 8 of R26, which 
plots the ratio of the exact photon splitting absorption coefficient to the hexagon 
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diagram prediction, versus magnetic field, for photon frequencies equal to zero and 
equal to the electron mass m. This plot, incidentally, gives a check both on my 
exact analytic calculation and the numerical work, since the ratio approaches unity 
for small field strengths, where the hexagon dominates. For magnetic fields of order 
the “critical field” Bqr = m 2 /e~ 4.41 x 10 9 Tesla (4.41 x 10 13 Gauss), and photon 
frequencies of order the electron mass m, the photon splitting mean free path is 
much shorter than characteristic pulsar magnetosphere depths. However, since the 
absorption coefficients scale as B 6 for small fields, and since the pulsars known 
in 1971 tended to have fields of up to a few tenths of Bcr , the photon splitting 
process at that time seemed to be not of great astrophysical importance. Stoneham 
(1979) published an analytic recalculation of photon splitting by a different method 
(without numerical evaluation), which as we shall see agreed with my calculation. 
In an Appendix to his paper, he also improved on my estimate of the very small 
corrections that arise from the box diagram, when finite opening angles resulting 
from photon dispersion are taken into account, and we exchanged letters on this 
aspect of his work. However, after Stoneham’s paper, interest in photon splitting 
waned for quite a number of years. 

In the mid 1990’s, the discovery of “magnetars”, pulsars with fields much higher 
than the critical field, revived interest in photon splitting. Around April, 1995, John 
Bahcall told me that recent papers by Mentzel, Berg, and Wunner (1994) and Wun- 
ner, Sang, and Berg (1995) claimed that the photon splitting absorption coefficients 
for energetic photons in strong fields were a factor of 10 4 higher than given in my 
1971 paper. If true, this would have had important astrophysical ramifications, so I 
looked back at my own work, and at the papers of the Wunner group. I was struck 
by the fact that the Wunner group had not checked to see whether their calculation 
reproduced the known B 6 dependence of photon splitting for weak fields and low 
energy photons, a consistency test that, as noted above, I had incorporated into 
my analytic and numerical work. So I strongly suspected that they had made an 
error, possibly through a lack of gauge invariance, and wrote a letter to this effect 
to the Wunner group, while John simultaneously wrote to Astrophysical Journal 
Letters , where their second paper was being considered for publication. Neither of 
these letters had any effect, and the Wunner, Sang, and Berg paper was published 
in December, 1995. John Bahcall and Bohdan Paczynski then urged me to make 
my private misgivings known more publicly. In response, I wrote a short IAS As¬ 
trophysics Preprint Series article in January, 1996 (Adler, 1996), expanding on my 
letter to the Wunner group, and concluding “it is important that their calculation 
and mine be rechecked by a third party, with the aim of understanding where the 
discrepancy arises and determining who is right.” I submitted this note to the As¬ 
trophysical Journal , which rejected it. 

Although this short note was never published, it had the intended effect as 
a result of its internal circulation within the IAS. Not long afterwards Christian 
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Schubert, an IAS visitor at the time, came to my office and said that with new 
“stringy” Feynman rules with which he was expert, he thought he could repeat in 
a few days the calculation that had taken me a couple of months by the proper 
time method. I replied that if he could do that, I would deal with the numerical 
aspects. A week or two later Christian gave me two equivalent formulas for the 
photon splitting amplitude obtained by his methods; in the meantime, the Russian 
group of Baier, Milstein, and Shaisultanov (1996) had produced yet another calcu¬ 
lation, which agreed numerically with my 1971 paper. During a short visit to the 
Institute for Theoretical Physics in Santa Barbara, I wrote programs to directly 
compare Schubert’s two expressions, my 1971 result, Stoneham’s 1979 formula, and 
the analytic formula of the Russian group, all as applied to the allowed polarization 
case. (The reason for doing this numerically is that an analytic conversion between 
inequivalent Feynman parameterizations is very difficult, because zero can be writ¬ 
ten as a multidimensional integral in complicated ways.) The programs showed that 
the five calculations gave precisely identical amplitudes. This was reported in the 
paper that I drafted with Schubert on my return to the IAS (Adler and Schubert, 
1996, R27). We also posted my computer programs on my web site, and advertised 
this posting in the paper, so that the community at large could verify what we had 
done. About a month later, I received an email from Wunner retracting the earlier 
numerical results of his group, which turned out to result from a single sign error in 
their computer programs. When this sign error was corrected, the analytic results of 
Mentzel, Berg and Wunner gave answers that agreed with everyone else, as discussed 
in Wilke and Wunner (1997). Thus the photon splitting controversy was finally re¬ 
solved. Subsequently, John Bahcall had me assemble a file of all the relevant papers 
and correspondence for a post-mortem meeting that he held with the editors of the 
Astrophysical Journal , to analyze and improve the process that that had allowed an 
incorrect paper to get into print, despite several advance warnings that the results 
were suspect. 


The “Finite QED” Program via the Callan-Symanzik Equations 

My comprehensive article on photon splitting was finished in early 1971, and the 
following summer I returned to my long-standing interest in a study of unresolved 
issues in the theory of quantum electrodynamics. Johnson, Baker, and Willey (1964), 
Johnson, Willey, and Baker (1967), and Baker and Johnson (1969, 1971a,b) had 
written an important series of papers (referred to below as JBW) in which they 
argued that if QED has a Gell-Mann-Low eigenvalue, then the asymptotic behavior 
of both the electron and photon propagators would drastically simplify, with the 
mass term in the electron propagator having power law scaling behavior, and the 
asymptotic photon propagator behaving, after charge renormalization, as if it had 
no photon self-energy part. Bill Bardeen and I were both in Aspen for part of the 
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summer of 1971, and we embarked on a study of QED using the then very new Callan 
(1970)-Symanzik (1970) equations. Rather than addressing the issue of a possible 
eigenvalue in QED, we studied the simplified model suggested by the presence of 
such an eigenvalue, in which the photon propagator is taken as a free propagator 
with no self-energy part. In this case the /3 function term, which has a coupling 
constant derivative, is not present in the Callan-Symanzik equations, and these 
equations then can be explicitly integrated to give the simple form for the electron 
propagator found by JBW. These results were described in the paper Adler and 
Bardeen (1971), R28. In addition to giving results of interest for QED, this paper 
was one of the first applications of the Callan-Symanzik equations, and was also 
a motivation for my remarks at the Princeton conference later in 1971 (see R23), 
in which I suggested a possible connection between an eigenvalue condition in the 
strong interactions and Bjorken scaling. 

After finishing the paper with Bardeen, I turned to a detailed study of the full 
theory of QED, with photon self-energy parts retained, on which I wrote a com¬ 
prehensive paper Adler (1972a), R29. This paper had a number of new results. I 
began with a review of the original Gell-Mann-Low formulation of the renormal¬ 
ization group in QED, and then redid their analysis in terms of the more modern 
Callan-Symanzik approach, ending up in Eq. (53) with the explicit map between 
the Callan-Symanzik /3(a) function and the functions -0(a) and q(a) that enter into 
the Gell-Mann-Low formulation. (An implicit form of this map had appeared in 
Sec. II.3 of Symanzik (1970).) After reviewing the JBW program and the results 
obtained with Bardeen in R28, I showed by an argument based on the Federbush- 
Johnson (1960) theorem that if there is an eigenvalue in QED, then in the massless 
limit all 2n-point current correlation functions must vanish at the eigenvalue. I then 
went on to show, in an argument that benefited from a conversation with Roger 
Dashen, that the vanishing of higher correlation functions also implied the vanish¬ 
ing of all coupling constant derivatives of the photon proper self-energy part at the 
eigenvalue; hence the eigenvalue, if it existed, must be an infinite order zero of the 
one-loop /3 function. These were all correct results that give the paper an enduring 
value. 

I concluded the paper by proposing that in addition to the standard renormal¬ 
ization group result, in which the eigenvalue plays the role (through running of 
the coupling) of the unrenormalized fine structure constant ao, there could be an 
additional solution, resulting from a fermion-loopwise summation of the theory, in 
which the eigenvalue plays the role of the physical coupling a. A motivation for this 
proposal was that the formal power series argument, which shows the equivalence of 
loopwise summation to the usual renormalization group analysis, could break down 
in the presence of an essential singularity in the coupling. I then went on to conjec¬ 
ture that loopwise summation with an eigenvalue for a was the mechanism fixing 
the physical fine structure constant in a uniform manner for all fermion species. 
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As I have noted in the Introduction to this Chapter, this conjecture turned out to 
be wrong, and in retrospect my excessive emphasis on it in writing R29 distorted 
the presentation of an otherwise good paper. At the time key people working on 
the renormalization group, in particular Gell-Mann, Low, and Wilson, were all very 
skeptical. Wilson, in particular, remarked at a Princeton seminar that my demon¬ 
stration of an infinite order zero showed there could be no eigenvalue in QED, and 
although I was privately annoyed at the time, it is now clear that this was the correct 
conclusion. 

Finally, in an Appendix to my paper, I returned to the electron propagator 
analysis carried out in R28, this time in a general covariant gauge. This investigation 
was later reanalyzed in more detail, and improved, in a comprehensive study by 
Lautrup (1976). 

The final paper in this section, Adler, Callan, Gross, and Jackiw (1972), R30, 
studied the combined implications of the BJL limit, the nonrenormalization of 
anomalies, and the possible presence of an eigenvalue in QED. This paper, which 
was initially drafted by Roman Jackiw, grew out of discussions among the authors 
at Princeton and at the National Accelerator Laboratory. It shows that the following 
three phenomena are, when taken in combination, incompatible: (1) nonrenormal¬ 
ization of the axial-vector anomaly, (2) the existence of an eigenvalue in QED, (3) 
validity of naive scale invariant short-distance expansions involving the axial-vector 
current at the eigenvalue. Since the finite QED program was intended to eliminate 
the pathologies of QED, through presence of an eigenvalue, this showed that its aims 
could not be attained, and again cast strong doubt on the existence of an eigenvalue 
in QED. For later work coming to the same conclusion, and references to more re¬ 
cent literature, see Baker and Johnson (1979), and Acharya and Narayana Swamy 
(1997). On rereading R30 now, it occurs to me that the argument establishing a 
relation between the axial-vector anomaly and the Schwinger term given in Section 
III may be extendable to show that the vanishing of anomalies in axial-vector loop 
diagrams coupling to four or more photons in QED implies, through similar use of 
a BJL limit, that the Schwinger term in the two-point function is a c-number. As 
noted in Chapter 3, this is a result that I was unable to prove, before the advent of 
the theory of anomalies, in 1966. For another approach to constraining the structure 
of the Schwinger term, see Jackiw, Van Royen, and West (1970). 


Compactification of Massless QED and Applications 

The fact that the eigenvalue condition for QED can be studied in the conformally 
invariant, massless electron theory, led me to study remappings of the Feynman rules 
for QED that make use of conformal invariance. In Adler (1972b), R31, I showed 
that the equations of motion and Feynman rules for massless Euclidean QED can 
be written in terms of equivalent equations of motion and Feynman rules expressed 
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in terms of coordinates that are confined to the surface of a unit hypersphere in 
5-dimensional space (a four-sphere in mathematical terms). For example, letting rj a 
be the coordinate on the sphere (where a runs from 1 to 5, and (rf ) 2 — l), the 
usual four-vector potential is replaced by a five-vector A a obeying the constraint 
(with repeated indices summed) rf a A a = 0, and the electromagnetic field strength 
is replaced by a three-index tensor F a b c = L a bA c + L& c A a + L ca Ab , with L a b the 

A 

5-space rotation generators. This tensor has a two-index dual F a b , and the Maxwell 
equations become L a bF a bc = 2 eJ c , L a bFb c = F ac . The corresponding 0(5) covariant 
Feynman rules are given in Table I of R31. The result of this transformation of the 
theory is an explicit demonstration that massless QED can be compactified, so 
that there are only ultraviolet divergences (corresponding to points approaching 
each other on the surface of the sphere, where it becomes tangent to Euclidean 
4-space), but no infrared divergences. The 0(5) rules, however, are not manifestly 
conformal invariant; in a later section of the paper I showed that they are related, by 
a projective transformation, to a manifestly conformal invariant (but non-compact) 
0(5,1) formalism that was introduced earlier by Dirac (1936). 

In two subsequent papers I further developed and applied the 0(5) covariant 
formalism. In Adler (1973) I showed that the usual Feynman path integral takes 
the form of an amplitude integral, constructed as an infinite product of individu¬ 
ally well-defined ordinary integrals over coefficients appearing in the hyperspherical 
harmonic expansion of the electromagnetic potential A a ■ In the paper Adler (1974), 
R32, I used the amplitude integral formalism to study a simple model, in which only 
a single photon mode of the form A a oc vi a rj ■ — V2 a r n‘ v i, with orthogonal unit 

vectors, is retained. The external field Fredholm determinant or vacuum persistence 
amplitude A(eA) — det (^7 ■ d + ej ■ A) could then be studied by exploiting the 
0(3) x 0(2) residual symmetry of this model, which permits the external field prob¬ 
lem to be reduced to a set of two coupled first order ordinary differential equations, 
with a Wronskian equal to the Fredholm determinant. A significant result coming 
out of the analysis of this model was that the renormalized Fredholm determinant 
is an entire function of order four as eA becomes infinite in a general complex di¬ 
rection. This played a role in a subsequent discussion of asymptotic estimates in 
perturbative QED, as discussed in the paper of Balian, Itzykson, Zuber, and Parisi 
(1978), which followed up on an earlier paper of Itzykson, Parisi, and Zuber (1977). 
Whereas extrapolation from the solvable case of a constant field strength F^ sug¬ 
gested that the order of the Fredholm determinant is two, my solvable example 
showed that two cannot be the correct answer for general vector potentials. Balian 
et al. noted this and then went on to present further arguments for the determinant 
being of order four in four-dimensional spacetime, or more generally of order D in 
D-dimensional spacetime. This in turn had important implications for their study 
of asymptotic behavior of the perturbation series in QED. The subject of the or¬ 
der of the Fredholm determinant was further developed by Bogomolny. In an initial 


62 


Adventures in Theoretical Physics 


paper by Bogomolny and Fateyev (1978), the case of fields with an 0(3) x 0(2) 
symmetry group that I had initiated in R32 was taken up again, and an asymptotic 
formula for the Fredholm determinant was obtained. In a subsequent paper, Bogo¬ 
molny (1979) showed that this asymptotic formula, and a similar formula obtained 
by Balian et al. for another special case, could be extended to the general result 
linic-^ioo A(eA) = (e 4 /127r 2 ) f d 4 x((A^ l ) 2 ') 2 , provided A M is chosen to obey the non¬ 
linear gauge condition d^A^A 2 ) = 0. Thus the order four result that I found in my 
“one-mode” model in fact gave the correct general answer for QED. 

A further application of the 0(5) formalism for QED emerged after the discov¬ 
ery of the instanton solution to the Yang-Mills field equations. Jackiw and Rebbi 
(1976) showed that the one-instanton solution is invariant under an 0(5) subgroup 
of the full conformal group, and hence can be rewritten in an elegant way in terms of 
the 0(5) formulation of electrodynamics, as extended to non-Abelian gauge fields. 
Letting a a be the 0(5) equivalent of the Dirac 7 matrices, and 7 ^ = (i/4)[a a ,a&], 
a matrix-valued vector potential A a obeying the constraint 77 ■ A = 0 can be im¬ 
mediately constructed as A a = Cry, 7 ^. Jackiw and Rebbi showed that when this 
vector potential is substituted into the Yang-Mills field equation as expressed in 
the non-Abelian extension of the 0(5) formalism, one gets a cubic equation for the 
coefficient O, two roots of which give pure gauge potentials with vanishing field 
strengths, but the third root of which gives the instanton! Thus, I had missed a sig¬ 
nificant opportunity in not pursuing the question, raised at least once when I gave 
seminars, of what the non-Abelian generalization of the 0(5) formalism was like. 
A variant of the non-Abelian 0(5) formalism was subsequently applied by Belavin 
and Polyakov (1977), with corrections by Ore (1977), to give a recalculation of the 
Fredholm determinant in an instanton background that was first computed by J t 
Hooft (1976). 
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5. Particle Phenomenology and Neutral Currents 


Introduction 

Much of the work described in Chapters 2 and 3 on soft pion theorems, sum rules, 
anomalies, and neutrino reactions falls in the category of phenomenology, but both 
the interrelations between different aspects of this research, and the chronology, sug¬ 
gested that it be discussed earlier. Even before this work was done, I wrote my first 
particle phenomenology paper in collaboration with my first year Princeton gradu¬ 
ate school roommate, and former Harvard classmate, Alfred Goldhaber (Adler and 
Goldhaber, 1963). In this paper we analyzed the possibility of using the deuteron to 
provide a polarized proton target, by determining the polarization of the recoiling 
spectator neutron through its scattering on He 4 . Although perhaps feasible, this 
proposal was never implemented, and much better methods for directly obtaining 
polarized targets are now available. After I completed the work on quantum elec¬ 
trodynamics described in Chapter 4, I returned to phenomenology in a number of 
papers written, or conceived, during visits to the National Accelerator Laboratory 
(subsequently renamed the Fermi National Accelerator Laboratory, or Fermilab), 
and continued with related work in a number of papers written at the IAS. I dis¬ 
cuss the earlier work done at Fermilab in the first section that follows, and then in 
the second section take up work at both Fermilab and the IAS relating to neutral 
currents. 


Visits to Fermilab 

When the National Accelerator Laboratory was inaugurated, my former thesis ad¬ 
visor Sam Treiman was brought in, on a succession of leaves from Princeton starting 
in 1970, to serve as temporary head of the Theory Group, with the charge of set¬ 
ting it up and recruiting a permanent head. Subsequently, Ben Lee was hired to be 
the permanent head of the Theory Group. During this period many theorists from 
outside institutions were invited to be term time and/or summer visitors, and as 
part of this program I made a series of visits to Fermilab, and wrote a number of 
phenomenological papers growing out of discussions with people there. 

As already noted in Chapter 3, during a 1971 visit to Fermilab I collaborated 
with Lee, Treiman, and Tony Zee to study the anomaly-based prediction for the pro¬ 
cess 77 —> 37 t, described in the paper R20. This was applied in a subsequent paper 
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that I wrote with Glennys Farrar and Treiman (Adler, Farrar, and Treiman, 1972, 
R33) to an analysis of the contribution of three pion intermediate states to the rare 
kaon decay Kl —> The background for this study was what was then called 

the “Kl —> /x + /i“ puzzle”, the fact that experiment had not detected this kaon 
decay mode at a level considerably below that given by a unitarity bound based on 
the assumed dominance of a two photon intermediate state in the absorptive part 
of the decay amplitude. There were thus two possibilities, either an experimental 
problem, or destructive interference with another intermediate state, for which the 
three pion intermediate state was a prime candidate. Aviv and Sawyer (1971) had 
proposed to use soft pion methods to estimate the three pion contribution, and had 
concluded that the contribution was much too small to be relevant. However, the 
Aviv-Sawyer analysis used an expression for the 37r —> 77 amplitude which had 
been shown in R20 to be incorrect. In R33, we estimated the three pion contribution 
by using the corrected 3n —> 77 amplitude calculated in R20, but still found that 
it gave much too small a contribution to explain the lack of observed Kl —> /x + /i _ 
events. Similar conclusions, again using the results of R20, were reached indepen¬ 
dently by Pratap, Smith, and Uy (1972). Ultimately, the origin of the “ Kl —> /x + /i“ 
puzzle” turned out to be experimental, and this decay mode has now been seen in 
a number of experiments, with the Particle Data Group giving an average value for 
r(M + /i“)/rTOT of ~ 7.2 x 10“ 9 , as compared with the theoretical unitarity lower 
bound of 7.0 x 10 “ 9 based on the current Kl ^ 77 branching ratio. 

During the years 1973-1974, my Fermilab visits led to papers in two separate 
areas, searches for neutral currents in weak pion production, and the analysis of 
what was then a discrepancy between theory and experiment in /i-mesic atom x-ray 
spectra. I will take up this second area first, because the neutral current work leads 
directly into the papers discussed in the next section. My interest in the /i-mesic 
atom discrepancy was stimulated by my earlier work on quantum electrodynamics, 
since an eigenvalue in QED could show up as deviations from the standard pertur¬ 
bation theory predictions for vacuum polarization effects. Thinking about tests for 
vacuum polarization discrepancies in QED led me to think more generally about 
other aspects of vacuum polarization, in particular the predictions for the ratio 
R(s) = cr(e~*~e~ —> hadrons; s) /a(e+e~ —> / i + / i “; s ) in various models for quark 
structure of hadrons. This offshoot of the QED work led to results that are still 
used today, introduced in the paper Adler (1974a), R34, dealing with “Some simple 
vacuum-polarization phenomenology...”. My basic observation was that whereas R 
is measured in the timelike region, the natural place to compare experiment with 
scaling predictions of various theories is in the spacelike region, where (since there 
are no threshold effects) one might expect an early or “precocious” onset of scaling. 
Rather than directly using the dispersion relation for the vacuum polarization part 
to calculate the spacelike continuation, I proposed using its first derivative, and so 
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defined a function 


f°° duR(u) 
4mJ (* + U) 2 


This function is the one for which parton models and QCD most directly make pre¬ 
dictions, and since it is positive definite and involves a strongly convergent integral 
(for R approaching a constant), the experimentally inaccessible high energy tail has 
a known sign and a magnitude that can be bounded. For a parton model in which 
R asymptotically approaches a constant (7, one has T(—s) ~ C/s as s — > 00 , and a 
similar formula holds in QCD with a known logarithmic correction. The paper R34 
used the function T(— 5 ) to propose a test of the colored quark hypothesis. Subse¬ 
quently, De Rujula and Georgi (1976) used a modified version of this idea, defining 
D(s) = sT(— s), to analyze the new SPEAR data. They found that the original 
colored quark model was excluded, and among various viable possibilities, noted 
that “the standard model with charm is acceptable if heavy leptons are produced,” 
a conclusion that was borne out by experiment with the subsequent discovery of 
the r lepton. Shortly afterwards, Poggio, Quinn, and Weinberg (1976) proposed a 
generalized method in which the derivative of the hadronic vacuum polarization 
that I had used is replaced by a finite difference between the hadronic vacuum po¬ 
larization values at points a distance ±iA from the timelike real axis, leading to 
a “smeared” average of R(s) that retains sensitivity to threshold effects. Recently, 
my original method, generally in the form D(s) used by De Rujula and Georgi, has 
been revived under the name of the “Adler function”, in a number of papers; see, 
for example, Broadhurst and Kataev (1993), Kataev (1996); Peris, Perrottet, and de 
Rafael (1998); Beneke (1999); Eidelman, Jegerlehner, Kataev, and Veretin (1999); 
Kataev (1999); Cvetic, Lee, and Schmidt (2001); Cvetic, Dib, Lee, and Schmidt 
(2001); Milton, Solovtsov, and Solovtsova (2001); and Dorokhov (2004). 

In the second part of R34 I examined what was then a discrepancy between theory 
and experiment in /i-mesic atom x-ray transition energies, under the assumption 
that (if real) the discrepancy arose from a nonperturbative correction 5p to the 
vacuum polarization absorptive part. Assuming that 5p is positive, or positive and 
monotonic, I derived lower bounds on the corresponding deviation that would be 
expected in — 2). For instance, if 5p is assumed positive and monotonic, 

comparison of the kernels that weight p in the formulas for the x-ray transition 
energies and for gives the bound 5a M < —(0.98 ± 0.18) x 10 -7 . In a follow-up 
paper with Roger Dashen and Sam Treiman (Adler, Dashen, and Treiman, 1974) 
we discussed other tests for a nonperturbative vacuum polarization contribution, 
and also placed bounds on the mass of a light scalar meson that could be invoked 
to explain the x-ray discrepancy. A few months later, Barbieri (1975) extended 
the method of R34 to show that precision measurements of the (/i 4 He)"*" system 
were already at variance, within the vacuum polarization deviation or scalar meson 
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exchange hypotheses, with the supposed x-ray discrepancy. A later paper of Barbieri 
and Ericson (1975) gave additional evidence against the scalar meson explanation 
for the x-ray discrepancy. In the meantime, during 1975 and the few years following, 
there were a number of experimental developments, reviewed in detail in Borie and 
Rinker (1982), as a result of which the muonic x-ray discrepancy was eliminated. 
Incidentally, the current theoretical and experimental values of a M differ by a few 
parts in 10 —9 , well below the lower bounds on 5a M inferred in R34 from the /i-mesic 
atom x-ray data at that time, giving an additional indication that that the purported 
x-ray discrepancy was an experimental artifact. 


Neutral Currents 

The existence of weak neutral currents is a principal prediction of the Glashow- 
Weinberg-Salam electroweak theory, and commanded much attention in the 1970s. 
Failure to find weak neutral currents would have falsified the electroweak theory, 
and on the other hand, detection of weak neutral currents would give a value for 
the electroweak mixing angle Qw> which in turn determines the masses of the heavy 
intermediate bosons of the theory. As a result of my thesis work on weak pion 
production, it was natural for me to get interested in theoretical estimates of the 
neutral current weak pion production channels u + Ni —> u + n + Nf , with Nij a 
nucleon (either a neutron or proton) and with n a pion of appropriate charge. In 
July 1972, a collaboration with Wonyong Lee as spokesman proposed a study of 
weak neutral currents in both the purely leptonic and the pion production channels 
at the Brookhaven AGS accelerator, and a copy of their proposal is in my files. 
Through this, and through related correspondence of Ben Lee with Sam Treiman, I 
got interested in doing detailed calculations for this process, and over the next few 
years was in frequent touch with the experimental group for which Wonyong Lee 
was spokesman. 

My initial papers were motivated by the fact that preliminary estimates of neu¬ 
tral current weak pion production by Ben Lee (1972) appeared to conflict with 
experiments in complex nuclei reported by Wonyong Lee (1972), subject to two 
caveats. The first caveat was that Ben Lee’s static model estimates didn’t include 
1=1/2 contributions to weak pion production, and the second caveat was that nu¬ 
clear charge exchange corrections could be important, as noted by Perkins (1972). 
The first of these issues was dealt with in a short paper Adler (1974b), R35, where I 
used my model of R15, as adapted to the neutral current case, to estimate the effects 
of including the nonresonant isospin 1/2 channels, and concluded that they had lit¬ 
tle effect on Ben Lee’s estimate from the dominant isospin 3/2 channel. The second 
issue was dealt with in a paper on nuclear charge exchange corrections to pion pro¬ 
duction in the A(1232) region, that I wrote in collaboration with Shmuel Nussinov 
and Emmanuel Paschos (Adler, Nussinov, and Paschos, 1974, R36). In this paper, 
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we estimated the effects of multiple charge exchange scattering on pion production 
in nuclear targets, using an extension of techniques used by Fermi and others to 
calculate multiple neutron scattering in the early days of neutron physics. A con¬ 
siderable part of the fun of writing this paper was learning about this older work on 
neutron physics, and feeling a sense of continuity between current concerns of weak 
interaction physics and the quite differently motivated work of an earlier generation. 
In addition to giving analytic formulas, we tabulated various results for the case of a 
13 AI target, as appropriate to experiments with aluminum spark chamber plates. 
In R36, we made the simplifying assumption of an isotopically neutral target (that 
is, equal numbers of neutrons and protons), which is exact for qC 12 , and a good 
approximation for aluminum. In a follow up paper (Adler, 1974c), I extended the 
model to nuclear targets with a neutron excess. As can be seen from Table II of R36, 
charge exchange corrections are sizable, and in our model typically reduce the ratio 
of neutral current to charged current n° production by about 40%. 

My next paper on neutral currents was motivated by the fact that preliminary 
results of an experiment on weak pion production in hydrogen at Argonne National 
Laboratory showed a cluster of neutral current events just above threshold. In this 
kinematic regime soft pion methods should apply, allowing one to relate threshold 
neutral current weak pion production in the standard electroweak theory to the 
elastic neutral current cross section for v +p —> v + p. Using this relation, I showed 
in Adler (1974d), R37 that one could place bounds on the expected number of 
neutral current pion production events in the threshold region, with the Argonne 
results exceeding these bounds. Thus, there seemed to be stronger neutral current 
weak pion production than suggested by the SU(2) x U( 1) electroweak theory. 

Subsequent events then proceeded on several parallel tracks. In a follow-up paper 
to R37, published as Adler (1975a), R38, I used the full apparatus of my weak pion 
production calculation of R15 to extend the neutral current calculation above the 
threshold region to include the regime where A(1232) production dominates. This 
analysis reinforced the conclusions about the preliminary Argonne data already 
reached in R37. Simultaneously, with a large group of postdocs at the Institute, I 
embarked on a study of weak pion production in alternative models of neutral cur¬ 
rents with scalar, pseudoscalar, and tensor currents, and also with so-called “second 
class” (abnormal G-parity) currents. Additionally, in Adler, Karliner, Lieberman, 
Ng, and Tsao (1976), we did a detailed study of isospin-1/2 resonance production 
by V, A neutral currents. Perhaps the one part of the group effort on alternative 
current structures to have lasting value was a calculation of nucleon to nucleon and 
pion to pion matrix elements of scalar, pseudoscalar, and tensor current densities, 
using all the theoretical tools then at our disposal: flavor SUz and chiral SUz x SUz 
symmetries, the quark model, and the MIT “bag” model. The results of these cal¬ 
culations were checked by several of us, and tabulated in Adler, Colglazier, Healy, 
Karliner, Lieberman, Ng, and Tsao (1975), R39; they were subsequently relevant for 


70 


Adventures in Theoretical Physics 


estimates of the coupling to nucleons of hypothetical scalar and pseudoscalar par¬ 
ticles, such as axions. The main part of the group effort was a current algebra soft 
pion production calculation for the alternative current case, which involved exten¬ 
sive algebra and computer work. From this, we found that one could explain roughly 
half of the reported Argonne threshold events with currents of scalar, pseudoscalar, 
and tensor type, by allowing some deviations from the matrix element estimates of 
R39, as I reported at the January, 1975 Coral Gables Conference (Adler, 1975b). 
In the meantime, the Argonne group reexamined possible background problems af¬ 
fecting their preliminary results, with the result that they ultimately discounted the 
cluster of pion production events near threshold. So by September of 1975, when I 
reviewed the subject of gauge theories and neutrino interactions at a conference at 
Northeastern University (Adler, 1976a), the electroweak theory predictions for neu¬ 
tral current weak pion production, following from purely V and A currents, were no 
longer in conflict with experiment. This conclusion was reinforced by a subsequent 
detailed analysis by Monsay (1978) of neutral current weak pion production, using 
my model together with the charge exchange corrections of R36. 

In the summer of 1975 I lectured on neutrino interactions and neutral currents 
at the Sixth Hawaii Topical Conference on Particle Physics, and gave a compre¬ 
hensive survey of neutral current phenomenology based on parton model methods, 
soft pion theorems, and quark model calculations of baryon static properties. This 
appeared both in the conference proceedings (Adler, 1976b) and again in a tenth 
year anniversary volume selecting highlights from the preceding summer schools 
(Pakvasa and Tuan, 1982). My hope in preparing the 1975 lectures was that survey¬ 
ing all available tools would hasten the day when one could determine electroweak 
parameters based on using all available data for a global fit, instead of doing piece¬ 
meal fits channel-by-channel. Such a global fit was carried out a few years later 
by Abbott and Barnett (1978a,b), who included four types of data: deep inelastic 
neutrino scattering uN —> uX , elastic neutrino-proton scattering up —> z/p, neu¬ 
trino induced inclusive pion production uN —> unX, and neutrino induced exclusive 
pion production uN —> unN. For the exclusive pion process, they employed my 
weak pion production calculation of R15 as extended to neutral currents in R38, 
using test data that I ran for them from my programs as benchmarks to help debug 
their programming. Their results were, in the words of their letter Abstract, “for 
the first time, a unique determination of the weak neutral-current couplings of u 
and d quarks. Data for exclusive pion production are a crucial new input in this 
analysis. 55 Their multi-channel fit gave the first full confirmation that the Glashow- 
Weinberg-Salam model, with sin 2 0w between 0.22 and 0.30, was in agreement with 
the experimental up and down quark neutral current coupling parameters. To me, 
the Abbott-Barnett analysis was valued recompense for the several years of hard 
calculation and scholarly attention to detail that I had put into the subject of weak 
pion production. 
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6. Gravitation 


Introduction 

During the first half of the 1970’s, I started to get interested in learning more about 
gravitational physics. When I was a graduate student at Princeton in the early 
1960’s, particle physics and gravitational physics were quite separate subjects, with 
the former the domain of Goldberger and Treiman, and the latter the domain of 
Wheeler and Dicke, to mention just a few key faculty members. Under the un¬ 
structured system at Princeton, I never took a course in gravitation, and for my 
general exam got by with the introduction to general relativity that I obtained by 
reading the text of Peter Bergmann (1942), as well as reading some of the original 
Einstein papers reprinted in a Dover edition. (Working through the Dover volume 
was a project of an informal reading and discussion group during my senior year 
at Harvard, organized by Norval Fortson, an experimental physics graduate student 
affiliated with the residential house where I lived then.) However, in the 1970’s it 
became clear both that many new results had been obtained in general relativity, 
so that my undergraduate knowledge was out-of-date, and that general relativity 
was becoming part of the essential tool kit of people working in quantum field the¬ 
ory. Among the things that convinced me of this were reading the thesis of Stephen 
Fulling (1972) on scalar quantum field theory in de Sitter space, while I was working 
on the 0(5) formulation of QED, the work of’t Hooft and Veltman (1974) and Deser 
(1975) on one-loop divergences of quantum gravity, and the availability of the new 
books on gravitation of Weinberg (1972), Misner, Thorne, and Wheeler (1973), and 
Hawking and Ellis (1973). 

My intention in writing my comprehensive Hawaii lectures in the summer of 
1975 was to wind up my involvement with neutrino physics, so that I could turn 
to something new. Since in 1976 I was due for a sabbatical, and my family did not 
want to travel away from Princeton, I decided that to learn relativity I would take a 
“reverse sabbatical”, by going to Princeton University to teach the relativity course 
for a year. So I spent my evenings during the 1975-1976 academic year reading the 
texts of Weinberg and of Misner, Thorne, and Wheeler, and then took my sabbatical 
during the 1976-1977 academic year, teaching both the fall term course in Special 
Relativity and the spring term continuation course in General Relativity. I also was 
the faculty advisor for John David Crawford, who did a senior thesis on experimental 
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tests for curvature squared additions to the gravitational action. With this reading 
and teaching as background, I embarked on a number of relativity-related research 
projects, described in the next two sections. 


First Papers 

My first papers on gravity were the working out of a very speculative idea, that grav¬ 
itation might be a composite phenomenon, with the gravitational fields arising as 
composite “pairing” amplitudes of photons in analogy with the energy gap order pa¬ 
rameter for superconductivity. In the paper Adler, Lieberman, Ng, and Tsao (1976), 
we looked for weak coupling singularities in the electromagnetic photon-photon lad¬ 
der graph sum in a conformally fiat spacetime, and found some resemblances to 
the helicity structure of graviton exchange amplitudes. In a follow-up paper (Adler, 
1976) I gave a linearized Hartree formulation for the photon pairing problem in a 
general background metric. I was never able to establish a detailed connection be¬ 
tween photon pairing amplitudes and graviton couplings in the general case, and the 
fact that no weak coupling singularities occurred in fiat spacetime meant that one 
could not establish a connection with the standard results of linearized general rela¬ 
tivity. In retrospect, the absence of pairing effects in fiat spacetime could have been 
expected from a subsequent theorem of Weinberg and Witten (1980), that ruled 
out spin-2 composites under quite general assumptions, and effectively doomed the 
program as set up in the 1976 papers. However, a useful outcome of writing these 
papers was that it started me thinking more generally about the idea of gravitation 
as an effective theory, and in particular about Sakharov’s ideas on gravitation, which 
I briefly discussed in the paper Adler (1976); following up this direction later on led 
to my work on the Einstein action as a symmetry-breaking effect, discussed in the 
next section. 

A second topic that I worked on in 1976 was the regularization of the stress- 
energy tensor for particles propagating in a general background metric. In the paper 
Adler, Lieberman, and Ng (1977), we applied covariant point-splitting techniques to 
the Hadamard series for the Green’s functions, which we used to define a regular¬ 
ized stress-energy tensor for vector and scalar particles. This was a very technical 
computation, and contained useful formulas among its results, but also produced an 
embarrassment: by our method of regularization, we did not find the trace anomaly 
that had been found by others using different methods. We rechecked our calcula¬ 
tion carefully, but could not find the source of the discrepancy. The problem was 
finally resolved by Wald (1978) (in time to be described in a note added in proof 
to our 1977 paper). Wald had earlier (Wald, 1977) set up a general axiomatization 
for the stress-energy tensor, and in Wald (1978) had shown that it leads to an es¬ 
sentially unique result. Applying a point-separation method similar to ours, he had 
also found no trace anomaly, but then went on to note that there was a subtle error 
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in our analysis. We had assumed that the local and boundary-condition-dependent 
parts of the Hadamard solution are separately symmetric in their arguments, but 
this is in fact not the case; only their sum is symmetric. Wald (1978) showed in the 
scalar case that when the analysis is repeated without the incorrect assumption, one 
gets the standard trace anomaly. Judy Lieberman and I then did the corresponding 
calculation in the vector particle case (Adler and Lieberman, 1978, R40), again find¬ 
ing that when the asymmetry of the two pieces of the Hadamard solution is taken 
into account, one gets the correct trace anomaly. 

In a lunchtime conversation at some point during the 1977-1978 academic year, 
Robert Pearson asked whether the “no-hair” theorems of general relativity applied 
to the case of spontaneous symmetry breaking. I thought this was interesting and 
looked into it, finding no relevant papers in the literature. This became the subject 
of a joint paper (Adler and Pearson, 1978, R41), which showed that the standard 
“no-hair” theorems generalize to the vector field in the Abelian Higgs model, and 
to the non-conformally invariant Goldstone scalar field model. In our paper, we 
restricted ourselves to static, spherically symmetric black holes, and made the phys¬ 
ically motivated assumption that any “hair” would also be static and spherically 
symmetric. This permits a simplifying choice of gauge for the Abelian Higgs model 
introduced by Bekenstein (1972). He observes that static electric charge distribu¬ 
tions must give rise to static electric fields and vanishing magnetic fields. Thus one 
can find a special gauge in which the potentials A M obey A = 0, dA$/dt = 0. Since 
the gauge-independent source current obeys similar conditions j — 0, djo/dt = 0, 
and since the gauge-independent magnitude of the Higgs scalar field is static, one 
finds that the residual phase of the Higgs scalar field in the special gauge is a space- 
independent, linear function of time, which can be eliminated by a further gauge 
transformation that preserves the gauge conditions A = 0, dA$/dt = 0. Thus one 
can do the analysis of possible “hair” taking the vector potential to be zero, and 
the Abelian Higgs field to be real. I have described Bekenstein’s argument here in 
some detail because the choice of gauge in R41 is the basis of rather loosely worded 
objections to our paper in lectures of Gibbons (1990); his assertion (and that of 
authors who have quoted his lectures) that the gauge choice is problematic is not 
correct, as working through the Bekenstein argument given above makes clear. Also, 
I have rechecked the proof given in R41, and apart from the minor problem found 
by Ray, as discussed below, I find that the proof is correct, in disagreement with 
further statements in Gibbons 5 lecture. However, in response to Gibbons 5 comments 
about our choice of gauge, proofs of the “no-hair 55 theorem for the Abelian Higgs 
model that do not use a special gauge choice have since been given by Lahiri (1993) 
and by Ayon-Beato (2000). 

Our argument starting from Eq. (24) of R41 was subsequently considerably sim¬ 
plified, and in the case when d9/dX\n = 0 corrected, in a paper of Ray (1979). (The 
subscript H here refers to evaluation at the horizon; see R41 for details of this and 
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other notation used in the following discussion of Ray’s paper.) The minor problem 
noted by Ray resulted from our not dropping the subdominant term d6/dX on the 
right-hand side of Eq. (31) when integrating this equation to get Eq. (33), so as 
to be consistent with our dropping this term elsewhere, such as in Eq. (32). When 
this term is dropped, the 0* -1 / 2 factor in Eq. (33) is replaced by a constant, and the 
approximate solution of Eq. (33) agrees with the exact solution of Eq. (24) given by 
Ray. As Ray points out, with this correction one still finds that is infinite at 

the horizon unless K = 0, which is what is needed to complete the proof. 

Finally, I note that the subject of black hole “hair” in gauge theories has taken 
on new interest recently with the discovery that topological charges on a black hole 
can give nonzero effects outside the horizon; see, for example, Coleman, Preskill, 
and Wilczek (1992) and the related lectures of Wilczek (1998). 


Einstein Gravity as a Symmetry Breaking Effect 

In late January of 1978 I organized a small conference on “Geometry, Gravity and 
Field Theory” for the EST Foundation in San Francisco; this was a memorable 
event that was attended by a large fraction of the leading people with interests 
in quantum gravity. During my plane travel for this conference, and afterwards, 

1 started to think about the confinement problem in QCD, and this became the 
main focus of my research for the next two years, as described in the following 
chapter. However, learning about scale breaking in QCD also led me back into 
gravitational physics, through considering the role similar mechanisms might play 
in giving a quantitative form to the suggestion by Sakharov (1968) (see also Klein, 
1974) that Einstein gravity is the “metric elasticity” of spacetime. I did not arrive at 
the correct formulation immediately; I find in my files two unpublished manuscripts, 
the first positing monopole boundary conditions, and the second positing dimension- 

2 operators, as a source for symmetry breaking, in both cases suggesting connections 
with the Einstein-Hilbert action. I went as far as submitting a manuscript based on 
the second for publication, and also gave a seminar on it at Princeton University, 
where my arguments were torn to shreds by David Gross (following which I withdrew 
the manuscript). The criticism proved useful; I went home, learned more about 
dimensional transmutation and the theory of calculability versus renormalizability, 
and came up with the correct formulation given in Adler (1980a), R42. The basic 
idea here is that in theories which contain no scalars, so that scale invariance is 
spontaneously broken (QCD is a prime example, but “technicolor” type unification 
models also fit this description), there will be an induced order R term in the action 
in a curved background, with a coefficient that is calculable in terms of the scale 
mass of the theory. Thus, if an underlying unified theory spontaneously breaks scale 
invariance at the Planck scale, one can induce the Einstein gravitational action as a 
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scale-symmetry breaking effect, giving an explicit realization of the Sakharov-Klein 
idea. 

I followed up this paper with a second one (Adler, 1980b, R43) in which I gave an 
explicit formula for the “induced gravitational constant” in theories with dynamical 
breakdown of scale invariance, expressed in terms of the vacuum expectation of the 
autocorrelation function of the trace of the renormalized stress-energy tensor 

(i6rf? lnd )-‘ = ±jd? x \(xy ~ ■ 

This formula for the induced Newton constant was independently obtained at about 
the same time by Zee (1981), and in the subsequent literature, the term “induced 
gravity” has come to be frequently used to describe the whole set of ideas involved. 
These papers attracted considerable attention in the gravity community, one result 
of which was that Claudio Teitelboim and his colleagues at the University of Texas 
in Austin invited me to give the Schild lectures in April of 1981. (My four lectures 
over a two week period, entitled “Einstein Gravity as a Symmetry-Breaking Effect 
in Quantum Field Theory”, were the eleventh in the Schild series.) This proved 
memorable for an unanticipated reason; shortly before I was to go to Texas I con¬ 
tracted a mild case of what was probably type-A hepatitis (the kind transmitted 
by shellfish), and so was sick in bed with very little energy. I dragged myself out of 
bed on alternate days to write lecture notes, and then was so tired I had to sleep 
the entire day following. At any rate, I improved enough so that my doctor gave 
me permission to go to Texas, where Philip Candelas took me into his home and 
helped me get through my scheduled lectures. Ultimately, I expanded the lectures 
into a much-cited comprehensive article that appeared in Reviews of Modem Physics 
(Adler, 1982, R44). A year later, I wrote a briefer synopsis of the program of gener¬ 
ating the Einstein action as an effective field theory, for a Royal Society conference 
on “The Constants of Physics”, which was published as Adler (1983). 

The explicit formula for the induced gravitational constant raises a number of 
interesting issues. First of all, if one assumes an unsubtracted dispersion relation 
for the Fourier transform ^(g 2 ) of the autocorrelation function of the stress-energy 
tensor trace, the induced gravitational constant is negative. However, as shown by 
Khuri (1982a) using analyticity methods, in asymptotically free theories there are 
three possible cases, depending on the distribution of zeros of ^(g 2 ), and in one of 
these cases G- m d has positive sign. In further papers Khuri (1982b,c) showed that 
in this case one can place useful bounds on the induced gravitational constant, 
expressed in terms of the scale mass of the theory. 

The question of whether the formula for the induced gravitational constant gives 
a unique answer has been discussed, from the point of view infrared renormalon sin¬ 
gularities, by David (1984) and in a follow-up paper of David and Strominger (1984). 
These authors argue that renormalons introduce an arbitrariness into the calculation 
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of G i n d, as manifested through the fact that in the dimensional regularization of the 
ultraviolet singular “comparison function” introduced in Eq. (5.48) of R44, 

one has to continue onto a cut. In Appendix B, Section 3 of R44, I used a principal 
value prescription to deal with this, which David argues can be modified by taking 
complex weightings of the upper and lower sides of the branch cut, allowing a free 
parameter multiple of the imaginary part to be introduced into the calculation of 
the integral over the comparison function. David argues that this means that the 
expression for G\ n d has an inherent ambiguity. I believe that this conclusion is sus¬ 
pect; since QCD and similar theories that spontaneously generate a mass scale are 
believed to be consistent field theories, their curved spacetime embeddings should, 
by the equivalence principle, also be consistent theories. This strongly suggests that 
the coefficient of the order R term in a curvature expansion of the vacuum action 
functional should be well defined, and that the ambiguity is an artifact of the com¬ 
parison function procedure. This view is supported by the review article of Beneke 
(1999) on renormalons, where it is argued that renormalon ambiguities are typically 
canceled by corresponding ambiguities in non-perturbative terms (such as the inte¬ 
gral A Ijjv with integrand ^ - 4> c (£) in Eq. (5.48)), giving total physical amplitudes 
that are unambiguous. In other words, the renormalon ambiguities are an artifact of 
an attempted separation of QCD physical amplitudes into a “perturbative” and a 
“non-perturbative” part, and only indicate that if a branching prescription (such as 
a principal value) is needed for the perturbative part, then a corresponding branch¬ 
ing prescription is also needed for the non-perturbative part. This will make the 
calculation of quantities like G- m d difficult, but does not imply that the calculation 
cannot, in principle, give a unique, physical answer. In the paper of David and Stro- 
minger (1984), the authors show that G- m d is unambiguous in finite supersymmetric 
theories, giving an existence proof that there are theories with a finite induced New¬ 
ton’s constant. In the general case, they acknowledge that “there is no proof that 
Gi n d will necessarily be ambiguous”, and I suspect that in fact Gj n d will turn out to 
be well defined in a much wider class of supersymmetric and non-super symmetric 
theories than only finite ones. Clearly, this is a question that merits further study. 

If one thinks more generally about the structure of a fundamental theory of 
gravitation, there are a number of possibilities. It may be that the Planck length is 
the minimum length scale possible, because of an underlying “graininess” of space- 
time. Or spacetime may be a continuum, as generally assumed, in which case the 
Planck length plays the role of the scale at which a classical metric breaks down, 
with new dynamical principles taking over at shorter distances. The suggestion that 
the order R gravitational action is an expression of scale symmetry breaking in a 
more fundamental scale-invariant theory is clearly based on a continuum picture 
of spacetime. A continuum assumption is also made in string theories, which how¬ 
ever are not scale-invariant; in string theories a fundamental length scale (the string 
tension) appears in the action, and this directly sets the scale for the gravitational 
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action. Should spacetime turn out to be discrete or grainy, there may be more gen¬ 
eral forms of the induced gravitation idea that are relevant. Ultimately, the origin of 
the spacetime metric, and of the Einstein-Hilbert gravitational action that governs 
its dynamics, will not be certain until we have a unifying theory that also resolves 
the cosmological constant problem, which is not dealt with in any of the current 
ideas about quantum gravity. 
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7. Non-Abelian Monopoles, Confinement Models, and 

Chiral Symmetry Breaking 


Introduction 

The somewhat disparate topics to be discussed in this chapter are all connected 
through my interest during the late 1970’s and early 1980’s in studying nonper- 
turbative properties of quantum chromodynamics (QCD), the theory of the strong 
interactions. I began these investigations by looking for a semi-classical model for 
heavy quark confinement. My first idea, that quarks might be confined in a non- 
Abelian monopole background field, did not work, but led to interesting progress 
in the theory of monopoles, as described in the first section. Most significantly, as 
discussed in detail, the monopole work led indirectly to the completion by Clifford 
Taubes of his multimonopole existence theorem during a visit to the IAS in the 
spring of 1980. I then turned to models based on the nonlinear dielectric properties 
of the QCD vacuum, which led to the confinement of quarks in “bag”-like structures 
which yield good heavy quark static potentials, as discussed in the second section. 
Finally, at the end of this period I worked briefly on the spontaneous breaking of 
chiral symmetry in QCD within the framework of pairing models patterned after 
superconductivity, as discussed in the final section. All three of these aspects of my 
study of QCD involved heavy numerical work, which in turn led to my interest in 
algorithms discussed in the next chapter. 


Non-Abelian Monopoles 

My first attempt at the confinement problem, which did not succeed but which had 
useful by-products that I shall describe here, was based on the idea of considering 
the potential between classical quark sources in the background of a non-Abelian 
’t Hooft (1974a)-Polyakov (1974)-Prasad-Sommerfield (1975)-Bogomornyi (1976) 
monopole or its generalizations, which I conjectured in Adler (1978b), R45 might act 
as a quark-confining “bag”. To justify considering classical quark sources, I initially 
resorted to a scheme (Adler, 1978a) that I called “algebraic chromodynamics”, which 
involved looking at the color space spanning the direct product of independent 
color charge matrices. However, I eventually dropped this apparatus in my pursuit 
of the confinement problem, and used instead the popular approximation of color 
charge matrices lying in a maximal Abelian subgroup of the SU(3) color group of 


84 


Adventures in Theoretical Physics 


conventional QCD, which gives a good first approximation to the full QCD color 
structure. Since it is clear that source charges in classical Yang-Mills theory are not 
confined, I looked for a simple modification of this theory that might lead to a linear 
potential. The first idea I tried was to look at classical Yang-Mills charges in the 
field of a background monopole. This had the obvious problem that the monopole 
scale has no clear relation to the QCD scale set by dimensional transmutation, but 
I simply ignored this difficulty and plunged ahead. 

To pursue (and ultimately rule out) the conjecture that a monopole background 
would confine, I did a number of calculations of properties of monopole solutions. 
The first was a calculation of the Green’s function for a single Prasad-Sommerfield 
monopole, by using the multi-instanton representation of the monopole and a formal¬ 
ism for calculating multi-instanton Green’s functions given by Brown et al. (1978). 
This calculation was spread over two papers that I wrote; setting up contour integral 
expressions for the Green’s function was done in Appendix A of Adler (1978b), R45, 
and the final result for the monopole propagator, after evaluation of the contour in¬ 
tegrals and considerable algebraic simplification, was given in Appendix A of Adler 
(1979a), R46. (The fact that many lengthy expressions for parts of the Green’s func¬ 
tion collapsed, after algebraic rearrangement, into simple formulas, suggested that 
there should be a more efficient way to find the monopole Green’s function. Not 
long afterwards, Nahm (1980) gave a new representation for the monopole that per¬ 
mitted a much simpler calculation of the Green’s function given in R46.) To check 
that the lengthy expression that I had obtained for the propagator really satisfied 
the differential equation for the Green’s function, I used numerical methods, calcu¬ 
lating the partial derivatives acting on the propagator by finite difference methods 
on a very fine mesh. Prom numerical calculations based on the propagator formula, 
it was clear that a single monopole background would not lead to confinement; all 
that happened was that a Coulombic attractive — 1/r potential was reversed into 
a repulsive 1/r potential for large quark separations, a result that could have been 
anticipated from the large distance structure of the monopole field. 

Not yet ready to give up on the monopole background idea, I then wrote two 
papers speculating that the Prasad-Sommerfield monopole might be a member of a 
larger class of solutions, in which the point at which the monopole Higgs field van¬ 
ishes is extended to a higher-dimensional region, and in particular to a “string’Mike 
configuration with a line segment as a zero set. In the first of these papers (Adler 
1979b) I studied small deformations around the Prasad-Sommerfield monopole and 
found several series of such deformations. For normalized deformations I recovered 
the monopole zero modes that had already been obtained by Mottola (1978, 1979), 
but I found that “if an axially symmetric extension exists, it cannot be reached 
by integration out along a tangent vector defined by a nonvanishing, non-singular 
small-perturbation mode”. This work was later extended into a complete calculation 
of the perturbations around the Prasad-Sommerfield solution by Akhoury, Jun, and 
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Goldhaber (1980), who also found “no acceptable nontrivial zero energy modes.” 
In my second paper, Adler (1979c), I employed nonperturbative methods and sug¬ 
gested that despite the negative perturbative results, there might still be interesting 
extensions of the Prasad-Sommerfield solution with extended Higgs field zero sets. 

At just around the same time, Erick Weinberg wrote a paper (Weinberg, 1979b) 
extending an index theorem of Callias (1978) to give a parameter counting theorem 
for multi-monopole solutions. Weinberg concluded that “any solution with n units of 
magnetic charge belongs to a (4n — l)-parameter family of solutions. It is conjectured 
that these parameters correspond to the positions and relative U( 1) orientations of 
n noninteracting unit monopoles”. For n = 1, his results agreed with the zero-mode 
counting implied by Mottola’s explicit calculation. Weinberg and I were aware of 
each other’s work, as evidenced by correspondence in my file dating from March 
to June of 1979, and references relating to this correspondence in our papers Adler 
(1979c) and Weinberg (1979b). 

My contact with Clifford Taubes was initiated by an April, 1979 letter from 
Arthur Jaffe, after I gave a talk at Harvard while Jaffe, as it happened, was visiting 
Princeton! In his letter, Jaffe noted that I was working on problems similar to those 
on which his students were working, and enclosed a copy of a paper by Clifford 
Taubes. (This preprint was not filed with Jaffe’s letter, so I am not sure which of 
the early Taubes papers listed on the SLAC Spires archive that it was.) Jaffe’s let¬ 
ter initiated telephone contacts with Taubes and some correspondence from him. 
On Jan. 6, 1980 Taubes wrote to me that he was making progress in proving the 
existence of multi-monopole Prasad-Sommerfield solutions, and in this letter and a 
second one dated on January 18, 1980 he reported results that were relevant to my 
conjectures on the possibility of deformed monopoles. His results placed significant 
restrictions on my conjectures; in a letter dated Feb. 1, 1980 I wrote to Lochlainn 
O’Raifeartaigh, who had also been interested in axially symmetric monopoles, say¬ 
ing that “On thinking some more about your paper (O’Raifeartaigh’s preprint was 
unfortunately not retained in my files) I realized that the enclosed argument by 
Cliff Taubes is strong evidence against n = 2 monopoles involving a line zero. What 
Taubes shows is that a finite action solution of the Yang-Mills-Higgs Lagrangian 
cannot have a line zero of arbitrarily great length; hence if n = 2 monopoles con¬ 
tained a line zero joining the monopole centers, the monopole separation would be 
bounded from above. But this seems unlikely...”. This correspondence and the re¬ 
sult of Taubes was mentioned at the end of the published version, Houston and 
O’Raifeartaigh (1980). 

As a result of our overlapping interests, I arranged for Taubes to make an informal 
visit, of two or three months, to the IAS during the spring of 1980. Clifford had 
expressed interest in this, he noted in a recent email, in part because Raoul Bott 
had suggested that he visit the Institute to get acquainted with Karen Uhlenbeck, 
who was visiting the IAS that year. In the course of his visit he met and interacted 
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with Uhlenbeck, who, along with Bott, had a major impact on his development as 
a mathematician. 

Taubes began the visit by looking at my conjecture of extended zero sets, but 
after a while told me that he could not find an argument for them. Partly as a result 
of his work, I was getting disillusioned with my own conjecture, so I asked him what 
was happening with his attempted proof of multi-monopole solutions. Taubes replied 
that he was stuck on that, and not sure whether they existed. I then mentioned to 
him Erick Weinberg’s parameter counting result, which strongly suggested a space of 
moduli much like that in the instanton case, where looking at deformations correctly 
suggests the existence and structure of the multi-instanton solutions. To my surprise, 
Taubes was not aware of Erick’s result, and knowing it impelled him into action on 
his multi-monopole proof. Within a week or two he had completed a proof, and 
wrote it up on his return to Harvard. (Thus, there was a parallel to what happened 
a year before with respect to solutions of the first order Ginzburg-Landau equations. 
In that case, Taubes had heard a lecture at Harvard by Erick Weinberg on param¬ 
eter counting for multi-vortex solutions (written up as Weinberg, 1979a) and then 
went home and came up with his existence proof for multi-vortices, Taubes (1980). 
The vortex work provided the initial impetus for Taubes’ turning to the monopole 
problem.) In his paper Taubes showed that “for every integer N ^ 0 there is at 
least a countably infinite set of solutions to the static SU( 2) Yang-Mills-Higgs equa¬ 
tions in the Prasad-Sommerfield limit with monopole number N. The solutions are 
partially parameterized by an infinite sublattice in Sjv(i2 3 ), the iV-fold symmetric 
product of i? 3 and correspond to noninteracting, distinct monopoles.” This quote 
is taken from the Abstract of his preprint “The Existence of Multi-Monopole Solu¬ 
tions to the Static, SU (2) Yang-Mills-Higgs Equations in the Prasad-Sommerfield 
Limit”, which was received on the SLAC Spires data base in June, 1980, and which 
carried an acknowledgement on the title page noting that “This work was completed 
while the author was a guest at the Institute for Advanced Studies, Princeton, NJ 
08540”. His preprint also ended with an Acknowledgment section noting his con¬ 
versations with me, with Arthur Jaffe, and with Karen Uhlenbeck, as well as the 
Institute’s hospitality. The proof was not published in this form, however, but in¬ 
stead appeared (with acknowledgments edited out at some stage) as Chapter IV of 
the book Jaffe and Taubes (1980) that was completed in August of 1980. The mul¬ 
timonopole existence proof was a milestone in Taubes’ career; in a recent exchange 
of emails relating to the events described in this section, Taubes commented on his 
visit “to hang out at the IAS during the spring of 1980. It profoundly affected my 
subsequent career...”. He went on to further investigations of monopole solutions, 
that lead him to studies of 4-manifold theory which have had a great impact on 
mathematics. 

O’Raifeartaigh, who had been following the monopole work at a distance, invited 
me during the spring of 1980 to come to Dublin that summer to lecture on my papers. 
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However, since Taubes had much more interesting results I suggested to Lochlainn 
that he ask Clifford instead, and Taubes did go to Dublin to lecture. After Clifford’s 
visit, I redirected my search for semiclassical confinement models to a study of 
nonlinear dielectric models by analytic and numerical methods, in collaboration 
with Tsvi Piran,* these models do give an interesting class of confining theories, and 
are described in the following section. Based on the observation that the Yang-Mills 
action is multiquadratic (that is, at most quadratic in each individual potential 
component), Piran and I also applied the same numerical relaxation methods to 
give an efficient method for the computation of axially symmetric multimonopole 
solutions. (This was done mainly to illustrate the computer methods, since by then 
exact analytic 2-monopole solutions had appeared; see Forgacs, Horvath and Palla 
(1981) and Ward (1981).) The numerical methods that Piran and I developed were 
described in our Reviews of Modern Physics article Adler and Piran (1984), R47 
that marked the completion of the research program on confining models, and as a 
by-product, on monopoles. 


Confinement Models 

Having seen that monopole backgrounds would not confine, I turned my attention 
to another type of semi-classical model, proposed in various forms by Savvidy (1977) 
(see also Matinyan and Savvidy (1978)) and Pagels and Tomboulis (1978). The basic 
idea is to do electrostatics with Abelianized quark charges, and with the fundamen¬ 
tal QCD action replaced by a renormalization group improved effective action, in 
which the gauge coupling is replaced by a running coupling, that is taken to be a 
function solely of the field strength squared. Although use of the running coupling 
is only justified by the renormalization group in the ultraviolet regime of large field 
strengths, the model assumes that the same functional form can be extrapolated to 
small field strengths as well. This leads to electrodynamics with a nonlinear, field- 
dependent dielectric constant that develops a zero for small squared field strengths. 
Because the only dynamical input from QCD is the running coupling, the model, as 
Frank Wilczek later remarked to me, can be considered as a very simple embodiment 
of the idea that “asymptotic freedom” should be associated with “infrared slavery”. 
Since the running coupling involves a scale mass, the model directly incorporates 
the phenomenon of dimensional transmutation. Pagels and Tomboulis conjectured, 
on the basis of various evidence, that the nonlinear dielectric model would confine, 
but did not have a proof. 

In the paper Adler (1981), R48, I analyzed the effective action model in detail 
and proved that it confines quarks. The argument starts from a Euclidean form 
of the Feynman path integral, and shows that the static potential is the minimum 
of the effective action in the presence of sources. I then specialized to the leading- 
logarithm effective action, and showed that the action minimum is associated with a 
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field configuration in which a color magnetic field fills in whenever the color electric 
field is less than the minimum magnitude k at which the effective action is minimized. 
This reduces the action minimization to an electrostatics problem, to which one can 
apply flux conversation estimates due to’t Hooft (1974b). In the nonlinear dielectric 
model context, these estimates show that the static potential is bounded from below 
by kQ{R — r), with Q the Abelianized quark charge, with r a constant, and with R 
the interquark separation. Hence the potential increases linearly for large i?, and the 
model confines. In an Appendix to R48, I discussed how a one-loop renormalization 
group exact, leading-logarithm running coupling can be obtained, by a coupling 
constant transformation, from the more usual two-loop renormalization group exact 
running coupling (to which the confinement argument also applies). 

When I presented this proof of confinement by the nonlinear dielectric model 
at a Department of Energy sponsored workshop in Yerevan, Armenia in 1983, an 
interesting dialogue with the Soviet physicist A. B. Migdal ensued. When I started 
to talk, and said what I was going to prove, Migdal stood up and stated that it was 
well-known that the Savvidy (-Pagels-Tomboulis) model did not confine, and gave 
some reasons. I then presented my proof, after which Migdal stood up, and said 
words to the effect that the problem is that there are too many confining models! 
As we shall see, there is really only one other model, the “dual superconductor” 
model, which like the nonlinear dielectric model is motivated by the idea of a color 
magnetic condensate, but describes this with a different dynamics. 

Following publication of R48, I wrote a paper (Adler, 1982a) formalizing the 
approximations (further discussed below) needed to get an Abelianized effective 
action model from the functional integral for QCD. I then turned to the problem 
of understanding in detail how the leading-log model gives a confining potential. 
Since it was clear that this would, at least in part, involve numerical solution of the 
nonlinear differential equations involved, I brought in Tsvi Piran as a postdoc. Tsvi 
had worked extensively in the numerical solution of the Einstein equations of general 
relativity, and came to the IAS with the understanding that he would continue this 
and other interests he had in astrophysics, but would also collaborate with me in the 
numerical solution of the leading-log model equations. Because of my work on the 
induced gravity program, this collaboration didn’t start immediately after Tsvi’s 
arrival, but once we began work, Tsvi taught me a great deal about setting up an 
interactive program to numerically solve partial differential equations. As is typical 
in doing numerical work, most of our time was spent developing and testing our 
computer codes, which took many months. To guard against programming errors, 
Tsvi and I each independently wrote our own programs, which once debugged gave 
identical results. The final production runs took a total of less than two days running 
time on the then new IAS VAX 11/780 computer, using mesh sizes of up to 100 x 100 
to resolve details of the confinement domain. 

The equations to be solved, in the leading-log model with three light fermion 
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flavors and scale mass K, are 

V • (e(E)E) = j° , 

j° = QS(x)S(y)\5(z - a) - 5(z + a)] , 

e(E) = h 0 \og(E 2 /K 2 ) , E=\E\ , 

h 9 
io= 8^ ' 

We also studied the leading-log-log model, in which the two-loop exact form of the 
running coupling is used. We originally tried to solve the equations directly in terms 
of the scalar potential A 0 , but found that the numerical programs were unstable. 
I then introduced a flux function reformulation of the problem (suggested by sim¬ 
ilar methods used in plasma physics), and this gave a stable, rapidly convergent 
iteration showing formation of a flux-confining free boundary. To understand the 
structure of the free boundary, Tsvi suggested that a paper of Fichera on elliptic 
equations that degenerate to parabolic would be relevant, and this indeed was the 
case, as described in Appendix A of our review R47. Prior to writing the review, we 
wrote two shorter papers. The first (Adler and Piran, 1982a, R49) demonstrated flux 
confinement and gave a numerical determination of the large R asymptotic form of 
the interquark potential, which contains a leading term linear in i?, and a subdom¬ 
inant term proportional to log k 1 / 2 R. The second (Adler and Piran, 1982b, R50) 
gave compact, accurate functional forms that fit the computed static potentials for 
both the leading-log and the leading-log-log models. 

One nice feature of the leading-log model (as well as the leading-log-log exten¬ 
sion) is that its small distance and large distance limiting cases can be approximated 
analytically. In the small distance limit, I devised an analytic perturbation method 
(Adler, 1982b) which shows that the potential has the standard form of a Coulomb 
potential with a logarithmic correction that is expected from perturbative QCD, 
permitting the parameter k of the model to be related to the QCD scale mass. With 
this identification, the model has no adjustable parameters. In the large distance 
limit, an ingenious analysis by Lehmann and Wu (1984) showed that the confine¬ 
ment domain is an ellipsoid of revolution, with maximum diameter growing as R 1 / 2 
with the interquark separation, and gave an analytic expression for the free bound¬ 
ary shape for large R as well as the subdominant term in the potential. Thus, the 
model yields a “fat” bag, rather than a cylindrical confinement domain of uniform 
radius; however, Lehmann told me at the time that he believed the true QCD be¬ 
havior would show a constant-radius cylindrical domain, and he appears now (see 
below) to be right. As discussed in the articles I wrote with Piran, the analytic 
forms for both small and large R agreed very well with our numerical results, giving 
confidence that the numerical analysis had been carried out correctly. 

How well do the nonlinear dielectric models agree with QCD? There are two 
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aspects to this question, whether they give satisfactory static potentials, and whether 
they describe the flux confinement domain that is realized in QCD. To assess the 
static potentials tabulated in R50, one has to do a detailed fit to heavy quark 
spectroscopic data. This was done in papers of Margolis, Mendel, and Trottier (1986) 
and of Crater and Van Alstine (1988), both of which concluded that the log-log model 
potential is in good agreement with experimental data on heavy quark systems, with 
reasonable values of the quark masses. The fit of Margolis, Mendel, and Trottier used 
a value of A= 0.270 GeV, while that of Crater and Van Alstine used a value of 
A$s = 0.215GeV (note that their A is the k 1 / 2 of R50, which is related to A $s 
by A Mg = 0.959/t 1 / 2 ). These values of A^g are in reasonable accord with the value 
Ams = 0.218 GeV that Piran and I had quoted in R50, obtained by requiring the best 
fit of our potential to Martin’s phenomenological potential for heavy quark systems. 
These values of A^g should be compared with the three light quark experimental 
value ~ 0.369 GeV (Hinchliffe, 2005). For a simple extrapolation from the 

asymptotically free regime to the confining regime of QCD, the nonlinear dielectric 
model does reasonably well in accounting for heavy quark spectroscopy. 

As already noted, the confinement domain in the nonlinear dielectric models is 
an ellipsoid of revolution, with width increasing with the quark separation R. Let 
p be the cylindrical radial coordinate, and z the coordinate along the axis of the 
cylinder. On the medial plane z = 0, various quantities of interest can be computed 
in the large -R limit directly from the Lehmann-Wu asymptotic solution. The radius 
of the confinement domain on the medial plane is 



and the value of \D\ on the medial plane is 



2QbpK 

7r 


(1 - P 2 /P 2 m ) 


from which one can check that the flux integral gives 2n J 0 PTn pdp\D\ = Q. The profile 

of \D\ is evidently parabolic, and scales with p m oc R^. 

To compare this “fat bag” confinement domain with QCD, one must rely on lat¬ 
tice simulations, since in real-world QCD, the confining flux tube breaks up through 
quark-antiquark pair formation before the asymptotic regime is reached. Assuming 
that the lattices used are large enough to accurately approximate the continuum 
theory, the data from simulations that have been carried out show a confinement 
domain of constant diameter in the limit of large i?, as discussed and referenced in 
the book of Ripka (2004). The details of the simulated confinement domain favor the 
“dual superconductor” model, in which QCD is regarded as a dual of a Ginzburg— 
Landau superconductor, with magnetic monopole pairs replacing the Cooper pairs 
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of superconductivity. In this picture, in addition to the color fields, there is a dy¬ 
namical variable corresponding to the monopole condensate. A numerical analysis 
of flux confinement in a dual superconductor, using the methods described in my 
review with Piran R47, has been given by Ball and Caticha (1988), who give plots 
of the confinement domain; for further details and references, see both Ripka (2004) 
and the review of Baker, Ball, and Zachariasen (1991). For appropriate values of 
the three dual superconductor model parameters (a magnetic charge #, which can 
be related to an effective QCD coupling e e ff by the Dirac quantization condition 
g = 27r/eeff, a scalar magnetic condensate mass m//, and a gauge gluon mass my), 
good fits to the lattice simulations are obtained, and the dual superconductor model 
also gives a phenomenologically satisfactory static potential. (In a recent preprint, 
Haymaker and Matsuki (2005) argue that in lattice comparisons, the continuum 
my gives rise to two parameters that must be fitted, making four parameters in 
all including g.) However, since the dual superconductor gives a Coulomb poten¬ 
tial at short distances, without logarithmic modifications, the dual superconductor 
parameters cannot be directly related to the QCD scale A^§ as was possible for 
the scale parameter k of the nonlinear dielectric model. As a limiting case, the dual 
superconductor model gives the standard bag model with a field discontinuity at the 
boundary; a numerical solution of this model is also discussed in Ball and Caticha 
(1988). 

Although the nonlinear dielectric model and the dual superconductor model suc¬ 
cessfully describe important aspects of confinement in QCD, major steps would be 
needed to incorporate such classical action models into a proof of confinement. To 
do so one would have to prove that the true energy of a widely separated quark- 
antiquark pair in QCD is bounded from below by the energy calculated in one or 
the other of the two models. This would require achieving precise control over the 
qualitative approximations involved in the models, which include a mean-field ap¬ 
proximation to the functional integral as discussed in Adler (1982a), the replacement 
of the exact QCD effective action by the model effective action, and replacement 
of the octet of color quark charges by Abelianized effective charges lying in the 
maximal commutative subgroup. Although, as I argued in the case of the nonlin¬ 
ear dielectric model in Adler (1982a), these simplifications of the full problem are 
plausible, replacing qualitative approximations by precise mathematical statements 
with error estimates will be no small task. 

In any flux tube picture of confinement based on Abelianized charges, such as 
either the nonlinear dielectric model or the bag limit of the Ginzburg-Landau dual 
superconductor model, the string tension scales as the Abelianized quark charge, 
or as the square root of the corresponding Casimir. In a paper with Neuberger 
(Adler and Neuberger, 1983, R51), we pointed out that in the large- N c limit of 
SU(N C ) gauge theory, the string tension scales with the Casimir when changing from 
fundamental to adjoint representation quarks; hence to the extent that flux tube 
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models give a good description of confinement in N c = 3 QCD, different confinement 
mechanisms appear to be at work in QCD and in its large N c limit. 


Chiral Symmetry Breaking 

Not long after I had finished the review paper R47 with Piran summarizing our 
work on confinement models, Anne Davis suggested looking at another outstanding 
problem in QCD, that of chiral symmetry breaking. After studying the relevant 
literature (reviewed in the Introduction to our paper Adler and Davis (1984), R52), 
we decided to focus on setting up and solving a superconductor-like gap equation for 
fermion pairing in Coulomb gauge, systematically imposing the axial-vector current 
Ward identities to get the correct renormalization procedure. This method permitted 
us to study pairing using a Lorentz vector instantaneous confining potential with 
V oc r, getting infrared-finite results for physical quantities without imposing ad 
hoc infrared cutoffs. The model gives spontaneous breaking of chiral symmetry, but 
with values of the quark condensate (uu) and the pion decay constant f n that 
are considerably too low when the phenomenological confining potential (or string 
tension) is used as input. Similar results were also found by a group at Orsay, and 
we learned later that the utility of the axial-vector Ward identities in deriving the 
gap equation had also been noted by Delbourgo and Scadron (see the reprinted 
papers R52 and R53 for references). Extensions of the model of R52 to the finite 
temperature case were later discussed by Davis and Matheson (1984), Alkhofer and 
Ammundsen (1987), and Klevansky and Lemmer (1987). 

In a subsequent paper (Adler, 1986, R53) that I wrote for the Nambu Festschrift, 
I reviewed the work of various groups on gap equation models, and also noted a 
problem. In order for there to be no explicit breaking of chiral symmetry in the 
gap equation model, the instantaneous potential must be the time component of a 
Lorentz vector, so that it contains factors 70 that anticommute with 75 . However, 
experimental data on heavy quark spectroscopy show that the confining part of 
the potential is predominantly Lorentz scalar, and using a Lorentz scalar potential 
in the gap equation model would lead to explicit violation of chiral symmetry, and 
therefore invalidate the model. This suggests that the approximations leading to the 
gap equation model are not valid for the confining part of the potential. In R53,1 also 
gave equations that I had worked out for a retarded extension of the instantaneous 
potential model. The original intention had been for a graduate student in either 
Princeton or Cambridge to work on solving the extended model, but in view of 
the Lorentz structure problem this was not done (a covariant treatment of the gap 
equation model was later given by von Smekal, Amundsen, and Alkofer (1991)). 
For various proposals for addressing the Lorentz structure issue, see Lagae (1992), 
Szczepaniak and Swanson (1997), and Bicudo and Marques (2004). 

Shortly after the paper R52 was out, Cumrun Vafa, then a Princeton graduate 
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student, had a few conversations with me about his attempts to turn the Banks- 
Casher (1980) eigenvalue density criterion for chiral symmetry breaking into a proof 
that chiral symmetry breaking occurs in QCD. (For recent progress in applying the 
Banks-Casher criterion in the large- N c limit, see Narayanan and Neuberger (2004).) 
I didn’t have much in the way of concrete suggestions to offer, and Cumrun started 
also talking to Edward Witten, who very sagely suggested looking at a different 
problem, that of studying whether parity conservation can be spontaneously bro¬ 
ken in QCD. This problem proved tractable, and their papers (Vafa and Witten, 
1984a,b), proving that parity is not spontaneously broken in vector-like gauge theo¬ 
ries (and similarly for the isospin and baryon number symmetries), became part of 
Vafa’s thesis. The difference between the parity problem and the chiral symmetry 
problem can be understood by considering their respective order parameters. If par¬ 
ity is spontaneously broken, the pseudoscalar order parameter ui'y^u will receive a 
vacuum expectation. When the fermions are integrated out, one obtains a Lorentz 
invariant, parity-nonconserving operator functional X of the gluon fields that is real 
in Minkowski space, but picks up a factor of i when rotated to Euclidean space. 
This, together with positivity of the Euclidean space Dirac fermionic determinant 
in a vector-like theory, is the basis of the Vafa-Witten proof that adding a small mul¬ 
tiple of X to the action cannot make the ground state energy lower. In the chiral 
symmetry problem, the relevant order parameter is the parity conserving but chiral 
symmetry breaking scalar operator m/, which when the fermions are integrated out 
leads to a functional X' of the gluon fields that remains real when rotated to Eu¬ 
clidean space. Hence the Vafa-Witten argument suggests that the energy minimum 
may lie at a nonzero value of X\ but such a local analysis cannot find the global 
minimum, and hence does not give a proof of chiral symmetry breaking. Rigor¬ 
ous lattice inequalities given by Weingarten (1983) give a proof of chiral symmetry 
breaking only when additional strong assumptions are made, including the existence 
of the continuum limit and the confinement of color, together with use of anomaly 
matching conditions. 

Over twenty years later, the problem of proving the breakdown of chiral sym¬ 
metry in QCD is still open, as is that of proving confinement. In fact, there is 
considerable evidence that chiral symmetry breaking and confinement in QCD are 
related phenomena. For example, lattice simulations such as D’Elia et al. (2004) 
show that the deconfining and chiral transitions coincide; gap equation models of 
the type studied in R52 find chiral symmetry breaking for a confining potential but 
not for a Coulomb potential, and lattice inequalities of the type studied by Wein¬ 
garten also need confinement as an ingredient to show chiral symmetry breaking. 
Thus it appears that both of these outstanding problems in QCD are aspects of 
the larger problem of proving that QCD exists and has a mass gap, which is one of 
the seven Clay Foundation Millennium Problems in mathematics and mathematical 
physics. Perhaps in this century, with the added incentive of a $1 million reward, 
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rigorous proofs of confinement and chiral symmetry breaking in QCD will be found! 
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8. Overrelaxation for Monte Carlo and Other Algorithms 


Introduction 

As I have already noted, the investigations described in the previous chapter all 
involved extensive computer work. This got me interested in the issue of algorithms 
more generally, and led to two distinct research directions in the years that followed. 
One involved generalizing the acceleration methods for solving partial differential 
equations to the related problem of Monte Carlo simulations, as discussed in the first 
section that follows. The second involved neural networks and pattern recognition, 
and led among other things to work on image normalization methods, described 
briefly in the second section of this chapter. 


Overrelaxation to Accelerate Monte Carlo 

In preparation for numerically solving the partial differential equations for the 
leading-log models, I did general reading on numerical methods for handling partial 
differential equations. This taught me about the critical slowing down problem - the 
fact that as one refines meshes to get more accurate numerical solutions, the rate 
of convergence of the iterations slows down. I also learned about various strategies 
devised for defeating critical slowing down, and in particular about the successive 
over-relaxation (SOR) modification of the standard Gauss-Seidel iteration. In a 
Gauss-Seidel iteration of a positive functional, one replaces each successive variable 
by the value that locally minimizes the functional. In SOR, one builds in a system¬ 
atic overshoot beyond the local minimum, with the amount of overshoot tuned to 
the degree of mesh refinement, yielding more rapid convergence as a result. In the 
work of R47, Piran and I used SOR in all of our iterative solutions, and achieved 
substantial gains in convergence speed on our finest meshes. 

I became interested in Monte Carlo algorithms because it was clear that lattice 
gauge theory simulations probably would be the only way that one could study de¬ 
tails of the structure of the flux confinement domain in QCD. I knew from talks that 
I had heard in Princeton that there were two main Monte Carlo methods in use, the 
Metropolis method and the heat bath method, and also that the folk wisdom at the 
time was that heat bath was the best one could do, since it corresponded to “nature’s 
way” of achieving thermal equilibrium. However, since the zero temperature limit of 
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heat bath just corresponds to a Gauss-Seidel iteration, which I knew could be accel¬ 
erated by SOR, I suspected that the conventional wisdom was wrong, and that there 
should be extensions, to Monte Carlo thermalizations, of the standard acceleration 
methods for the solution of differential equations. Since the monopole numerical 
work had brought out the fact that the Yang-Mills action is multiquadratic, I de¬ 
cided to study this question in the simple context of multiquadratic actions, where 
the question becomes whether for quadratic actions, the SOR method for differen¬ 
tial equations has an extension to Monte Carlo thermalization. This is the question 
addressed in Adler (1981), R54, where I showed that SOR does indeed extend to the 
thermalization of multi-quadratic actions, by explicitly constructing in Eqs. (14a,b) 
the transition probability that obeys detailed balance when an overrelaxation pa¬ 
rameter is included in the iteration. (Note that the normalization factors in these 
equations have the n in the correct place, but the other factors inverted; this is 
corrected in Eqs. (9) and (11) of my later paper R55. The argument of R54 does 
not involve the normalization factors, and is unaffected.) For this overrelaxed ther¬ 
malization, I showed that the means of the thermalized variables iterate according 
to standard SOR; since standard SOR accelerates Gauss-Seidel, this implies that 
there should be a corresponding acceleration of the thermalization process as well. 

The 1981 paper R54 gave the earliest indication that Monte Carlo methods could 
be accelerated to improve critical slowing down, and for this reason was conceptu¬ 
ally important, as well as having later applications and extensions. To the best of 
my knowledge, I am supported by the literature on the subject, in stating that R54 
first introduced acceleration methods into Monte Carlo. Two compilations of Monte 
Carlo articles edited by Binder contain literature surveys, Binder et al. (1987), and 
Swendsen, Wang, and Ferrenberg (1992), relating to the critical slowing down prob¬ 
lem. In both surveys, the earliest listed reference is from 1983; neither survey cites 
my 1981 paper (or Whitmer’s 1984 paper - see below), although some of the cited 
articles do reference these papers. 

I didn’t immediately continue work on Monte Carlo acceleration myself, but 
suggested it as a thesis research area to my Princeton University graduate student 
Charles Whitmer. He applied the method to 0 4 and Higgs actions that are point split 
on a lattice with unit displacement ft according to cf) 4 (x) —> cf> 2 (x)cf) 2 (x + a/i), which 
makes them multiquadratic, and in the paper Whitmer (1984) reported improvement 
over conventional heat bath Monte Carlo. I got interested in the subject again a few 
years later, after Goodman and Sokal (1986) (who knew about the SOR method of 
R54 and Whitmer’s paper) proposed a stochastic extension of multigrid methods, 
and Creutz (1987) and Brown and Woch (1987) gave a simple implementation of 
the SOR idea for lattice gauge theory plaquette actions. This latter development 
eliminated the need for the problematic gauge fixing that I had used in R54 to 
keep the latticized gauge action multiquadratic, and opened the way to practical 
applications of SOR to gauge theory Monte Carlo studies. In the spring of 1987, 
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I went to Torino, Italy with my daughter Victoria, who had been eager to visit 
Europe after finishing her high school requirements. During this sabbatical term I 
was a visitor at the Institute for Scientific Interchange (ISI), at the invitation of 
Mario Rasetti and my former IAS colleague Tullio Regge; I also had an office at 
the University of Torino that I used a couple of days a week. Although I had been 
spending considerable time over the previous few years working on quaternionic 
quantum mechanics (see the next chapter), I decided on this trip to return to my 
old interest in Monte Carlo SOR, stimulated by the fact that experts in the Monte 
Carlo field had started to get interested. In a paper that I wrote while at ISI (Adler, 
1988a, R55) I gave a much more detailed analysis of overrelaxed thermalization for a 
quadratic action, and also gave extensions of the method to non-quadratic actions, 
including SU (n) gauge theory. 

After my return to the IAS from this sabbatical, I continued work on Monte- 
Carlo algorithms for several more years. In a paper written in the fall of 1987 after I 
returned from Italy (Adler, 1988b, R56), I gave an elegant formal analysis showing 
that the general linear iteration u 1 = Mu + N f corresponding to a splitting 1 = M + 
NL of the quadratic form L for a Gaussian action, has a corresponding stochastic 
generalization 


P(u -> u') = (P/v^tdetT^expl-iu' - Mu - Nf) T pT(u’ -Mu-Nf)} , 

with T = ^(L -1 — ML~ l M T )~ l a modified temperature matrix. This extends the 
SOR thermalization of R54 to a general linear iterative process. Later in the same 
academic year, I gave in Adler (1988c) a Metropolis variant of the SU(n ) method 
given in R55, that extended the method for the Wilson action used by Creutz and by 
Brown and Woch to general overrelaxation parameter to. In collaboration with Gyan 
Bhanot, a former IAS member and a Monte Carlo expert, we made a numerical study 
of the SU(2) version of this algorithm, with results reported in Adler and Bhanot 
(1989), R57. (Growing out of this collaboration, Bhanot spent several years as a half¬ 
time member of the IAS in the early 1990’s, in the course of which we wrote a number 
of further papers on a variety of Monte Carlo acceleration methods.) I also gave talks 
at lattice conferences; at the biennial Lattice Gauge conference Lattice 88, held at 
Fermilab that year, I gave a plenary talk reviewing work on algorithms for pure 
gauge theory, focusing primarily on the theory and application of overrelaxation 
methods (Adler, 1989, R58). Monte Carlo overrelaxation has become a standard 
part of the lattice gauge theorist’s tool kit; for a sampling of recent applications, see 
Kiskis, Narayanan, and Neuberger (2003), Holland, Pepe, and Wiese (2004), Meyer 
(2004), Pepe (2004), Shcheredin (2005), and de Forcrand and Jahn (2005). 
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Image Normalization 

During the 1990s, I interspersed my work on quaternionic quantum mechanics and 
particle physics with work on aspects of neural networks and pattern recognition. 
My neural network interests involved an analog device that I patented (Adler, 1993) 
and an article (Adler, Bhanot, and Weckel, 1996) analyzing its algorithmic aspects. 
In pattern recognition, from lunchtime conversations with Joseph Atick and Norman 
Redlich, I got interested in the problem of extracting those features of an image that 
are invariant under a symmetry transformation. This problem is closely analogous 
to that of extracting those features of a gauge potential that are gauge-invariant, 
and in Adler (1998), R59 I gave a general formal solution, based on imposing image 
normalizing constraints analogous to gauge-fixing constraints. I have reprinted here 
only the first two sections of this unpublished article (without references), in which 
the general theory is set up; further sections of the article give applications to a 
variety of viewing transformations of a planar object. Shortly afterwards, when one 
of the IAS string theory postdocs was interested in switching to a computer-related 
career, I suggested applying my methods to the problem of the similarity and affine 
normalization of partially occluded planar curves (such as the boundary of a planar 
object). We worked this out together and it was published as Adler and Krishnan 
(1998), R60, The excerpt R59 of the general paper that is reprinted here gives the 
background needed to follow the extension of the planar algorithm to curve segments 
given in R60. 
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9. Quaternionic Quantum Mechanics, Trace Dynamics, and 

Emergent Quantum Theory 


Introduction 

During the twenty years from 1984 to 2004, a large part of my time was spent on 
investigations into foundational areas of quantum mechanics. Most of my research 
from this period was later presented in two books that I wrote, Quaternionic Quan¬ 
tum Mechanics and Quantum Fields (Oxford University Press, New York, 1995) 
and Quantum Theory as an Emergent Phenomenon (Cambridge University Press, 
Cambridge, 2004). I have not included in this reprint volume any research papers 
incorporated (some considerably improved) into the two books, since this would 
be infeasible because of length limitations. So what I discuss in this chapter are a 
few papers dealing with quaternionic topics written during the period between the 
two books, together with a brief description of how I got interested in quaternionic 
quantum theory, and later on, in the possibility of a pre-quantum theory. 


Quaternionic Quantum Mechanics 

My interest in quaternionic quantum mechanics grew out of my interest in the Harari 
(1979)-Shupe (1979) model for composite quarks and leptons. They postulated an 
order-dependence for the preon wave functions (e.g., TTU, TUT, VTT were con¬ 
sidered to be three distinct color states), which suggested that quantum theory over 
a noncommutative field might be involved. I was never able to use quaternions or 
related ideas to make a successful preon model, either during the period before my 
book (see Adler, 1979, 1980) or after (Adler, 1994a), but the issues raised, and 
interactions with key people acknowledged in the Preface of the 1995 volume, led 
me to undertake a systematic study of quaternionic quantum mechanics. Perhaps 
the most important new result contained in my papers (Adler, 1988) and in my 
book is the fact that the S-matrix in quaternionic scattering theory is complex , 
not quaternionic, which was a surprise to the experts in the field and invalidated 
proposed searches (such as Peres, 1979) for quaternionic effects manifested through 
noncommuting scattering phases. I also clarified the relationship between time rever¬ 
sal symmetry in quaternionic quantum theory (where it is unitary) and in complex 
quantum theory (where it is antiunitary), proved that positive energy quaternionic 
Poincare group representations are complex and not intrinsically quaternionic, and 
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gave a quaternionic generalization of projective group representations (to which I 
shall return shortly). These were but a few of the many topics dealt with in my 
1995 book. My quaternionic investigations also motivated work I did in new direc¬ 
tions in standard quantum mechanics, such as a paper that I wrote showing that 
SU (3) x SU(12) is the minimal grand unified theory in which, species by species for 
charged fermions, no Dirac sea is required (Adler, 1989). 

After my book on quaternionic quantum mechanics was completed, a number 
of papers that I wrote with collaborators clarified issues that were left unresolved, 
or were inadequately treated, in the book. One of these issues dealt with the non- 
adiabatic geometric phase in quaternionic Hilbert space. This was discussed in my 
book, but on a visit to the IAS, Jeeva Anandan pointed out that my treatment 
was incomplete, and sketched what was needed to improve it. I filled in the details 
and drafted a manuscript, which became a joint paper (Adler and Anandan, 1996, 
R61) that was published in the Larry Horwitz Festschrift issue of Foundations of 
Physics . A second issue that was left hanging was the analog of coherent states 
in quaternionic quantum theory. My thesis student Andrew Millard and I studied 
this, and wrote a paper (Adler and Millard, 1996a, R62) giving the extension of 
the Perelomov coherent state formalism to quaternionic Hilbert spaces. We also 
showed that the closure requirement forces an attempted quaternionic generalization 
of standard coherent states based on the Weyl group to reduce back to the complex 
case, settling a question raised in discussions with me by John Klauder. The other 
issues that were dealt with after publication of the quaternionic book were the 
structure of quaternionic projective representations, and the relationship between 
standard complex quantum mechanics and the dynamics based on a trace variational 
principle that I had introduced in the field theory chapter of the 1995 book. These 
form the subject of the next two sections. 


Quaternionic Projective Group Representations 

Given two group elements a with product &a, a unitary operator representation Ub 
in a Hilbert space is defined by UbU a = U & a . A more general type of representation, 
called a ray or projective representation, is relevant to describing the symmetries of 
quantum mechanical systems. In his famous paper on unitary ray representations 
of Lie groups, Bargmann (1954) defines a projective representation as one obeying 
UbU a = Ub a ^(b,a)^ with u ;(&, a) a complex phase. 

This definition is familiar, and seems obvious, until one asks the following ques¬ 
tion: Bargmann’s definition is assumed to hold as an operator identity when acting 
on all states in Hilbert space. However, we know that it suffices to specify the 
action of an operator on one complete set of states in Hilbert space to specify 
the operator completely. Hence why does one not start instead from the definition 
UbU a \f) — Uba\f)^(f\b i a) } with {|/)} one complete set of states, as defining a pro- 


Quarternionic Quantum Mechanics, Trace Dynamics, and Emergent Quantum Theory 


105 


jective representation in Hilbert space? Let us call Bargmann’s definition a “strong” 
projective representation, and the definition with a state-dependent phase a “weak” 
projective representation. Then the question becomes that of finding the relation 
between weak and strong projective representations. 

Although I have formulated this question here in complex Hilbert space, it arose 
and was solved in the context of quaternionic Hilbert space, where the phases 

quaternions, which obey a non-Abelian group multiplication law iso¬ 
morphic to 50(3) ~ SU( 2). The strong definition was adopted for the quaternionic 
case by Emch (1963, 1965), but in Sec. 4.3 of my book on quaternionic quantum 
mechanics I introduced the weak definition in order for quaternionic projective rep¬ 
resentations to include embeddings of nontrivial complex projective representations 
into quaternionic Hilbert space; the state dependence of the phase is necessary be¬ 
cause even a complex phase to does not commute with general quaternionic rephas¬ 
ings of the state vector |/). I noted in my 1995 book that the weak definition can 
be extended to an operator relation by defining 

tt{b,a) = '^2\f)u>(f-b,a)(f\ , 

f 

so that the weak definition then takes the form 

U b U a = U ba Q,(b,a) , 

which gives the general operator form taken by projective representations in quater¬ 
nionic quantum mechanics. I also introduced in Sec. 4.3 of my book two special¬ 
izations of this definition, motivated by the commutativity properties of the phase 
factor in complex projective representations. I defined a multicentral projective rep¬ 
resentation as one for which 

[£}(&, a),U a ] = [£}(&, a), t4] = 0 

for all pairs &, a (note that in Eq. (4.51a) of my book, U ab should read C/^, so that the 
two conditions just given suffice), and I defined a central projective representation 
as one for which 


[«(M),E>c] = o 

for all triples a,&,c. 

Subsequent to the completion of my book, I read Weinberg’s first volume on 
quantum field theory (Weinberg, 1995) and realized, from his discussion in Sec. 2.7 
of the associativity condition for complex projective representations, that there must 
be an analogous associativity condition for weak quaternionic projective represen¬ 
tations. I worked this out (Adler, 1996, R63), and showed that it takes the operator 
form 


U a 1 fi(c, b)U a = f2(c&, a) 1 fi(c ; ba)Q(b, a) 


j 
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which by the definition of fi(6,a) shows that U a 1 fi(c ,b)U a is diagonal in the basis 
{|/)}, with the spectral representation 

U~ l ^{^b)U a = ^ \f)u(f]cb,a)u>(f]c,ba)u>(f]b,a)(f\ . 

f 

On the basis of some further identities, I also raised the question of whether one can 
construct a multicentral representation that is not central, or whether a multicentral 
representation is always central. 

Subsequently, I discussed the issues of quaternionic projective representations 
with Andrew Millard. He explained them to his roommate Terry Tao, a mathematics 
graduate student working for Elias Stein, and at my next conference with Andrew, 
Tao came along and presented the outline of a beautiful theorem that he had devised. 
This was written up as a paper of Tao and Millard (1996), and consists of two parts. 
The first part is an algebraic analysis based on the spectral representation given 
above, which leads to the following theorem 

Structure Theorem: Let U be an irreducible projective representation of a 
connected Lie group G. There then exists a reraying of the basis \f) under which 
one of the following three possibilities must hold. 

(1) U is a real projective representation. That is, to(f; &, a) = u;(6, a) is independent 
of |/) and is equal to ±1 for each b and a. 

(2) U is the extension of a complex projective representation. That is, the matrix 
elements (f\U a \f) are complex and u ;(/;&, a) = a;(6 ,a) is independent of |/) 
and is a complex phase. 

(3) U is the tensor product of a real projective representation and a quaternionic 

phase. That is, there exists a decomposition U a = \f)°'a(f\, where the 

unitary operator U® has real matrix elements, a a is a quaternionic phase, and 
Ug = ±UgU* for all b and a. 

Prom the point of view of the Structure Theorem, case (1) corresponds to the 
only possibility allowed by the strong definition of quaternionic projective represen¬ 
tations, as demonstrated earlier by Emch (1963, 1965), while case (2) corresponds to 
an embedding of a complex projective representation in quaternionic Hilbert space, 
the consideration of which was my motivation for proposing the weak definition. Spe¬ 
cializing the Structure Theorem to a complex Hilbert space, where case (3) cannot 
be realized, we see that in complex Hilbert space the weak projective representation 
defined above implies the strong projective representation; hence no generality is 
lost by starting from the strong definition, as in Bargmann’s paper. 

The second part of the Tao-Millard paper is a proof, by real analysis methods, 
of a Corollary to the structure theorem, stating 
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Corollary 1: Any multicentral projective representation of a connected Lie group 
is central 

This thus solved the question of the relation of centrality to multicentrality that 
I raised in my paper R63. 

Subsequent to this work, I had an exchange with Gerard Emch in the Journal of 
Mathematical Physics debating the merits of the strong and weak definitions. After 
a visit to Gainesville where we reconciled differing notations, we wrote a joint paper 
(Adler and Emch, 1997, R64) clarifying the situation, and reexpressing the strong 
and weak definitions in the language and notation often employed in mathematical 
discussions of projective group representations. 


Trace Dynamics and Emergent Quantum Theory 

My work on emergent quantum theory arose from the merging of two lines of 
thought. The first line of thought arose from answering the question of whether 
quaternionic quantum mechanics ameliorates the measurement problem of standard 
quantum mechanics; the answer is “no”, because quaternionic quantum theory still 
has a unitary time evolution, and so the usual problems persist. However, in the 
course of working this through I read some of the literature on the measurement 
problem in standard quantum theory, and came away convinced that there were real 
issues to be addressed. The second line of thought arose from my attempts to con¬ 
struct quaternionic quantum field theories. I found that the canonical quantization 
method could not be extended to the quaternionic case, and so I had to resort to 
an alternative formalism, which I variously called “generalized quantum dynamics”, 
“total trace dynamics”, or finally, simply “trace dynamics”, to generate operator 
equations without “quantizing” a classical theory. This was done by using a vari¬ 
ational principle based on a Lagrangian constructed as a trace of noncommuting 
operator variables, making systematic use of cyclic permutation under the trace op¬ 
eration. These ideas were developed in the paper Adler (1994b) and were described 
in Chapter 13 of my 1995 book; in Chapter 14, I suggested that the nonlinearity 
of trace dynamics could make it relevant for resolving the measurement problem 
in quantum theory. However, the problem of relating the trace dynamics formal¬ 
ism to the standard canonical formalism of complex quantum field theory remained 
unsolved. 

One of the questions I had posed to Andrew Millard was that of better under¬ 
standing trace dynamics, in the hope of finding a connection to standard quantum 
theory. After I arrived in Aspen in the summer of 1995, Andrew sent me a memo 
containing his discovery that in trace dynamics with a Weyl symmetrized Hamilto¬ 
nian and noncommuting boson degrees of freedom q ri p r , the operator ^ r [q r ^Pr\ is 
conserved. I soon found that the generalization to include fermions is the conserved 
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operator that we denoted by C, defined by 

C= [(lr,Pr] - ^2 {q r ,Pr} , 

r , boson r, fermion 

and that this operator is conserved as long as the trace Hamiltonian has no fixed 
operator coefficients, which is equivalent to saying the the trace Hamiltonian has 
a global unitary invariance. It then seemed natural to suggest that the equiparti- 
tioning of C in a statistical thermodynamical treatment would provide the missing 
connection between trace dynamics and standard quantum mechanics. 

The implementation of this idea was published in Adler and Millard (1996b), and 
I developed it further over the following years with many collaborators, as described 
in Sec. 5 of the “Introduction and Overview” that opens my 2004 book on emergent 
quantum theory. This book, which is set within the framework of complex Hilbert 
space, gives a complete, self-contained development of trace dynamics as a (non- 
commutative) dynamics underlying quantum theory. From the statistical mechanics 
of this underlying theory there emerge, in a mutually complementary way, both the 
unitary and the nonunitary parts of orthodox quantum theory. The unitary part of 
quantum theory (the canonical algebra and the Heisenberg representation time evo¬ 
lution of operators) comes from an application of generalized equipartition theorems 
in the statistical thermodynamics of trace dynamics. The nonunitary part of quan¬ 
tum theory, in the form of stochastic state vector reduction models from which the 
Born rule for probabilities can be derived, comes from the Brownian motion correc¬ 
tions to this thermodynamics. Thus, trace dynamics provides a unified framework 
from which both the unitary dynamics of quantum systems, and the nonunitary 
evolution describing state vector reduction associated with measurements, emerge 
in a natural way. 

Although quantum mechanics and quantum field theory have been the undis¬ 
puted basis for all progress in fundamental physics during the last 80 years, the 
extension of the current theoretical frontier to Planck scale physics, and recent en¬ 
largements of our experimental capabilities, may make the 21st century the period 
in which possible limits of quantum theory will be probed. My 2004 book suggests a 
concrete framework for exploration of the proposition that quantum mechanics may 
not be the final layer of fundamental theory. It also addresses the phenomenology 
of modifications to quantum theory, specifically as implemented through stochastic 
additions to the Schrodinger equation. I have continued with these phenomenolog¬ 
ical studies since completion of the book; my most recent papers (Bassi, Ippoliti, 
and Adler, 2005; Adler, Bassi, and Ippoliti, 2005; Adler, 2005) have dealt with ana¬ 
lyzing possible tests of stochastic localization theories in nanomechanical oscillator 
and gravitational wave detector experiments. 


QuarternionicQuantum Mechanics, Trace Dynamics, and Emergent Quantum Theory 


109 


References for Chapter 9 

Adler, S. L. (1979). Algebraic Chromodynamics. Phys. Lett B 86, 203-205. [103] 

Adler, S. L. (1980). Quaternionic Chromodynamics as a Theory of Composite Quarks and 
Leptons. Phys. Rev. D 21, 2903-2915. [103] 

Adler, S. L. (1988). Scattering and Decay Theory for Quaternionic Quantum Mechanics, 
and the Structure of Induced T Nonconservation. Phys. Rev. D 37, 3654-3662. [103] 

Adler, S. L. (1989). A New Electroweak and Strong Interaction Unification Scheme. Phys. 
Lett B 225, 143-147. [104] 

Adler, S. L. (1994a). Composite Leptons and Quarks Constructed as Triply Occupied Quasi¬ 
particles in Quaternionic Quantum Mechanics. Phys. Lett. B 332, 358-365. [103] 

Adler, S. L. (1994b). Generalized Quantum Dynamics. Nucl. Phys. B 415, 195-242. [107] 

Adler, S. L. (1996) R63. Projective Group Representations in Quaternionic Hilbert Space J. 
Math. Phys. 37, 2352-2360. [105-107] 

Adler, S. L. (2005). Stochastic Collapse and Decoherence of a Non-Dissipative Forced Har¬ 
monic Oscillator. J. Phys. A: Math. Gen. 38, 2729-2745. [108] 

Adler, S. L. and J. Anandan (1996) R61. Nonadiabatic Geometric Phase in Quaternionic 
Hilbert Space. Found. Phys. 26, 1579-1589. [104] 

Adler, S. L., A. Bassi, and E. Ippoliti (2005). Towards Quantum Superpositions of a Mirror: 
An Exact Open Systems Analysis - Calculational Details. J. Phys. A: Math. Gen. 38, 
2715-2727. [108] 

Adler, S. L. and G. G. Emch (1997) R64. A Rejoinder on Quaternionic Projective Repre¬ 
sentations. J. Math. Phys. 38, 4758-4762. [107] 

Adler, S. L. and A. C. Millard (1996a) R62. Coherent States in Quaternionic Quantum 
Mechanics. J. Math. Phys. 38, 2117-2126. [104] 

Adler, S. L. and A. C. Millard (1996b). Generalized Quantum Dynamics as Pre-Quantum 
Mechanics. Nucl. Phys B 473, 199-244. [108] 

Bargmann, V. (1954). On Unitary Ray Representations of Continuous Groups. Ann. Math. 
59, 1-46. [104-106] 

Bassi, A., E. Ippoliti, and S. L. Adler (2005). Towards Quantum Superpositions of a Mirror: 
An Exact Open Systems Analysis. Phys. Rev. Lett. 94, 030401. [108] 

Emch, G. (1963). Mecanique Quantique Quaternionienne et Relativite Restreinte. I, Helv. 
Phys. Acta 36, 739-769; II, Helv. Phys. Acta 36, 770-788. [105,106] 

Emch, G. (1965). Representations of the Lorentz Group in Quaternionic Quantum Mechanics 
(presented at the summer 1964 Lorentz Group Symposium), in Lectures in Theoretical 
Physics Vol. VIIA, W.E. Brittin and A. O. Barut, eds. (University of Colorado Press, 
Boulder), pp. 1-36. [105,106] 



110 


Adventures in Theoretical Physics 


Harari, H. (1979). A Schematic Model of Quarks and Leptons. Phys. Lett. B 86, 83-86. [103] 

Peres, A. (1979). Proposed Test for Complex versus Quaternion Quantum Theory. Phys. 
Rev. Lett 42, 683-686. [103] 

Shupe, M. A. (1979) A Composite Model of Leptons and Quarks. Phys. Lett. B 86, 87-92. 
[103] 

Tao, T. and A. C. Millard (1996). On the Structure of Projective Group Representations in 
Quaternionic Hilbert Space. J. Math. Phys. 37, 5848-5857. [106] 

Weinberg, S. (1995) The Quantum Theory of Fields , Volume I: Foundations (Cambridge 
University Press, Cambridge), Sec. 2.7. [105] 



Ill 


10. Where Next? 


In looking back at my work, I see one pattern that is repeated over and over. Many 
of the most interesting research results that I have obtained were unanticipated 
consequences of other, quite different research programs. In the course of detailed 
calculations, or speculative explorations, I noticed something that seemed worth 
pursuing, even though tangential to my original motivations, and this new direction 
ended up being of much greater interest. This happened with my calculations of 
weak pion production, which led as spin-offs to the forward lepton theorem, the 
neutrino sum rule, and soft pion theorems. It happened again with my exploration 
of gauging of the axial-vector current as an explanation for the muon mass, which led 
to anomalies. My interest in an eigenvalue condition in QED led to the calculation 
of photon splitting, and later on to an improved method for analyzing collider data. 
My attempts at a composite graviton led to an investigation of Einstein gravity as 
a symmetry breaking effect. My interest in the (spurious) Argonne threshold events 
induced me to extend my weak pion work to neutral currents, which contributed to 
the first unique determination of the electroweak couplings by Abbott and Barnett. 
My attempt to relate monopole background fields to confinement played a role in the 
multimonopole existence proof of Taubes. My computational experience in solving 
effective action confinement models led to overrelaxation as an acceleration method 
for Monte Carlo. And most recently, my interest in composite models for quarks 
and leptons led to a long exploration of the fundamentals of quantum theory, first 
through my study of quantum theory in quaternionic Hilbert space, and growing 
out of that, through my development of trace dynamics as a possible pre-quantum 
theory. 

I think this pattern is no accident, but rather a reflection of my guiding philoso¬ 
phy in doing research, which has been that it is more important to start somewhere, 
even with a speculative idea or an apparently routine calculation, than to sit around 
waiting for an “important” idea. Once immersed in the nitty-gritty of an investiga¬ 
tion, things have a way of appearing, that often lead off in very fruitful directions. 
So given this, when I look ahead, I can only say the following: I have some rough 
ideas as to where I would like to start in new explorations in fundamental theory 
and particle phenomenology, but I cannot say where these may ultimately lead, in 
the course of my continuing adventures in theoretical physics. 


112 


Index of Names in the Commentaries 


Abbott, L. F. 70, 71, 111 
Abers, E. S. 38, 49 
Acharya, R. 60, 62 
Achasov, N. N. 52 
Adler, J. 30 
Adler, V. 100 
Akhoury, R. 84, 85, 94 
Alkhofer, R. 92, 94, 96 
Allasia, D. 18, 25 
Amundsen, P. A. 92, 94, 96 
Anandan, J. 104, 109 
Anderson, P. W. 3, 9, 25 
Atick, J. 101 
Aviv, R. 66, 72 
Ayon-Beato, E. 76, 80 

Bahcall, J. N. 56-58, 63 
Baier, V. N. 58, 63 
Baker, M. 45, 46, 58, 60, 63, 64, 91, 
94 

Baker, N. J. 21, 25 
Balian, R. 61-63 
Ball, J. S. 91, 94, 95 
Banks, T. 93, 95 
Barbieri, R. 67, 68, 72 
Bardeen, W. A. 35-38, 41-43, 46, 49, 
50, 58-60, 63 

Bargmann, V. 104-106, 109 

Barnett, R. M. 70, 71, 111 

Bassi, A. 108, 109 

Beg, M. A. B. 45, 48, 50 

Bekenstein, J. 76, 80 

Bell, J. S. 7, 8, 25, 32-34, 40, 50 


Belavin, A. A. 62, 63 
Beneke, M. 67, 72, 79, 80 
Berg, D. 57, 58, 64 
Bergmann, P. G. 74, 80 
Bernardini, G. 7 
Bernstein, J. 5, 6, 10, 14, 25 
Bhanot, G. V. 100-102 
Bialynicka-Birula, Z. 56, 63 
Bialynicki-Birula, I. 56, 63 
Bicudo, P. 92, 95 
Binder, K. 99, 102 
Birman, J. L. 1-3 
Bjorken, J. D. 17-20, 25, 28, 30, 

34-36, 44-46, 50 
Block, M. M. 7 

Bogomolny (Bogomol’nyi), E. B. 

61-63, 83, 95 
Bolton, T. 18, 26 
Borie, E. 68, 72 
Bott, R. 85, 86 
Bouchiat, C. 44, 50 
Boulware, D. G. 36, 49 
Brandt, R. A. 39, 50 
Broadhurst, D. J. 19, 25, 67, 72 
Brown, F. R. 99, 100, 102 
Brown, L. S. 84, 95 
Brown, R. W. 38, 49, 54 
Bugg, D. V. 10, 25 

Cabibbo, N. 17, 25 

Callan, C. G. 16, 18, 25, 39, 42, 44-46, 
48, 50, 55, 56, 58-60, 63 
Callias, C. 85, 95 



Index of Names 


113 


Candelas, P. 78 
Carlitz, R. D. 84, 95 
Casher, A. 93, 95 
Caticha, A. 91, 95 
Chanowitz, M. S. 47, 48, 50, 51 
Chew, G. F. 5, 19, 25, 26, 28 
Chou, K.-C. 5, 26 
Christ, N. 46, 51 
Christenson, J. H. 9, 26 
Cohen, M. H. 2, 3 

Coleman, S. R. 15, 32, 33, 47, 51, 77, 
81 

Colglazier, E. W. 69-71 
Collins, J. C. 48, 49, 51 
Conrad, J. M. 18, 26 
Crater, H. W. 90, 95 
Crawford, J. D. 74 
Creamer, D. B. 84, 95 
Creutz, M. 99, 100, 102 
Crewther, R. J. 47, 51 
Cronin, J. W. 9, 26 . 

Cvetic, G. 67, 72 

Dashen, R. F. 11, 14, 15, 17, 20-22, 
24, 26, 41, 42, 59, 67, 71 
David, F. 78, 79, 81 
Davis, A. C. 92-95 
de Forest, T. 17, 26 
Delbourgo, R. 92 
D’Elia, M. 93, 95 
de Forcrand, P. 100, 102 
de Rafael, E. 67, 73 
De Rujula, A. 67, 72 
Deser, S. 74, 81 
Dib, C. 67, 72 
Dicke, R. H. 74 
Dicus, D. A. 38, 49 
Di Giacomo, A. 93, 95 
Dirac, P. A. M. 61, 64 
Dorokhov, A. 67, 72 
Dothan, Y. 20, 24 


Drechsel, D. 21, 28 
Drell, S. D. 30, 50 
Duncan, A. 48, 49, 51 
Dyson, F. J. 20 

Ehrenreich, H. 2, 3, 9, 10 
Eidelman, S. 67, 72 
Einstein, A. 74 
Ellis, G. F. R. 74, 81 
Ellis, J. 47, 48, 50, 51 
Emch, G. 105-107, 109 
Ericson, T. E. O. 68, 72 

Faissner, H. 7, 8, 26 
Farrar, G. R. 66, 71 
Fateyev, V. A. 62, 63 
Fearing, H. W. 20, 26 
Federbush, P. G. 59, 64 
Fermi, E. 69 

Ferrenberg, A. M. 99, 102 

Feynman, R. P. 5, 10, 16, 26 

Fitch, V. L. 9, 26 

Forgacs, P. 87, 95 

Fortson, N. 74 

Franzinetti, C. 7 

Fritzsch, H. 35, 45, 46, 50, 51 

Fubini, S. 5, 14, 18, 21, 25, 26 

Fukuda, H. 34, 51 

Fulling, S. 74, 81 

Furlan, G. 14, 21, 25, 26 

Gell-Mann, M. 5, 6, 9-11, 14-17, 20, 
22, 25, 26, 35, 45, 46, 48, 50, 51, 
60 

Georgi, H. 67, 72 
Gerstein, I. S. 41, 51 
Gibbons, G. W. 76, 81 
Gilman, F. J. 19, 21, 24, 35, 51 
Goldberger, M. L. 4-6, 11, 13, 26, 74 
Goldhaber, A. S. 4, 39, 65, 71, 85, 94 
Goldstone, J. 22, 26 
Goodman, J. 99, 102 


114 


Adventures in Theoretical Physics 


Gorringe, T. 20, 26 

Gottfried, K. 19, 27 

Gounaris, G. 38, 54 

Gross, D. J. 44-46, 50, 51, 60, 63, 77 

Giirsey, F. 14, 27 

Hagen, C. R. 39, 51 
Han, M. Y. 35, 51 
Hara, Y. 35, 51 
Harari, H. 103, 110 
Hasslacher, B. 46, 51 
Hawking, S. W. 74, 81 
Haymaker, R. W. 91, 95 
Healy, J. B. 69-71 
Hill, C. T. 48 
Hinchliffe, I. 90, 95 
Holland, K. 100, 102 
Hopfield, J. J. 1, 3 
Houston, P. 85, 95 
Horvath, Z. 87, 95 
Horwitz, L. 104 

Iliopoulos, J. 44, 50 
Ioffe, B. L. 44, 53 
Ippoliti, E. 108, 109 
Itzykson, C. 61-64 
Iwasaki, Y. 47, 48, 51 

Jackiw, R. 32-34, 36-41, 44, 45, 47, 
50-52, 60, 62-64 
Jaffe, A. M. 14, 85, 86, 95 
Jahn,O. 100, 102 
Jauch, J. M. 31, 52 
Jegerlehner, F. 67, 72 
Joglekar, S. D. 48, 51 
Johnson, K. 36-40, 44-46, 52, 58-60, 
63, 64 

Jona-Lasinio, G. 14, 27 
Joos, P. 21, 27 
Jun, J.-H. 84, 94 

Karliner, I. 69-72 


Kataev, A. L. 19, 25, 27, 67, 72 
Khuri, N. N. 78, 81 
Kiskis, J. 100, 102 
Kitagaki, T. 22, 27 
Klauder, J. R. 104 
Kleefeld, F. 10, 28 
Klein, O. 77, 78, 81 
Klevansky, S. P. 92, 95 
Kopeliovich, B. Z. 8, 27 
Krishnan, R. 101, 102 

Lagae, J.-F. 92, 95 
Lahiri, A. 76, 81 
Lautrup, B. 60, 64 
Lee, B. W. 27, 42, 49, 65, 66, 68, 72 
Lee, C. 84, 95 
Lee, T. 67, 72 
Lee, T.-D. 17, 20 
Lee, W. 68, 72 
Lehmann, H. 89, 90, 95 
Lemmer, R. H. 92, 95 
Levy, M. 5, 6, 10, 26 
Lieberman, J. 69-72, 75, 76, 80 
Llewellyn Smith, C. H. 8, 19, 20, 27 
Low, F. E. 5, 12, 20, 26, 27, 36, 44-46, 
52, 60 

Lucini, B. 93, 95 
Lurie, D. 6, 11-13, 27 

Mack, G. 47, 52 
Maki, A. 35, 52 
Mangano, M. L. 8, 27 
Margolis, B. 90, 95 
Marques, G. M. 92, 95 
Martin, A. 90 
Martin, P. C. 4, 39 
Matheson, A. M. 92, 95 
Matinyan, S. G. 87, 96 
Matsuki, T. 91, 95 
Maxwell, C. J. 19, 25 
Mendel, R. R. 90, 95 


Index of Names 


115 


Mentzel, M. 57, 58, 64 
Meyer, H. B. 100, 102 
Meyer, Ph. 44, 50 
Michel, L. 5, 6, 10, 14, 25 
Millard, A. C. 104, 106-110 
Milstein, A. I. 58, 63 
Milton, K. A. 67, 73 
Minguzzi, A. 56, 64 
Minkowski, P. 48, 52 
Misner, C. W. 74, 81 
Miyamoto, Y. 34, 35, 51, 52 
Monsay, E. H. 70, 73 
Mottola, E. 84, 85, 96 
Mueller, A. H. 46, 51 

Nahm, W. 84, 96 

Nambu, Y. 5, 6, 10-14, 21, 26, 27, 35, 
51, 52, 92 

Narayana Swamy, P. 60, 62 
Narayanan, R. 93, 96, 100, 102 
Neuberger, H. 91-94, 96, 100, 102 
Ng, Y. J. 69-72, 75, 80 
Nielson, N. K. 48, 52 
Nishijima, K. 13, 14, 27 
Nussinov, S. 68-70, 72 

Oakes, R. J. 22, 26, 50 
Okubo, S. 35, 52 
Oppenheimer, J. R. 20 
O’Raifeartaigh, L. 85-87, 95 
Ore, F. R. 62, 64 

Paczynski, B. 57 

Paffuti, G. 93, 95 

Pagels, H. 48, 52, 87, 96 

Pais, A. 17, 20 

Pakvasa, S. 70, 73 

Palla, L. 87, 95 

Parisi, G. 61-64 

Paschos, E. A. 22, 27, 68-70, 72 

Pasquini, B. 21, 28 

Pearson, R. B. 76, 77, 80 


Pepe, M. 100, 102 
Peres, A. 103, 110 
Peris, S. 67, 73 
Perkins, D. H. 68, 73 
Perrottet, M. 67, 73 
Pica, C. 93, 95 
Pipkin, F. 4 
Piran, T. 87-92, 94, 98 
Poggio, E. C. 67, 73 
Politzer, H. D. 46 
Polkinghorne, J. C. 38, 52 
Polyakov, A. M. 62, 63, 83, 96 
Prasad, M. K. 83, 96 
Pratap, M. 66, 73 
Preparata, G. 45, 52 
Preskill, J. 77, 81 

Quinn, H. R. 67, 73 

Radicati, A. 17, 25 
Rasetti, M. 100 
Ravndal, F. 8, 28 
Ray, D. 76, 77, 81 
Rebbi, C. 62, 64 
Redlich, N. 101 
Regge, T. 100 
Rein, D. 8, 22, 28 
Renner, B. 22, 26 
Rinker, G. A. 68, 72 
Ripka, G. 90, 91, 96 
Rohrlich, F. 31, 52 
Roper, L. D. 9, 28 
Rosenberg, L. 31, 52 
Rosenbluth, M. N. 56, 63 
Rosetti, S. 21, 25, 26 
Rupp, G. 10, 28 

Sakharov, A. D. 75, 77, 78, 81 
Sakuda, M. 22, 27 
Salam, A. 22, 26 
Sang, R. 57, 64 



116 


Adventures in Theoretical Physics 


Savvidy, G. K. 87, 96 
Sawyer, R. F. 66, 72 
Scadron, M. D. 92 
Schienbein, I. 22, 27 
Schmidt, I. 67, 72 
Schreiner, P. A. 22, 28 
Schubert, C. 57, 58, 63 
Schwinger, J. 4, 14, 28, 34, 39-41, 53, 
56, 64 

Sehgal, L. M. 8, 22, 28 
Sen, S. 38, 53 
Shaevitz, M. H. 18, 26 
Shaisultanov, R. Zh. 58, 63 
Sharp, D. 49 
Shcheredin, S. 100, 102 
Shirley, 14 

Shrauner, E. 6, 10-13, 21, 27, 28 
Shupe, M. A. 103, 110 
Sirlin, A. 46, 53 
Smith, J. 66, 73 
Sokal, A. D. 99, 102 
Solovtsov, I. L. 67, 73 
Solovtsova, O. R 67, 73 
Sommerfield, C. M. 83, 96 
Stein, E. M. 106 
Steinberger J. 34, 40, 53 
Stoneham, R. J. 57, 58, 64 
Strominger, A. 78, 79, 81 
Sutherland, D. G. 32, 33, 34, 37, 47, 
53 

Swanson, E. S. 92, 96 
Swendsen, R. H. 99, 102 
Symanzik, K. 39, 42, 45, 48, 53, 55, 
58, 59, 64 

Szczepaniak, A. P. 92, 96 
Tao, T. 106, 110 

Taubes, C. 83, 85-87, 95, 96, 111 
Tavkhelidze, A. 35, 53 
Teitelboim, C. 78 
Teplitz, V. L. 38, 49 


Terent’ev, M. V. 43, 53 
Thirring, W. 5, 6, 25 
’t Hooft, G. 32, 53, 62, 64, 74, 81, 83, 
88, 96 

Thorne, K. S. 74, 81 
Tiator, L. 21, 28 
Tomboulis, E. 48, 52, 87, 96 
Tomozawa, Y. 16, 28 
Touschek, B. F. 13, 28 
Treiman, S. B. 4-6, 9, 11, 13, 16, 

25, 30, 31, 42, 43, 49, 65-68, 71, 
74 

Trottier, H. D. 90, 95 
Tsao, H.-S. 69-72, 75, 80 
Tuan, S. F. 70, 73 
Tung, W.-K. 41, 44-46, 50, 53 
Turlay, R. 9, 26 

Uhlenbeck, K. K. 85, 86 
Uy, Z. E. S. 66, 73 

Vafa, C. 92, 93, 96 
Vainshtein, A. I. 44, 53 
Van Alstine, P. 90, 95 
van Beveren, E. 10, 28 
van Nieuwenhuizen, P. 30 
Van Royen, R. 60, 64 
Veltman, M. J. G. 7, 10, 32, 33, 37, 
47, 53, 74, 81 
Veneziano, G. 48, 53 
Veretin, O. 67, 72 
von Dardel, G. 7 
von Hippel, F. 22, 28 
von Smekal, L. 92 ? 96 

Wald, R. M. 75, 76, 81, 82 
Walecka, J. p. 17, 26 
Wang, J.-S. 99, 102 
Ward, R. S. 87, 96 
Wataghin, V. 5, 26 
Weber, A. 8 
Weckel, J. D. 101, 102 



Index of Names 


117 


Weinberg, E. J. 85, 86, 97 
Weinberg, S. 16, 22, 26, 28, 44, 53, 67, 
73-75, 82, 105, 110 
Weingarten, D. 93, 97 
Weisberger, W. I. 15, 16, 21, 25, 29 
Werthamer, N. R. 3, 9 
Wess, J. 43, 53 
West, G. B. 60, 64 
Wheeler, J. A. 74, 81 
Whitmer, C. 99, 102 
Wiese, U. J. 100, 102 
Wilczek, F. 46, 77, 81, 82, 87 
Wilke, C. 58, 64 
Willey, R. S. 46, 58, 64 
Wilson, K. G. 45, 47, 53, 60 
Wiser, N. 2, 3 


Witten, E. 43, 54, 75, 82, 93, 96 
Woch, T. J. 99, 100, 102 
Wong, T. F. 38, 49, 54 
Wu, T. T. 89, 90, 95 
Wunner, G. 57, 58, 64 

Yang, C. N. 17, 20 
Yankielowicz, Y. 48, 53 
Young, B.-L. 38, 49, 54 
Yu, J. Y. 22, 27 

Zachariasen, F. 91, 94 

Zee, A. 42, 43, 49, 65, 66, 78, 82 

Zichichi, A. 7 

Zuber, J. B. 61-64 

Zumino, B. 39, 43, 53, 54 



This page is intentionally left blank 



REPRINTED PAPERS 



This page is intentionally left blank 



R1 


121 


Copyright © 2004 by The Abdus Salam International Centre for Theoretical Physics. Reprinted with permission. 
Photograph copyright© 1995 courtesy of Randall Hagadorn. 

One Hundred Reasons to be a Scientist 


From Elements of Radio to Elementary Particle Physics 

Stephen L. Adler 

Institute for Advanced Study at Princeton, USA 


I was bom in 1939 in New York City to Irving and 
Ruth Relis Adler. My father was a mathematics teacher 
and my mother had also majored in mathematics in 
college. I was directed towards science by my parents 
from an early age. When I was two years old my father 
built me a gadget box from pieces of hardware, and 
around the same time my mother made me a home-made 
version of the “Pat the Bunny” book, each page 
containing a different tactile or manual operation for me 
to perform. When I was older my father built me 
electrical toys such as telegraphs, a “burglar alarm” that 
rang a bell when a door was opened, and a miniature 
traffic light. We also engaged in nature activities, 
collecting snakes and butterflies. When I was eight I 
participated in a young people’s astronomy course at the 
Museum of Natural History in New York, and my 
fascination with the fossils I saw at this museum led me 
to think briefly of being a paleontologist, but this interest 
soon waned. 

My actual career path began in sixth grade of 
elementary school, when a classmate started to talk to me 
about his interest in radio; I visited him at home and saw 
his equipment and tools. This introduction developed into 
a serious interest in electricity, radio, and electronics 
© Courtesy of the institute for Advanced study while I was still in elementary school. I built various 

electrical devices, such as electric motors with rotor 
laminations cut from tin cans and permanent magnet 
stators taken from old radio loudspeakers. (I still have one of these on my bookcase at the Institute for 
Advanced Study). With encouragement from my father I read Marcus and Marcus’s classic World War II 
text “Elements of Radio”; my father made a point of letting me be the family radio expert, while he was the 
consultant on the few bits of algebra in the text. Also at my father’s suggestion, I started to canvass the 
neighborhood door to door, pulling a small wagon and asking for old radios, appliances, and television sets 
people were planning to throw away. I stripped the parts out of these, and used them to build radios, 
amplifiers, and even an oscilloscope using a salvaged 7-inch television tube. I also learned enough Morse 
code to get my Technician Class amateur radio license, and built a small rig using a war surplus aircraft 
receiver and a home-built transmitter. However, amateur radio activity did not interest me nearly as much as 
building electronic equipment, which I continued through various science projects in high school. 

Given this exposure to electronics, it would have been natural for me to pursue a career in electrical 
engineering, but towards the beginning of my high school years I got a first glimpse of the fascinating world 
of high energy physics research. For two summers my family had vacationed in a state park near Ithaca, NY, 
and Phillip Morrison, an old friend of my father’s, gave us a tour of the physics laboratories at Cornell, where 
Robert Wilson had built a succession of particle accelerators. I liked the ambience of these laboratories, and 
was impressed with the fact that if I pursued physics as a career I would learn and use electronics, but not 
necessarily the other way around. By my junior year in high school, I had decided on experimental physics as 
a career. 

My first physics research laboratory experience came at the end of my senior year in high school, when I 
attended a two-week course in X-ray diffraction techniques for industrial engineers given at Brooklyn 
Polytechnic Institute by Isadore Fankuchen, who would every now and then include a bright high school 
student in his class. I was able to do all the theoretical and laboratory work, and learned many things, such as 
crystal lattice structure and Fourier transforms, that are standard physicist’s tools. Almost immediately 
afterwards, I went to a summer job at Bell Labs in Manhattan, along with eight other science-oriented high 
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school graduates. Many of them had already learned calculus, and so I decided to teach myself calculus that 
summer. 

My father gave me his old calculus text, along with the sage advice to do every third problem—because I 
had to do problems to learn the material, but there was not time (and it would be too boring) to try to do all of 
them. So I spent my commuting time, and spare time at work, doing calculus problems. As a result, when I 
entered Harvard in the fall I was able to place directly into Advanced Calculus, which had a major impact in 
how fast I was able to proceed with my physics education. 

I entered college intending to be an experimentalist, but my friendships with various classmates, among 
them Daniel Quillen (later a Fields medalist) got me interested in mathematics. I found that I was very good 
at the theoretical aspects of my classes, but although competent in the laboratory, I lacked the touch of the 
gifted experimentalist. So, by the middle of my freshman year, I had decided to shift my sights from 
experimental to theoretical physics. Along with Fred Goldhaber, who was to be my first year roommate in 
graduate school at Princeton, I took essentially the whole graduate course curriculum at Harvard during my 
junior and senior years. Memorable teachers at Harvard included Ed Purcell, Frank Pipkin, Paul Martin, and 
Julian Schwinger. As a result of my Harvard preparation, at Princeton I was able to take my General Exams 
at the end of the first year, and then to start thesis research with Sam Treiman at the beginning of my second 
year. 

Treiman suggested that I look for calculations to do in the newly emerging area of accelerator neutrino 
experiments, and this was the beginning of my career in high energy physics. A major part of my thesis work 
was a calculation of pion production from nucleons (protons or neutrons) by a neutrino beam. Although this 
was a long and tedious project, it gave me a detailed introduction to the “vector” and “axial-vector” currents 
through which neutrinos interact with nucleons. This knowledge growing out of my thesis project was the 
foundation for my most significant scientific contributions during the period 1964 through 1972, which all 
involved in some way or another the discovery of further results connected with vector and axial-vector 
currents. These included various low energy theorems for pion emission derived from the hypothesis of a 
“partially conserved” axial vector current, various sum rules including the Adler-Weisberger sum rule for the 
axial vector coupling to nucleons and a sum rule for deep inelastic high energy neutrino scattering cross 
sections, as well as the co-discovery (along with Bell and Jackiw) of the “anomalous” divergence properties 
of the axial-vector current. The theoretical analysis of anomalies led to a deeper understanding of neutral 
pion decay into gamma rays, provided one of the first pieces of evidence for the fact that each quark comes in 
three varieties (now called “colors”), and has had a multitude of other consequences for theoretical physics 
over the last thirty-five years. 

During the years since 1972,1 have worked on a variety of other topics in theoretical high energy physics, 
including neutral current phenomenology, strong field electromagnetic processes (such as photon splitting 
near pulsars), and acceleration methods for Monte Carlo simulation algorithms. Throughout the last twenty 
years I have devoted about half of my research time to studying embeddings of standard quantum mechanics 
in larger mathematical frameworks. One aspect of this work involved a detailed study of a quantum 
mechanics in which quaternions replace the usual complex numbers. Another, more recent aspect, has 
involved the study a possible “pre-quantum” mechanics based on properties of the trace of a matrix, from 
which quantum theory can emerge as a form of thermodynamics. I have written books describing both of 
these studies. For the next few years, I plan to return to my original area of particle phenomenology, in the 
context of supersymmetric models for further unifying the elementary particles and the forces acting on them. 
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Theory of the Valence Band Splittings at k = 0 in Zinc-Blende 

and Wurtzite Structures 

Stephen L. Adler* 

General Telephone & Electronics Laboratories, Inc., Bayside Laboratories, Bayside, New York 

(Received September 28, 1961) 

A theory of the valence band splittings at k~0 in zinc-blende and wurtzite structures is proposed, in 
which the wurtzite levels are treated as perturbations of those in zinc blende. Starting from one-electron 
Hamiltonians for the two structures, the two-parameter formulas originally derived by Hopfield are ob¬ 
tained, with a minimum of approximations, along with explicit expressions for the parameters in terms of 
Hamiltonian matrix elements. The two-parameter formulas are compared with experimental data and 
agreement is found to be good. A simple tight-binding (linear combination of atomic orbitals), 3p valence 
band, point-ion lattice model is used to calculate an effective charge for ZnS from the known valence band 
splittings in the wurtzite and zinc-blende dimorphs; a value of 2.3e is obtained. 


1. INTRODUCTION 2. THEORY 


T HE object of this paper is to explore a theory of 
the k— 0 valence band energy splittings and wave 
functions in zinc-blende and wurtzite structures. This is 
of interest because recent experimental work on hex¬ 
agonal (wurtzite) and cubic (zinc blende) ZnS, 1 hex¬ 
agonal CdS, 1 CdSe, 2 and ZnO, 3 cubic ZnSe, 4 and other 
II-VI semiconductors has made available data on their 
valence band energy splittings and wave function 
symmetries. 

Previous theoretical work on wurtzite and zinc-blende 
valence band splittings has been reported by Birman 6 
and by Hopfield. 6 Birman’s theory did not include the 
spin-orbit interaction. Hopfield’s work, based on a quasi- 
cubic model of the wurtzite structure, gave useful 
formulas for the wurtzite and zinc-blende valence band 
splittings, which fit experimental data obtained for ZnS 
to within 10%. The theory presented below starts from 
the rigorous one-electron Hamiltonians for wurtzite and 
zinc blende and yields, with a minimum of approxima¬ 
tions, the formulas proposed by Hopfield. In addition, 
the crystal field and spin-orbit parameters are expressed 
in terms of matrix elements of the wurtzite and zinc- 
blende Hamiltonians. 

Although the anion and cation are for the sake of 
definiteness assumed to be S and Zn, this does not enter 
into the derivation. The two-parameter formulas should 
be valid in other substances than ZnS crystallizing in 
zinc-blende and wurtzite dimorphs, provided that the 
approximations made in the derivation still hold. 


* Summer visitor, 1960; now at Physics Department, Princeton 
University, Princeton, New Jersey. 
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Some details of the zinc-blende and wurtzite ge¬ 
ometries will be needed. 7 In zinc blende there are two 
atoms per unit cell, and in wurtzite, four. Call the two 
sulfur atoms in the wurtzite basis sulfur 1 and 2, re¬ 
spectively. The nearest-neighbor configurations in zinc 
blende and ideal wurtzite are identical and the second- 
nearest-neighbor (nearest like ion) configurations are 
nearly so (Fig. 1). This local structural similarity will be 
exploited by using axis systems for the two structures in 
which the nearest neighbors of the sulfur 1 site in the 
wurtzite basis and the sulfur site in the zinc blende basis 
have identical coordinates. The axis systems which will 
be used throughout the rest of this paper, unless ex¬ 
plicitly stated otherwise, are illustrated in Fig. 2. Note 
that the axes for zinc blende are not the usual Cartesian 
axes employed for this structure. 5 

The Hamiltonians for band-theoretic treatment of 
wurtzite and zinc blende, including in each the spin- 
orbit interaction term, are 

#zb= P 2 /2 M+ v ZB (r) + (V V ZB X p) • o, 

#w= p 2 /2w+ F w (r)+ (#/4w 2 c 2 ) (V F W X p) v, 

where Fzb and Fw are the respective crystal potentials 
in the two substances. A modification of the linear com¬ 
bination of atomic orbitals (LCAO) procedure will be 
used to find expressions for the valence band energy 
splittings and wave functions at k=0 in the Brillouin 
zone. In the usual LCAO formalism, 8 the Hamiltonian 
at k=0 is diagonalized in a space spanned by the cell 
periodic functions X^j^ ft (r— R/). Here \f/ n is 

a free-ion orbital, n denotes a complete set of quantum 
numbers and R/ is the position of the 0th basis atom 
in the j th unit cell. In the modification used in this 
paper, the rigorous zinc-blende (ZB) Bloch functions 
at k=0 are expanded in Wannier functions, 

y n ZB = N -t £-x ft ( r -R/), 

where R/ is the coordinate of the sulfur atom in the 


7 J. L. Birman, Phys. Rev. 109, 810 (1958). 

8 J. C. Slater and G. F. Koster, Phys. Rev. 94, 1498 (1954). 


Copyright© 1962 by the American Physical Society. Reprinted with permission. 


Adventures in Theoretical Physics 


124 


VALEN CE BAND SP 


zinc blende wurtzite 



Fig. 1. First and second neighbors in zinc blende and wurtzite. 
Large circles are S atoms, small ones Zn. Open circles are in the 
same plane. Comparing the two structures, we note that only 
three of the twelve second neighbors differ, and even these are 
disposed symmetrically. These are shown as the 3 atoms “above" 
and are rotated by tt/ 3 in zinc blende with respect to their 
positions in wurtzite. 

yth cell in zinc blende, These functions form an ortho¬ 
normal set (('Fft^ B |'I r n - 2B ) = 5„n') and are a complete 
cell-periodic set in zinc blende. In wurtzite (W), the 
Hamiltonian matrix at k=0 will be computed using as 
basis functions the linear combinations of zinc-blende 
Wannier functions, 

[X(r-R/)±X„(r-R/)], 

where R/ and R/ are the coordinates of the two sulfur 
atoms in the cell in wurtzite. We remark that 
<T'„ W+ 1 = (^ ft w+ | ffwl ^ft^ w ”) = 0; in other words, 

the wurtzite Hamiltonian matrix breaks up into two 
submatrices spanned, respectively, by the functions 
'F 7l w+ and This is proved by noting that the 

symmetry operation C 2 , the twofold screw axis in wurt¬ 
zite, which interchanges type-1 and type-2 sites, leaves 
H w and the ^ n w+ functions invariant while changing 
the sign of the functions. Furthermore, the func¬ 
tions ^ ft w+ and ^ n w “ are similar in form, respectively, 
to the zinc-blende valence band wave functions 
^nk ZB ~ iV r “*X^j , exp(tk'R 3 )X n (r— R,-) at the points 
k= (0,0,0) and k= (0,0,2t/c) in the zinc-blende Bril- 
louin zone. Consequently, the wurtzite energy eigen¬ 
values determined by the submatrix (\p n w +| Hw|^n' W+ ) 
correspond to levels at k= (0,0,0) in zinc blende, while 
those determined by the submatrix 
correspond to levels at k= (0,0,2t/c), which is the point 
A at the Brillouin zone edge in zinc blende. Only the 
wurtzite levels corresponding to those at k= (0,0,0) in 
zinc blende will be considered. 

To a good approximation, the functions 'F„ w+ are an 
orthonormal set. The reason is that the nearest like ion 
(second-nearest neighbor) configurations in wurtzite and 
zinc blende are almost identical, and consequently 
nearest-like-ion overlaps can be expected to give roughly 
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Fig. 2. Axes for zinc blende and wurtzite. The axes x, y , 2 are 
the ones used in this paper. These are the axes usually employed 
for the wurtzite structure. The axes x', yz* are the conventional 
axes for the zinc-blende structure. The open and blackened circles 
represent sulfur and zinc sites, respectively. 

equal contributions to {^ n ZB |^ ft ^ B )and (^ ft w+ |^„' w+ ). 
Assuming higher overlaps can be neglected, we have 
^ n w+|^r n ,w+)«^ n zB|^r n ,zB) = g ftn , Thus, by means 

of a linear combination of zinc-blende Wannier functions 
one can construct a nearly orthonormal set of basis 
functions for the wurtzite structure, suitable for a com¬ 
parison of the energy levels in the two structures at 

k= (0,0,0). 

Zinc Blende 

Let us consider in more detail the zinc-blende Bloch 
functions at the top of the valence band. If we omit the 
spin-orbit interaction term in the zinc-blende Hamil¬ 
tonian, the top of the valence band will be sixfold de¬ 
generate, with state vectors ( X ), | F), | Z)) ■ (| +), | —)) 
where \X), | F), J Z) are Bloch states transforming like 
x, y, z under the operations of the zinc-blende symmetry 
group Td. The zinc-blende Hamiltonian with the spin- 
orbit interaction term is diagonal in the manifold 
spanned by 

i>= |n)| + ), 

I 2)= (1/a/ 6)[Y2 n> —}—2 Z) +)], 

8 | 3)= (1/a/ 6)[Y2 n> +)+2 Z) ->], 

l 4)=|n)|-), (l) 

f 5)= (1/V3)[Y2 n) -)+ Z) +)], 

l7 t 6)=dM)[v2 n> +>- z> -)], 

|n)= (1/V2)(|x)+;| Y)), |n>=(l/v 5 )(|X>-i|F». 

The first group of states transforms as a basis for the 
irreducible representation r 8 of the double group of Td ; 
the second set is a basis for r 7 . Use of the finite basis, 
Eq. (1), to diagonalize the Hamiltonian is equivalent 
to treating the spin-orbit term by first-order perturba¬ 
tion theory. Thus, the error in the energies made by the 
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neglect of admixtures of wave functions from other 
bands is of the order 8 2 /E g , where 8 is the zinc-blende 
spin-orbit splitting and E g is the band gap. Since 
5*/.E ff «sO.O676/3.6, the fractional error made in the 
valence band splitting is small. The prediction that the 
zinc-blende valence band is split into a r 8 level and a 
r 7 level agrees with experiment. 1 


Wurtzite 

The basis functions in wurtzite are taken as sums of 
zinc-blende Wannier functions. Consider in particular 
those constructed from the zinc-blende valence band 
wave functions, using for these the approximate forms 
Eqs. (1). Since the threefold rotation operation and the 
reflection plane parallel to its axis are symmetry opera¬ 
tions in both zinc blende and wurtzite, the behavior of 
the sets of states 11), ■ ■ *, 16); 11 + ), • * ■, |6 + ) under 
these operations will be identical. It is thus possible to 
show, by using characters of the double group of C-z Vj 
that the states 11 + ), • * |6 + ) transform according to 

representations of the double group of Ca v as follows® 


a 4 (1/V2)C 
Ab (1/V2) 

(r 9 ) 


i + >+ 

i + >- 


4+>] 

4+>] 




This permits a simplification of the submatrix of the 
wurtzite Hamiltonian spanned by 11 + ), * * •, 16 + ), to 

1+ 4+ 2+ 5+ 3+ 6+ 


1 + 

4+ 

2+ 

5+ 

3+ 

6+ 


a 0 0 0 0 0 
0 a 0 0 0 0 
0 0 b c 0 0 
0 0 c* d 0 0 
0 0 0 0 6 c 
0 0 0 0 c* d 


The prediction that the wurtzite valence band consists 
of a r 9 level and two I\ levels agrees with experiment. 

Just as in zinc blende, the admixture of other wave 
functions forming the basis will be neglected. Let us 
write 


where 5, as before, is the zinc-blende spin-orbit splitting, 
and 5 ftn - is the Kronecker delta. The near degeneracy of 
the valence band in zinc blende has been made a com¬ 
plete degeneracy in Ho by subtraction of 55 ftft ' (£*= i 4 8 n k) ; 
this term has been included in the perturbation Hi. In 
this way we have defined a problem in degenerate 
perturbation theory. If the degenerate manifold is 
treated exactly, neglect of the admixture of other wave 
functions of the basis is equivalent to neglecting second- 
and higher order terms in a perturbation expansion, and 
results in an error ~\{\Hi\)\ 2 /B g . Here |{|fl r i|)| is 
the magnitude characteristic of matrix elements of the 
perturbation, which can be expected to be of the order 
of the valence band splittings in wurtzite «0.05 ev. 
Thus, the fractional error made in the valence band 
splitting «0.05/3.6, which is small. 


Two-Parameter Formulas 

Because Hi factors completely into 2X2 submatrices, 
exact solution of the eigenvalue problem within the 
degenerate manifold is easy. The results for the wurtzite 
and zinc-blende valence band energy splittings and the 
wurtzite valence band eigenstates are: 


AEw • 


E a —E b = 


a+5 


[(t) 


a+5^ 2 2a5"|* 

3 J ’ 


e e 47 a+a V 2a5 T 


AEzb: S=(l|fl r ZB |l)-(5|fl r ZB |5); 

a)=|l + >, 

8 a 


v2a 

N h \b) = —15 + )+ 
3 

Ne\c)= 15+) + 

3 


6 a r /a+5\ 2 2a5~|* 


5 a p /a+5\ 2 2a5"|* 

3 J 


5 a ~/a+5\ 

2 6 L\ 2 / 


2 + >. 


( 2 ) 


(3) 


it=*i 

(n\Hi\n') = (*™+\H w \* n .™+) 

~ (^ r n ZB |-S r ZB|^ r n' ZB ) + 55„^(XI 

ife=l 


9 The designation in parentheses refers to the representation of 
the double group of C&* according to which the functions approxi¬ 
mately transform. The basis functions for wurtzite would trans¬ 
form exactly according to irreducible representations of Ctv if we 
took the zinc-blende valence band Bloch functions to be linear 
combinations of ionic 3p orbitals. We have instead used the exact 
zinc-blende Bloch functions. Because the wurtzite Hamiltonian is 
diagonalized with respect to only a finite basis, and because the 
functions in the basis transform only approximately according to 
irreducible representations of Ce*, the wurtzite valence band wave 
functions obtained have only approximately the correct symmetry 
properties. Nevertheless, in the remainder of this paper die wurt¬ 
zite functions will be labeled by the indicated representations 
of Ct v . 


Here Nb and N c are normalization constants and a is 
a crystal field parameter defined by 

a =[<n+ 1 w I n+> - <n | h zb | n)] 

-1(Z+\Hw\Z+)-(Z\H zb \Z)3. 

In this expression Z7w and H ZB signify the Hamiltonians 
without the spin-orbit interaction terms. 10 

Equations (2) are the two-parameter formulas origi¬ 
nally derived by Hopfield. 6 They have been obtained by 
making only three approximations: 


16 It is easy to show that the fractional error resulting from 
neglect of these terms is of the order 

] <1+ ] {%/FwX P11 + ) ] ^ ^ 1 n „ 4 

|(l + |Fw|l + )| »V(f*> ' 

where (r 2 ) is the mean square sulfur ion radius. 
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(1) Assumption of approximate orthonormality of 
the wurtzite basis; 

(2) Neglect of energy terms ~$/E g \ 

(3) Neglect of energy terms « | ) 2 /E g . 

Note that the equations for the splittings in wurtzite 
are completely symmetric in a and 5. This means that 
a and 5 cannot be determined uniquely from the wurtzite 
splittings: if (a,5)= (a, b) is one solution consistent with 
the data, then (a,5) = (5, a) is another. The ambiguity 
just corresponds to the fact that solving (2) for a and 5 
in terms of E a —Eb , Eb—E e leads to a quadratic equa¬ 
tion, both roots of which are allowable solutions. This 
symmetry of the two-parameter formulas is not ex¬ 
plicitly evident in the version of them given by Balkan- 
ski and Cloizeux, 11 

Finally, it is to be emphasized that the anion and 
cation have been assumed to be S and Zn in the above 
derivation purely for the sake of convenience in referring 
to them. The theory should be valid in other semicon¬ 
ductors with p-]ike valence bands and with spin-orbit 
and crystal field splittings which are small relative to 
the band gap. 

3. APPLICATION TO ZnS, CdS, AND OTHER 
II-VI COMPOUNDS 

The two-parameter formulas describe three splittings 
in terms of two parameters, They fit well the values 
given by Birman et ah 1 for the spin-orbit splitting of the 
zinc-blende form and the two valence band splittings of 
the wurtzite form of ZnS, From the data (Table I) at 
77°K, 5=0,068 ev and E*-E h +%(E h -E 0 ) = \(a+S) 
=0.069 ev, giving a= 0.070 ev. The theory then gives 
0.080 ev and 0,029 ev for the wurtzite splittings, within 
10% of the experimental values of 0,084 ev and 0,027 ev. 
The order of levels predicted is also correct, 

A reasonable result is also obtained when Eqs, (2) 
are applied to data for CdS. Crystals of cubic CdS have 
not yet been grown. However, with data for hexagonal 
CdS, the formulas can be used to predict the value of 
the spin-orbit splitting which would be observed in 
cubic CdS. Two values are obtained as a result of the 


Table I. Valence band splittings in ZnS and CdS. 


Temperature 

(°K) 

5 (ev) E a —Eb (ev) 

Eb—Ec (ev) 

77 

Cubic ZnS 

0.068 


14 

0.065 


77 

Hexagonal ZnS 

0.027 

0.084 

14 

0.026 

0.082 

77 

Hexagonal CdS 

0.016 

0.062 


11 M. Balkanski and J. Cloizeux (to be published). 


ambiguity discussed above: 5 = 0.065 ev or 5 = 0.029 ev 
(at 77°K). The first of these is close to the ZnS splitting 
and is probably the correct solution, since the valence 
band spin-orbit splitting should be determined primarily 
by the wave function and potential near the sulfur ion 
and should depend only weakly on the nature of the 
cation. Measurements on CdSe 2 and on ZnSe, 4 which 
show nearly the same spin-orbit splitting for both sub¬ 
stances, are evidence for the validity of this type of 
argument. 

From the formulas, Eqs. (3), the parentage of the 
lines in wurtzite ZnS can be determined. Taking 
a=0.070 ev, one obtains 

|5)=0.48|5+)+0.88|2+), 

| c) = 0,8815 + > —0,4812 4_ >. 

These can also be written 

|5) = 0,90|n+)| -)-0,44|Z+)|+), 

|c) = 0,44|n + )| -)+0.90|Z+)|+), 

These expressions have a simple interpretation (see the 
splitting diagram, Fig. 3). The states 5) and 12) trans¬ 
form according to the irreducible representations T& and 
IT, respectively, of the double group of the wave vector 
at k= (0,0,0) in zinc blende. These states are zinc-blende 
valence band eigenstates which differ in energy by 5, the 
spin-orbit energy. Equations (3) indicate how these 
zinc-blende levels mix when the crystal field perturba¬ 
tion is “turned on,” The states |n+)|-)and |Z + }|+) 
transform, respectively, according to the irreducible 
representations r 5 and Ti of the single group C$ VJ the 


w 

so 


ZB 

SO 



W 

X SO 


W 

SO 



Fig. 3. Splitting diagram indicating the mixings and splittings 
of the valence band levels as the perturbations, the spin-orbit 
interaction, and the crystal field, Fw— Fzb, are turned on in 
opposite orders. As in the text, W means wurtzite and ZB means 
zinc blende; SO and XSO mean, respectively, with and without the 
spin-orbit interaction. At the right of the figure, the lowest r 7 
level in the W SO column is joined by a double line to the IT level 
and by a single line to the r 5 level. This signifies that the wave 
function of die lowest IT level is a mixture of functions which 
transform according to r 6 and IT of the single group C* v , and that 
the coefficient of the IT wave function is larger than the coefficient 
of the r 6 wave function. The other single and double lines have 
a similar significance. 
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group of the wave vector at k= (0,0,0) in a wurtzite 
structure in which there is no spin-orbit interaction. 
According to Eqs. (3), when the spin-orbit interaction 
in wurtzite is “turned off” by setting 5 = 0, the state | b) 
becomes |n + )|—) (transforming according to r 5 ), the 
state \c) becomes |Z + )| +) (transforming according to 
ri), and the r* state has the higher energy. The predic¬ 
tion that the r* level would exceed the Ti level in energy 
in wurtzite ZnS if the spin-orbit interaction could be 
eliminated agrees with the observations showing that 
in hexagonal ZnO, in which the spin-orbit interaction is 
extremely weak due to the low anion atomic number, 
the r6-like levels lie above a IYlike level, 12 

4. ESTIMATION OF THE EFFECTIVE CHARGE 

An attempt was made to make an a priori calculation 
of a, and by this means to estimate an effective charge 
for ZnS. The zinc-blende Bloch states |X), | F), \Z) 
were assumed to beLCAO states constructed from sulfur 
3p ionic orbitals. Only zeroth and first-neighbor inter¬ 
action integrals were retained, and a point-ion model 
of the zinc-blende and wurtzite lattices, with effective 
ionic charges +A^ and — \e for zinc and sulfur ions, re¬ 
spectively, was used. This model gives a= —(3E/5) 
X fo°dr ^(Rap) 2 , where E is the coefficient of r 2 .P 2 (cos0) 
in the expansion of Vw about a sulfur ion site and R Sp 
is an ionic radial orbital. Evaluation of E by an Ewald 
summation method 13 gives X=2.3, clearly too large. 
Taking into account the effect of mixings of sulfur 3d 
states into the zinc-blende valence band wave function 
and the deviation of wurtzite ZnS from ideality was 
found to produce little change in the value of X obtained. 
(The expansions of Vw and Fzb used are given in the 
Appendix.) Mixings of zinc 4 p states into the zinc-blende 
wave function may have an important effect on X, but 
the overlap integrals necessary to estimate this were 
not calculated. 

12 However, in ZnO, unlike ZnS, the order oi levels is r 7 , r 9 , r 7 ; 
this is discussed by Hopfield® and by Thomas. 3 

13 B. R. A. Nijboer and F. W. De Wette, Physica 23, 309 (1957). 
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APPENDIX 

We give here the first few terms of the expansion in 
solid harmonics of the potentials in ideal wurtzite and 
zinc-blende point-ion lattices. The nearest like-ion 
distance is dss and the effective charge parameter is X. 
The origin for the expansion is a sulfur site such as is at 
the orgin in Fig. 2; in other words, a sulfur site with 
nearest neighbors with direction cosines (0,0,1); 
(0, -2VZ/3, -1/3); (vZ/v3, vZ/3, -1/3); (-VZ/^, 
vZ/3, —1/3) on the unprimed axes. All sums were 
evaluated by an Ewald method and were checked to the 
number of places indicated by summing with two differ¬ 
ent values of the convergence parameter. 

Zinc blende (conventional axes, i.e., primed axes in 

Fig. 2): 

Vzb x'y'z' 

- -—Aq-^-Bq -b • • * > 

e 2 \/dss dss 3 

Bo= 76.8. 

Wurtzite (unprimed axes in Fig. 2): 

V w z \{2 

-Co+Do-bEo- 

e*\/dss dss ^ss 2 

3(z z +y 2 )zJ (3^y—y 3 ) 

+Eo-bGo-+ ■ ■ ■, 

dss z dss* 

—0.0397, Fo= 14.43, 

Eo— 0.142, Go= 10.1. 

Note that E= (—e 2 X/dss 3 )Eo- 
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Quantum Theory of the Dielectric Constant in Real Solids 

Stephen L. Adler* 

General Electric Research Laboratory , Schenectady , New York 
(Received October 12, 1961) 

The quantum theory of the frequency- and wave-number-dependent dielectric constant in solids is ex¬ 
tended in order to study the full dielectric constant tensor and to include local-field effects. Within the 
framework of the band theory, an explicit expression for the dielectric constant tensor, neglecting local- 
field effects, is derived. In addition to components which are the ordinary longitudinal and transverse di¬ 
electric constants, there are components which couple transverse and longitudinal electromagnetic dis¬ 
turbances. A formalism for calculating the local-field corrections to the dielectric constant is developed in 
detail for the case of the longitudinal dielectric constant of a cubic insulating solid. In the coarsest (dipole) 
approximation, the theory gives a Lorenz-Lorentz formula modified by self-polarization corrections arising 
from the polarization of the charge in a unit cell by its own field. 


Q UANTUM mechanical treatments of the fre- 
quency-and wave-number-dependent dielectric 
constant in solids have been given by Nozi&res and 
Pines 1 and by Ehrenreich and Cohen. 2 These authors 
give explicit expressions for certain components of the 
dielectric constant tensor, valid within the framework 
of the random phase approximation (RPA). Expres¬ 
sions are not given for the remaining components of 
the dielectric constant tensor and local field effects are 
neglected. This paper will generalize the treatment of 
Ehrenreich and Cohen 2 so as to include additional 
effects of interest in real solids. In Sec. I an expression 
for the full frequency- and wave-number-dependent 
dielectric constant tensor in a solid of arbitrary sym¬ 
metry will be derived, still neglecting local field effects. 
The additional components obtained correspond to a 
coupling between longitudinal and transverse electro¬ 
magnetic disturbances. This coupling, which does not 
appear in an isotropic free electron gas, is present in 
solids of even cubic symmetry and vanishes only for 
propagation along special directions of high symmetry. 
In Sec. II, local field effects in insulators of cubic sym¬ 
metry will be discussed. An integral equation will be 
set up which determines the longitudinal dielectric 
constant with local field corrections in the case of wave¬ 
lengths large compared to the lattice constant but small 
compared to the over-all crystal dimensions. The in¬ 
tegral equation will be rewritten by making a multipole 
expansion of the potential in a given cell arising from 
the charge density in all other cells. The solution, when 
only dipole terms are retained, is the modified Lorenz- 
Lorentz formula 

4x(a—Ci) 

e—1=-, 

1-(4tt/ 3)(a—CO 

where a is the polarizability of the solid calculated 
without making local field corrections, and Ci is a re- 


* Present address: Department of Physics, Princeton Uni¬ 
versity, Princeton, New Jersey. 

1 P. Nozi&res and D. Pines, Phys. Rev. 109, 741, 762, 1062 
(1958); ibid. Ill, 442 (1958); ibid. 113, 1254 (1959). 
i H. Ehrenreich and M. H. Cohen, Phys. Rev. 115, 786 (1959). 


duction in a due to polarization of the charge in a given 
cell by its own field. 

The calculations of Secs. I and II are performed 
within the framework of the one-electron (energy band) 
approximation and use a linearized Liouville equation 
to determine the single-particle density matrix. Since 
in this context linearization is equivalent to the RPA, 2 
the results obtained in Secs. I and II are still valid only 
within the framework of the RPA. 

I. DIELECTRIC CONSTANT TENSOR 

We will first introduce a phenomenological dielectric 
constant tensor. Let A(r,/) and <f>(r,t) be the potentials 
describing fields acting on a system of charged par¬ 
ticles. In response to the fields, charge and current 
densities p ind (r,/) and j ind (r,/), which satisfy the equa¬ 
tion of continuity v-j ind +dp iml /tM=0, will be induced 
in the system. Let us immediately introduce Fourier 
transforms A(q^o), <£(q,co), j lnd (q,co), etc., by 

A(r,/)= Jdqdco A(q,co) expi(q-r— cot), (1.1) 

and similar equations. 

In their treatments of the dielectric constant, Nozieres 
and Pines 1 and Ehrenreich and Cohen, 2 following a 
practice originated by Lindhard, 3 define longitudinal 
and transverse dielectric constants z L and z T by the 
equations 

—iw[V L - r) (q^o)—1J- E (L ' r) (q,co) 

= 4 ^jmdCLT)( qjC0 ). (1.2) 

The two constants describe respectively the longi¬ 
tudinal current induced by a purely-longitudinal electric 
field and the transverse current induced by a purely- 
transverse electric field. In the case of a free-electron 
gas a longitudinal (transverse) current cannot be in¬ 
duced by a transverse (longitudinal) electric field. 
Consequently z L and zF give a complete description of 
the linear dielectric properties. In solids, in general a 

8 J. Lindhard, Kgl. Danske Videnskab. Selskab, Mat.-fys. 
Medd. 28, 8 (1954). 


413 


Copyright© 1962 by the American Physical Society. Reprinted with permission. 




R3 


129 


414 


STEPHEN L. ADLER 


purely-transverse or a purely-longitudinal electric field 
induces both transverse and longitudinal currents. In 
this case the linear dielectric properties are fully de¬ 
scribed by a dielectric-constant tensor defined by 

—iw[£(q,co) — l]-E(q^o)=47rj ind (q,co). (1.3) 


The longitudinal and transverse constants t L ( q,co) and 
£ r (q,co) can be simply related to e(q,co). Let £ be a unit 
vector parallel to the direction of propagation q, and 
define 


l r =l-44 


(1.4) 


where 1 is the unit dyadic. Then letting E(q^o) be 
purely longitudinal or purely transverse gives 

£ L (q J co)=lL*£(q J co).lL J ^ ^ 

£ r (q,co)=lr 4 £(q,co)* lr. 

The remaining components of the dielectric-constant 
tensor are 1 l * £(q,w) • 1 T and l r - £(q,co) ■ 1 l, which vanish 
for a free-electron gas but do not in general vanish for 
a solid. They describe, respectively, the longitudinal 
ttransverse) current induced by a transverse (longi- 
(udinal) electric field. 

An explicit expression for £(q,co) will be calculated in 
the energy-band approximation. Consider the single- 
partide Liouville equation 


ihdp/dt= p7,pj, (1.6) 


where p is the single-particle density matrix and 


H= (l/2m)[p— (e/ c) A(r,2)IP+e<£(r,0+ U (r). (1.7) 


Here U (r) is the periodic lattice potential. Let the state 
functions for the unperturbed lattice be |kZ)=F _ Wi 
Xexp(ik-r) with u^i cell-periodic and V the volume of 
the crystal. They satisfy the Schrodinger equation 
[p 2 /2w+Z7(r)[]| kZ)=E kI | kZ), in which k is the wave 
vector and Z the band index. Linearize the Liouville 


equation with respect to A and <f> by setting p=p (0) +p <1) 
etc. The unperturbed density matrix, p (0) , satisfies 
p co) | kZ)=/ 0 (E w ) | kZ) with /o(.Ew) the Fermi-Dirac dis¬ 
tribution function. The perturbation p (1) is linear in A 
and <f>. Dropping quadratic terms gives 

^(Z'k+qlp^lZkVdZ 

= (E rk+q -E Ik )(Z'k+q|p^|Zk) 

+ C/o (El k) —/o (Ev k+q) 3 

X(Z'k+q| — (e/2wc)(A*p+P 4 A)+«0|Zk). (1.8) 

Let us assume 

A(r,0 = A(q,co) expi(q- r-oA :), 
<Kr,0=<Kq,“) expt(q-r—orf), 

and make the Ansatz that the time dependence of 
(Z'k+q|p (i) |Zk) is exp(— izot). The frequency co is taken 
to have a small positive imaginary part, corresponding 
to an adiabatic turning on of the perturbing potentials. 
It is an easy calculation to show that 

(Z'k+ q' | e<f> | Zk)=5 q ' , q e0 (q,co) (Z'k+ q | Zk) (1.10) 

and 

K^k+ q' | A* p+ p- A | Zk) 

= $ q '. q (Z'k+q| p«+Ak+Aq/21Zk) - A(q,o>). (1.11) 

The abbreviation 

(Z'k+q|/(r a ,p a )|Zk) 


s (l /®«)J «i'k+ q *(r)/(r, —**Vr)«Jk(r)rfr, (1.12) 

has been introduced, in which the integration extends 
over a unit cell. Couplings of the wave vector q to wave 
vectors q+K, where K is a reciprocal lattice vector, 
have been neglected. These so-called Umklapp processes 
give rise to the local field corrections and will be dis¬ 
cussed in Sec. II. The solution of Eq. (1.8) is immedi¬ 
ately obtained in the form 


(Z'k+q|p (1) | Zk)= 


[/o(Eik)—/o(Erk+q)X(^k+q|Zk><^(q,co)— (Z'k+q| p e +^k+^q/2|/k)- (e/W)A(q,o>)] 

fUO + El^ — -El'k+q 


(1.13) 


The induced current and charge density may be 
calculated from 

j lnd (r,Z)=Trp a) j op C0 )(r)-|-Trp (0 )j op (1 )(r,Z), 
p lnd (r,Z)=Trp a )p op ( °) (r), 

where 

jo P C0) (r)= (V2w)[(p e /w)5(r— r e )+5(r— r e )(p 6 /w)], 
jop a) (r)=-(e/wr)A(r,Z)5(r-r,), (1.15) 

Po P C0) (r)=e5(r— r 6 ), 

and r e and p e are, respectively, the position and mo¬ 


mentum operators. This gives 

j ind (q,co)= -«*A(q fi>)N/mcV 

+5Zii , k(^k| p«+^k-|-^q/2 |Z'k+q) 

X(Z'k+q|p a) |Zk), (1.16) 

p lnd (q,co)= (e/V) X> k (Zk|Z'k+q) 

X{Z'k+q|p (i ) |Zk), (1.17) 

with N the number of cells in the crystal. 

Since j lnd (q,co) and p ind (q,co) are obtained by linear¬ 
ization of a gauge-invariant theory with respect to the 
potentials, they are invariant under infinitesimal gauge 
transformations, and thus also under arbitrary gauge 
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transformations. This is verified explicitly, in the case gauge in which <f> = 0 without loss of generality. Note 
of the expression for j^fajto), in the Appendix. Since that A will not in general be transverse in this gauge, 
the theory is gauge invariant, we may transform to a From Eqs. (1.13) and (1.16) we get 


} hid (q t o))= —e 2 X (q t u>)N/mcV 

e 2 (2k| pe+^k+^q/2 |/'k+q)[yo(Eik) — /o(-Erk+ q )]](/ / k _ bq| p e +^k+^q/2|/k)* A(q,co) 


m 2 cV irL 


#a>+.Eik~---Ei'k+q 


(1.18) 


Comparing with Eq. (1.3) and noting that E(q,co)=£a>A(q,co)/c gives 
e(q,<o)= (1-WAT/wF^l 

+ 47r ^ 2 £ W p e +^k+^q/2|/'k+q)[/o(Erk+ q )—/o(Eik)](2'k+q| p e +£k+£q/2|2k) 
mWoP- iv k &o+Ei k —-E^k+q 


(1.19) 


an explicit expression for the frequency- and wave- 
number-dependent dielectric constant tensor. 

It is interesting to compute £ L (q,co) — 
directly from (1.19). This can be done by using the 
three identities 

q* (2k| p a +/&k+/iq/2|2'k+ q) 

= (m/h)(Ey k+q -E^)(lk\l f k+q) t (1.20) 

0= — N+ (1/hq 2 ) Eii kq* (2k| p„+/&k+/zq/2 |Z'k+ q) 

XQ/o(£ik)—/o(^k+ q )](/'k+q|/k), (1.21) 


and 


0—E«'k| (Zk|2'k+q) | 2 []/o(-Eik) — /o(-Erk+q)l (1.22) 

Equations (1.20) and (1.21) are proved in the Ap¬ 
pendix, while Eq. (1.22) is obtained by making the 
change of variable k—k— q and noting that 
E k =E_k, Ek+K=Ek and «ik+K=«ik, where 

K is a vector of the reciprocal lattice. Applying the 
identities to (1.19) gives 


471-e 2 

lL*e(q,w)*li=li 1— ^K^N/mVoP-\ -E q*(2k|p a +^k+^q/2|Z'k+q) 

hcPmVa? iv* 

/ fuo 


A 

XE/o^'k+q)—/o(E Ik )[](2'k+q|2k)^ 


— 1 


\#a>+Eik — E^k+q / J 

4xre 2 | (2k|/'k+q)| 2 [/o(E^k +q )—/o(Eik)[] 

= 1l i+— E - • (i* 23 ) 

q 2 V iv k ^a>+Eik—Ej'k+q 


Equation (1.23) is just the longitudinal dielectric constant derived by Ehrenreich and Cohen. 2 Using the same 
identities employed to derive (1.23), it is easy to show that lr-e(q,w)-1 l and lx,*e(q,<iO*lr are given by 


employed to derive (1.23), it is easy to show that lr* e(q,w) 4 1 l and li,*e(q,<iO*lr are gi 

4TC 2 lr* (2k| p«+^k|2 / k+q)(/'k+q|2k)^/ 0 (E^k+q)'"/o(Eik)]] 

lr*£(q,w)-lL=-E-—-, 

a>qmV iv k ^co+Eik—Ei^k+q 

47re 2 g(2k|2 / k+q)(/'k+q| p a +^k|2k).lr[/o(Ei/k+ q )—/o(Eik)] 

U*£(q J w)*lr=—— E-. 

uqtnV iv k ^co+Ejk—E^k+q 


(1.24) 


o>qmV iv k ^w+Eik—Erk+q 

When the limit q —» 0 is taken, the dielectric constant tensor becomes 

4tt e 2 N 4«* (2k| p a |2'k)(Z'k| p a |2k)[/o(E^ k )-/o(E Ik )] 

e(0,co) = l-1-XI- 

mV co 2 mWu? M'k ^o>+Ejk—Ej'k 


(1.25) 


In a crystal of cubic symmetry, the sum over the 
star of k, Ek*(2k| p„|2'k)(2'k| p«|2k), is a multiple of 
the unit dyadic, and consequently £(0,to) is isotropic. 
Thus, with the approximations made to get Eq. (1.19), 
as q—>0 the longitudinal and transverse dielectric 
constants become equal and lr*s(q> w )'li. and h 
• £(q,co) • 1 T vanish. This is true for an arbitrary direction 


of propagation in a cubic material, but will not in gen¬ 
eral hold in the case of crystals of lower symmetry. 

II. LOCAL-FIELD CORRECTIONS 

In this section we will develop the theory of the 
longitudinal dielectric constant with local-field correc¬ 
tions, for a cubic insulating solid, in the case of wave- 
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lengths large relative to the lattice constant but small 
relative to the over-all crystal dimensions. Local-held 
effects arise in a real solid because the microscopic 
electric field varies rapidly over the unit cell. Conse¬ 
quently, the macroscopic field, which is the average of 
the microscopic field over a region large compared with 
the lattice constant but small compared with the wave¬ 
length 2n r/ q t is not in general the same as the effective 
or local field which polarizes the charge in the crystal. 
For example, suppose a slowly varying external potentia^ 

0 ext =< £ext(q jCO ) e xpi(q- r—o)t) ( 2 . 1 ) 

is applied to the crystal. The total potential <f>=<f> e * t 
+<£ ind will in general contain rapidly varying terms 
with wave vector q+K, where K is a vector of the 
reciprocal lattice: 

<£ = ZK<Hq,K,co) expi[(q+K)-r—to/]. (2.2) 

The potential <J> is the microscopic potential and deter¬ 
mines how the charge in the crystal is polarized. The 
macroscopic potential (<f>) a -v is clearly given by 

(<£)aY=<Hq,0,co) expi(q-r—a>0, (2.3) 

since exp(-tq- r) is nearly constant over the averaging 
region while exp[i(q+ K) • r], (K^O), is very rapidly 


varying. The derivation in Sec. I assumed a potential 
of form Eq. (2.1) instead of Eq. (2.2). In other words, 
the distinction between the microscopic and macro¬ 
scopic fields and potentials was neglected, with the 
result that no local-field corrections were obtained. In 
order to obtain the longitudinal dielectric constant with 
local-field corrections, a total potential of the form Eq. 
( 2 . 2 ) must be assumed (with A= 0 ), and the induced 
potential, 

^ ind =ZK^ ind (q J K,co) expi[(q+K)-r—to/], (2.4) 

must be calculated. The longitudinal dielectric constant 
is obtained from the macroscopic total and induced 
potentials 4 according to an alternative form of Eq. (1.2), 

5* * L ( q,w) •<?= 1 — (<^ illd )av(q,co)/(^)av(q,co). (2.5) 

Using Eq. (2.3), this is 

g-£ L (q/o)-$=l— <£ ind (q,0,to)/<£(q,0,to). (2.6) 

The right-hand side of Eq. (2.6) is easily evaluated 
in a formal manner. A calculation analogous to that of 
Sec. I gives the relation between <f> ind and <f> as 

<^> ind (q,K,w)= Iq+KI^X^K'G^q+K, q+K', to) 

X^(q,K',to), (2.7) 

with 


IU( . 4^ (/k|exp(-iK-r„)|/ , k+q)(/ , k+q|exp(iK , -r e )|/k)[/ 0 (£ik)—/o(£i'k+,)] N 

G(q+K, q+K',co) = — £ - : ——----. (2.8) 


V H'k 


fto> + £j k — Ej'k+q 


As before, the variable of integration in the matrix 
element has been indicated by r e . Let us define 
e(q+K, q+K', to) and *“*(q+K, q+K', to) by 

e(q+K, q+K', to) 

= 5k,K'— 6 (q+K, q+K', to) | q+K|“ 2 , (2.9) 
(q+K, q+K", to) 

X e“ 1 (q+K", q+K', to) = $K,K'. (2.10) 

[The quantity e~ J (q+K, q+K', to) is just the dielectric 
response function of Schwinger and Martin . 6 Equations 

(2.9) and (2.10) have been given by Falk , 6 who treats 
the nearly free electron case.] 

Rewrite Eq. (2.7) as 

I q+ K1 2 [<£ (q, K,to)— <f> ind (q, K,to)] 

= 'Hk' e(q+K, q+K', to)|q+K'| 2 <£(q,K',to), ( 2 . 11 ) 

and note that <£(q,K,to)—<£ ind (q,K,to)=<£ ext (q,K,to) 
=<£ ext (q,to)5K.o, since the external potential (the poten¬ 
tial due to charges located outside the crystal) is essen¬ 
tially constant over a unit cell of the crystal. Using Eq. 

( 2 . 10 ) we find 

*(q,K,to)= |q+K| _ 2 r' 1 (q+K, q, to^^^to), ( 2 . 12 ) 

4 L. Rosenfeld, Theory of Electrons (North-Holland Publishing 
Company, Amsterdam, 1951), Chap. 2. 

5 P. C. Martin and J. Schwinger, Phys. Rev. 115, 1342 (1959). 
8 D. S. Falk, Phys. Rev. 118, 105 (1960). 


giving 7 

Q ' e(q,w)-g= 1 / e _i (q,q,to). (2.13) 

Thus, the problem of finding the dielectric constant 
with local field corrections reduces to that of solving 
the integral equation 

e-^q+K, q+K'.co) 

= Sk,k'+£k" G(q+K, q+K", oj)| q+K"| -2 

Xe-Uq+K", q+K'.co), (2.14) 

obtained by combining Eqs. (2.9) and (2.10). 

The main purpose of this section is to develop a 
systematic method of approximating the integral equa¬ 
tion (2.14). This will be accomplished by means of two 
successive transformations. First, the integral equation 
will be transformed from the K representation to an r 
representation, where r is a continuous variable con¬ 
fined to a unit cell of the real lattice centered about the 
origin. In this representation, the kernel of the integral 
equation will be split into two parts, K L and K s . These 
describe the influence on a given cell of the field of the 
polarized charge in all other cells ( K L ) and of the field 
of the polarized charge in the same cell ( K s ) } and are 
connected, respectively, with the local field and self- 
polarization corrections. A second transformation will 

7 This equation has been independently obtained by N. Wiser. 
I am indebted to M. H. Cohen and N. Wiser for communicating 
their results prior to publication. 
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then be made by expanding K L in a multipole series, 
leading to an integral equation in what might be termed 
a multipole representation. This equation can be solved 
approximately by neglecting all but the first P multi¬ 
pole moments. The case when only dipole moments are 
retained will be worked out explicitly, and leads to a 
Lorenz-Lorentz formula modified by self-polarization 
corrections. 

In the equations that follow, the to dependence of e -1 
and G will no longer be indicated explicitly. To trans¬ 
form to the r representation let us define 

e-i(q,r,r')= E ^^'T'^Cq+K, q+K'), (2.15) 

K,K' 

G(q,r,r') = £ e**-'<r**'-'’G(q+K,q+K'). (2.16) 

K,K' 


The integral equation becomes 

e ~ 1 (q> r > r 0 

=v a D 5(r—r'—R^)H— f dndti 

I VaJo 

p 0 expiq • (Rj-+ r/ — rj 

XG(q,r,ri) — E — 


1 


L47r i IRj+r/ —r i( 

Xe-Hq^i'/), 


(2.17) 


where R/ are the vectors of the real lattice and the 
continuous variable r is confined to a unit cell centered 
at R/=0. The inverse of the dielectric constant is ob¬ 
tained from 


e _1 (q.q) = — [dtdt' e-‘(q,r,r'). (2.18) 

VaJ 0 


The kernel of the integral equation, 


K( q,r,r/)=— [dh G( q.f,^)-^- 
V a J 0 47T 


expiq- (R/+r/— n) 

XE-, (2.19) 

7 IRrfn'-nl 


can be divided into two parts, 

R:(q,r,r 1 0 = ^(q,r,r 1 ')+2i:«(q,r,r 1 0. (2.20) 

Since several kernels similar in structure will be intro¬ 
duced in the course of the derivation, we specify them 
all through the functional form 


W 

Jl q,/,g]=— E 

V H'k 


(Zk|/|Z'k+q)(Z'k+q|g| Zk) C/o (E lk ) -/„(£,.*+,)] 


Ei^—El' k+q 


( 2 . 21 ) 


(No terms with l=l f appear in the summation in 
the case of an insulator because all bands are either 
empty or full.) The kernels G(q,r,ri'), Af L (q,r,ri'), 
and i^ s (q,r,r/) are obtained from this form by the 
substitutions 

G.K^: f= V a E/5(r—r c —Rj), 

G: g=»aEiS(r/-r e -Rj), 

K L : g= (v a /4ir) exppq• (r/—r„)] 

XE/ exp(iq-Rj)/| r e —r/—Rj|, (2.22) 
K s : g= (va/iw) exppq- (r/-r.)]/1 x— r/1. 

The prime on the sum defining g in K L means that the 
term with Rj=0 is to be omitted. 

Since r/ and r e are restricted to lie within a unit cell 


centered about the origin, | r a |/| r/— Rj-| <1 for all 
R/t^O and the multipole expansion 



expfq*R* » 
r e —r/— R/| j>=o 


(2.23) 


is valid. Equation (2.23) serves as definition of the ex¬ 
pansion coefficient T p . Substituting Eq. (2.17) into 
Eq. (2.18), splitting the kernel according to Eq. (2.20), 
making the multipole expansion of Eq. (2.23) and letting 
q —» 0 results in 

e~ 1 (0,0) = 1 —i E (2.24) 

p=i 


The quantity B P L is defined by 


B„ i =lim — [dr'dr^ eitp(iq-r 1 ')T p (q,r 1 , )t- 1 (q,r 1 ',r') , 
’-^UaVo J 


(2.25) 


and Bi s is the p = 1 case of 
B/=lim|"— [ di'dr 1 'K 1I a (r 1 ')e- 1 (q,r 1 ',i , )l. (2.26) 

q -*°L Va J 0 J 

The kernels Ki P L and K p s (r/) are obtained from Eq. 


(2.21) by substituting 
Ki p L : f=r.,g={r e y 

K v s ( r/): /= (*e) p exp(iq-r a ), (2.27) 

g= (va/'b-) exppq - (r/- r.)]/1 r/- r.|. 
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In deriving Eq. (2.24), the relation 

q- (Z'k+q | r a exppq- r e ) |Zk) 

= -i(l'k+q\lk)+0(<f), (2.28) 

valid when l^V, has been used. 

The kernels K iP L and K p s (r/) are known quantities. 
In order to complete the set of equations, expressions 
for B P L and Bp 5 must be derived. Multiplying Eq. 
(2.17) by q exp(iq-r/)T p (q ) r 1 / ) and integrating gives 

B p L =^5 Pl i+E(-:lW j[limT it+ p(q,0)] 

fc=i \ k ) q "*° 


It should be noted that terms involving Bo L have 
been omitted from Eqs. (2.24), (2.29) and (2.35) be¬ 
cause they vanish in the limit q —» 0. For example, the 
coefficient of Bo L in Eq. (2.24) contains a factor 
(Z'k+q| exp(—Zq-r 6 )|Zk). Since 

(Z'k+ q | exp (—iq • r 6 ) | Zk) = (Z'k+ q | Zk) 

—iq- (Z'k+q| r 6 |Zk)+0(5 2 ) = 0(^ 2 ) 
and 

q r v a exppq-Rj) 

B 0 L =— / dr E' —- 

v a J o 47r 3 |r+R,| 


■mEV-B/), (2.29) 

71=1 

where the kernel Kk n L is obtained from Eq. (2.21), by 
the substitution 

Kfc« L : /= (r„) fc , g= (r«) n . (2.30) 

In the first term on the right-hand side of Eq. (2.29) 
the evaluation 

q r =i$, p=l, 

lim—/ dr e iq ' r T p (q,r) (2.31) 

*-° v a J o =0, £>1, 

has been used. 8 Finally, an equation for the B p s must 
be derived. Let us write 

Z»(q,r,r / )=5(r-r')-(l/7J a )^ s (q,r,r / ), (2.32) 

and define ZH^r^i) by 


(l/v a )J dti Z7- 1 (q,r,r 1 )D(q,r 1 ,r / )-5(r-r / )- (2.33) 

Then we can write 
e- 1 (q,r,r') 

= -D~ 1 (q,r ! r , ) + (l/»a 2 ) f <2r 1 <fr 1 '.D- 1 (q,r,r 1 ) 

JO 

X^T L (q,r 1 ,ri')e~ 1 (q,r 1 ',r'). (2.34) 

Multiplying Eq. (2.34) by — iqK P s (r)/v a 2 j integrating 
and making a multipole expansion gives 



X K p s (r)D -1 (0,r,ri)K n I '(r 1 ) • B„ L , (2.35) 


q r exp(iK-r) 1 

/ dr E-= 

VaJ 0 K | K+q| 2 q 

the term drops out as q —> 0. Similarly, the terms in 
Eq. (2.29) and Eq. (2.35) involving Bo L do not 
contribute. 

The preceding manipulations have replaced the inte¬ 
gral equation (2.14) for e -1 (q+K, q+K') by Eqs. (2.29) 
and (2.35), which together constitute an integral equa¬ 
tion for B P L (p= 1,* • ■, o°), and the integral Eq. (2.33) 
for ZH. Once the quantities B P L and Bi s are known, 
the dielectric constant with local-field corrections can 
be calculated from Eq. (2.24). The point of this formal 
rearrangement is that it is now possible to make an 
approximation with a clear physical significance which 
makes Eqs. (2.29) and (2.35) easily soluble. This is 
simply to neglect all the B P L with p greater than some 
integer P. This means roughly that we are approxi¬ 
mating the influence on a given cell of the charge in any 
other cell by the first P multipole moments of this 
charge. In many cases, we expect very good results to 
be obtained for a small value of P. The most familiar 
case is that of P=1 (dipole approximation). Utilizing 
the fact that the only second-order tensor compatible 
with cubic symmetry is the isotropic tensor, and noting 
that the inhomogeneous term in the equation for Bi+ 
is a vector parallel to q, Eqs. (2.24), (2.29), and (2.35) 
become 


e 1 (0,0) = l-iBi L ^g-K 11 L ^+Bi s ^, 

Bi L -2=t— 2[lim2-T 2 (q,())•£] 

q-H) 

X (ZBi s -<2+<2‘ Ki+’gB+’g), 

Bi s '$= — (i/v a 2 )$' fdrdr i 


(2.37) 


where K n L (ri) is defined by 
K n L (r 1 ): /=tf a Ej 5(n-r 6 -Rj), g= ( r € )\ (2.36) 


X K^(r)I)- 1 (O.rjOtfi^rO ^B+^ 

— i47rCiBi I '*<2. 


8 This is easily obtained by using It is easy to show that 

v a v exp (Iq • R;) _ ^ expp(q+K)- (r«—r)] 

4tt |r,-r-R,-| k |q+K| 2 ’ <?• Kn L -<2= — 47ra, 


(2.38) 
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where a is the polarizability calculated without making 
local-fidd corrections. [The second term on the right 
of Eq. (1.21) is just 47ra.] The dipole sum g- T2(q,0)-<2 
is not absolutely convergent. However, if we evaluate 
it in a crystal of finite diameter L, letting L —» oo and 
q —» 0 while keeping it has the value — inde¬ 

pendent of the crystal shape. 9 This evaluation procedure 
is the one that makes sense physically for wavelengths 
in the infrared, visible and near ultraviolet. For such 
wavelengths and for a typical crystal of dimension L 
and lattice constant a , the inequality qL2> 1 holds. 
However, since qa4 Cl and since the matrix elements 
and energies appearing in the kernels vary appreciably 
only when q changes by an amount of order 1/a , we 
can still take the limit q —» 0 in the kernels. 

Using these evaluations, Eqs. (2.37) may be readily 
solved to yield 

1 47r(a—Ci) 

_ = 1_|_(2 39) 

e-^O) l-(4r/3)(a-Ci)’ 

This is the usual Lorenz-Lorentz formula, modified by 
the subtraction from a of 

Ci= (l/47Tfl a 2 )<7* f drdii 

Jo 

XK^(r)(r|[l-(l/ Va )A: s ]- 1 |r 1 )K^(r 1 )-g. (2.40) 

This self-polarization correction takes just the form that 
is expected on the basis of a simple classical model. If 
we compute the dielectric constant of a macroscopic 
cubic lattice of uniform spheres composed of material 
of polarization per unit volume a, we find that the di¬ 
electric constant is determined by a Lorenz-Lorentz 
formula, except that a is replaced by 

47ra 2 /3 

a f = a -. (2.41) 

l+47ra/3 

The subtracted term is a self-polarization correction 
arising from the influence on a given sphere of its sur¬ 
face charge. To examine the qualitative form of Ci, 
let us replace all the kernels K appearing in Eq. (2.40) 
by —47ra [cf. Eq. (2.38)]. Then we see that the cor¬ 
rection Ci also has the form Ac?/ (1 +Ba), with A ) B> 0. 

With the formalism developed, higher-order correc¬ 
tions to the Lorenz-Lorentz formula can be obtained by 


taking the cutoff integer P larger than one. Note that 
T p (0,0) is absolutely convergent for p> 2, so no addi¬ 
tional ambiguities regarding the method of summation 
appear when working to higher order. Although the 
calculation has been carried out for crystals of cubic 
symmetry in order to avoid a tensor dielectric constant, 
its main features would be expected to carry over to 
the case of arbitrary symmetry. If the restriction to 
insulators is dropped, intraband terms ( l=V ) appear in 
Eq. (2.21). If these are treated in a free-electron ap¬ 
proximation, which should be reasonable when motion 
of the conduction electrons and holes is well described 
by an effective mass, the generalization of the above 
derivation is straightforward and leads to 

1 47r(a 11 —Ci) 

-=l+47ra I H--. (2.42) 

e * (0,0) 1 — (47r/3)(a 11 —Ci) 

In Eq. (2.42) a 1 and a 11 are, respectively, the intraband 
(/=/') and interband parts of a, the polariza¬ 

bility without local field corrections. The restriction 
qLs> 1, necessary to evaluate the dipole wave sum 
T 2 (q,0), cannot be relaxed without drastically altering 
the derivation. In order to deal with wavelengths 
comparable with the macroscopic dimensions of the 
crystal it would be necessary to take into account sur¬ 
face effects, which of course has not been done in the 
above derivation. 
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APPENDIX 

In order to prove that the expression for j ind (q,o>) is 
gauge invariant, we need the auxiliary identities: 

XiMik(r)«ik*(0 = ®aX/8(r-r'-Ry). ( A1 ) 

q- (Zk|^,+ftk+ftq/2|Z'k+q) 

= (m/ft)(£r k+q -£ Ik )(Zk|Z'k+q). (A2) 

Expression (Al) is just the completeness relation for the 
periodic parts of the Bloch functions. The identity 
(A2) is obtained in a straightforward manner by writ¬ 
ing k-p Schrodinger equations for wi k * and uv k+ q , 
multiplying the former by uv k+ q , the latter by Wit*, and 
subtracting. We may also regard (A2) as the result of 
expanding 


Jdr exp(-iq- r)[v • <Jk | j op (0) (r) | Z'k+ q)+d(Zk| p op ®> (r) | Z'k+ q)/df]= 0, 


which shows that it is an expression of conservation of charge in the unperturbed theory. 


(A3) 


9 M. H. Cohen and F. Keffer, Phys. Rev. 99, 1128 (1955), 
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Let us now make the gauge transformation A —> A+ q/(q,co), <f> —» <£+ (co/c)/(q,co). Then 
A j ' md /f (q,co) = — e?Nq/mcV 

e? (/k|p 6 +^k+^q/2|/ / k+q)[/ 0 (E ik )-/o(-Erk+q)](/ / k+q| p a +fck-ffcq/2 |Zk)-q 


m 2 cV iv k 


E ik — El' k+q 


e 2 o> 


Combining terms gives 


mcV «rk 


(/k| p.+ftk+ftq/2 |rk+q)[/o(£ !k )— /o(-Erk+,)3 

4 

#0) + .Elk'“i?J'fefq 


Aj lnd //(<L w ) = (e 2 /wcF){-A r ^q+Ei^ k (/k|p a +^k+^q/2|/'k+q)[/o(Eik)— /o(£i'k+ q )](/'k+q|/k)} 

= (e 2 /wcF)[-ATfcq+2 £ Ik f 0 (E ik ) Zi^(/k|p a +^k+^q/2|/ , k+q)(/ / k+q|?k) 

= {^/mcV){_—Nfiq+2 £ Ik / 0 (Ei k )(/k| p a +fck+fcq/2 |Zk)]=0. (A4) 

In the last step, the fact that (Zk| p a +&k| /k) has odd parity under inversion of k has been used. 
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Tests of the Conserved Vector Current and Partially Conserved Axial-Vector 
Current Hypotheses in High-Energy Neutrino Reactions* 

Stephen L. Adler! 

Princeton University , Princeton, New Jersey 
(Received 8 April 1964) 

The following theorem is proved: Consider the high-energy neutrino reaction v+a l+j 3, with a a 
nucleon or nucleus, l a lepton (e or n) and /3 a system of strongly interacting particles. Suppose that the mass 
of ct and the invariant mass of /3 are not equal, and that the lepton mass is neglected. Then when the lepton 
emerges with its momentum parallel to that of the neutrino, the squared matrix element, averaged over 
lepton spin, depends only on the divergences of the vector and the axial-vector currents. Tests of the conserved 
vector current and the partially conserved axial-vector current hypotheses, based on the theorem, are 
proposed. 


I. INTRODUCTION 

T HERE is a characteristic property of neutrino 
reactions at high energy which makes possible 
new tests of the conserved vector current 1 (CVC) and 
the partially conserved axial-vector current 2 (PCAC) 
hypotheses. Consider the reaction v-\-ot— »Z+/3, where 
a is a nucleon or nucleus, Z is a muon or electron, and 
/3=/3i+ ■ —b/3 n is a system of strongly interacting 
particles. Let the four-momenta of v , a, Z, and be, 
respectively, k h pi, k 2 , and pi, and let the leptonic 
momentum transfer be k=ki—k 2 =p 2 —pi. We denote 
by M a the mass of a, by mi the mass of the lepton Z, 
and by W the invariant mass of the system We take 
the neutrino mass to be zero. 

Theorem i. Suppose that W 7 ^M a and that mi is 
neglected. Consider the configuration in which the final 
lepton emerges with its momentum parallel to that of 
the incident neutrino. (We call this the parallel con¬ 
figuration . 3 ) Then the squared matrix element for 
y-j-a—> Z+jd, averaged over lepton spin, depends only 
on the divergences of the vector and the axial-vector 
currents. 

Proof: The matrix element is 

911= 2^Huiy x (l+y 5 )u v 0\ fa V +3 \ A I a>. (1) 

Squaring and averaging over lepton spin gives 4 

<|9Ti| 2 > = <^|^\ y +^\ A |o!)<i8|^/+ I *)*Tu , (2) 


* Work supported in part by the U. S. Air Force Office of 
Scientific Research, Air Research and Development Command. 

t National Science Foundation Predoctoral Fellow. Present ad¬ 
dress: Lyman Laboratory, Harvard University, Cambridge, Mas¬ 
sachusetts. 

1 R. P. Feynman and M. Gell-Mann, Phys. Rev. 109,193 (1958). 
2 M. L. Goldberger and S. B. Treiman, Phys. Rev. 110, 1178 
(1958); Y. Nambu, Phys. Rev. Letters 4, 380 (1960); J. Bern¬ 
stein, S. Fubini, M. Gell-Mann, and W. Thirring, Nuovo Cimento 
17, 757 (1960), and references listed there. 

8 When mi— 0, both k\ and are null vectors. If the space 
components of two null vectors are parallel in one Lorentz frame, 
they are parallel in all Lorentz frames. Hence “parallel configura¬ 
tion” is an invariant concept. 

4 The four-vectors have an imaginary time component: p 

— (Pip*) — (P,*». The quantity p* is defined by p* = p*, pi* 

— ~pK*, where * denotes complex conjugation. 


with 

T\ff x ki ff k 2 \ ki * ^25\ff - b . (3) 

When mi is neglected, k 1 and k 2 are null vectors. In the 
parallel configuration they are proportional. If W 5 ^M a , 
ko is nonzero, 6 and we may write 

ki=kiokf l k , k 2 = kiokfT'k . (4) 

Thus, 

T^=2k 1Q k 2Q ko- 2 hkr. (5) 

Since <|ff|d$x/d£\|a)= — ik\0\S\\a), we find that 

<|9TC| 2 )= 2k 1Q k 2Q k<r* |(91d (3\ v + S\ A )/dxx |a)| 2 , (6) 
proving the theorem. 

When W=M a , ko vanishes and the proof of the 
theorem breaks down. It is in fact well known that in 
the “elastic” weak reaction v+N —a conserved 
vector current will contribute strongly in the forward 
direction. 6 We assume henceforth that Wt^Mo. 

n. TESTS OF CVC 

Since the antisymmetric tensor term €\ ay ski y k 2 & 
vanishes under the hypotheses of Theorem 1, the charac¬ 
teristic parity-violating effects in weak interactions can 
arise only from vector-axial vector interference. Con¬ 
sequently, if the vector current is conserved, and if m t 
may be neglected, all parity violating effects must vanish in 
the parallel configuration . This makes possible new ex¬ 
perimental tests of the hypothesis that the vector cur¬ 
rent in AS=0 leptonic reactions is conserved (CVC). 
Whereas previous tests have dealt with for 

W^M a and various values of k 2 = (p 2 —pi) 2 , the new 
tests will study (S|^)\ r |a) for W 9 ^M a and & 2 «0. 

Let us work in the lab frame, in which a is at rest . We 
assume that a is unpolarized. Then, if CVC is false, the 
two simplest types of parity violating term which may 
appear in the differential cross section, in the parallel 
configuration, are: 


* In the frame in which /3 is at rest, kt— (W 1 — Af a 8 — k 2 )/ (2 W). 
When mi-0, k is a null vector, so if k% is non vanishing in any 
Lorentz frame it is nonvanishing in all Lorentz frames. 

*T. D. Lee and C. N. Yang, Phys. Rev. 126, 2239 (1962). 
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(A) The vector triple-product terms 

qrCqyXqfc), (7) 

where q*, qy, and q* are any three distinct momenta 
chosen from among the lepton momentum k 2 and the 
momenta q b • • • ,q n of 


qi>** An of 0 b * * *,0 n ; 

(B) The vector*pseudovector terms 

q r<*, (8) 

where a is the spin of a baryon in 0 and q< is any mo¬ 
mentum chosen from among k 2 and q b * - *,q n . 

Since, in the parallel configuration, ki and k 2 are 
proportional, and since ki— 1 * 2 +qH-bqn, the sys¬ 

tem 0 must contain at least three particles if there are 
to be enough linearly independent vectors to construct 
a nonvanishing triple product. Consequently, the re¬ 
action with the lowest threshold which could show a 
triple product term is two-pion production: 

y(ki)+a(0) —»Z(k 2 )+a'(qi)+T(q 2 )+7r(q3). (7a) 

If CVC is valid, the laboratory differential cross section 
must contain no term k 2 * (q 2 X q 3 ). (There is only one 
linearly independent triple product in two pion pro¬ 
duction.) Note that to test CVC it is not necessary to 
observe the recoil nucleus a ; it is enough to know the 
initial neutrino direction and to observe the lepton and 
the two pions. 

Because nucleon polarizations are hard to measure, 
lambda kaon production, 

KkO+«(0) -»/(k 2 )+a'( qi )+A(q 2 )+Z(q 3 ), (8a) 

is the reaction with the lowest threshold in which terms 
of type (B) could be detected in practice. The A, 
through its decay asymmetry, analyzes its own po¬ 
larization. If CVC is valid, the terms * k 2 , q 2 and 
q3 must not appear, while <r A * (k 2 Xq 2 ), <t a * (k 2 Xq 3 ), 
and <r A * (q 2 Xqs) are allowed. Similar tests of CVC may 
be constructed for reactions with thresholds higher than 
those of Eqs. (7a) and (8a). 

The proposed tests of CVC are strictly valid only 
when mi is neglected. However, we will see in the next 
section that the main lepton mass correction to the 
matrix element does not give rise to interference be¬ 
tween a conserved vector current and the axial vector 
current. Consequently, the tests should be good in 
practice even if the lepton is a muon. 

HI. TESTS OF PCAC 
A. Lepton Mass Neglected 

Let us now accept the truth of CVC. Then, neglecting 
mi, the matrix element in the parallel configuration 
depends only on (f$\dg\ A /dx\\ a). In an attempt to find 
a general explanation for the validity of the Goldberger- 
Treiman formula for pion decay, it has been postulated 


by Nambu, Gell-Mann, and others that $\ A is partially 
conserved. 2 We denote by PCAC the hypothesis that the 
covariant amplitudes contributing to (fi\dg\ A /d%\\cx) 
satisfy unsubtracted dispersion relations in the variable 
k 2 and that these dispersion relations, for —MjKk 2 
<and for all values of the other invariants formed 
from four-momenta in a and 0, are dominated by the 
one-pion pole. (Jf x =the pion mass; we are of course 
considering only the case where the quantum numbers 
of a and 0 permit a one-pion pole.) Let ko$ be the value 
of k 0 in the rest frame of 0. If the ex¬ 

trapolation from the physical value, k 2 ~ 0, to the pole 
at & 2 — —M v 2 will have little effect on the spinors and 
kinematics, and we have the covariant relation 

0\dS\ A /^X\\oi) 

= -ikxifi | I «> = (2 V' 2 ) T(ir + +a 0) (&+M v 2 )~ l 

X (2& 0 ) 1/2 (t+ | d T\ A /dx\ 10). (9) 

Here T(ir + +a—> (3) is the transition amplitude for 
T + +a —» 0, with the incident ir + of energy ko and with 
the momentum of the incident tt + parallel to k. The 
Goldberger-Treiman relation, itself a consequence of 
PCAC, may be used to express the pion-decay matrix 
element in terms of gA, the beta decay axial-vector 
coupling constant: 

(2&o) 1/2 (t + I dSx^/dxK |0>=- awMxgAgr'MJ . (10) 

Numerically, gA^l^XlO -6 ^^” 2 ; Mn is the nucleon 
mass and g r is the rationalized, renormalized pion- 
nucleon coupling constant (g r 2 /47r« 14). Combining 
Eqs. (9) and (10) gives 

- (2hy f2 r(T++a m^MNgAgr'M^+Mjy' 

- (2& 0 ) 1/2 r(ir++a 0)2w 'MxgAgr 1 

x[l-^(j^+M, 2 )- 1 ]. ( 11 ) 

This equation may be used to express the weak-reaction 
cross section in the parallel configuration in terms of the 
cross section for ir + +a— >0. Before carrying through 
the details we will consider lepton mass corrections. 

B. Lepton Mass Corrections 

Up to this point we have neglected Now let us 
compute the principal lepton mass corrections. We will 
find that the lepton mass corrections, while not con¬ 
tributing significantly to terms of the form (vector) 
• (axial vector) in the squared matrix element, make an 



Fio. 1. Diagram 
giving rise to 

<P|*x AU |«>. 
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important contribution to terms of the form (axial 
vector) • (axial vector). 

Consider the diagram shown in Fig. 1. We denote by 
<0 | $\ AII |a)the contribution of this diagram to <0 | $x A |a), 
and by (0|,$jx AI |a) everything that is left over. It is 
easy to see that 

<0| <3x AII | a) - (2* 0 ) lfl r(*++a m^MNgAgr- 1 

X(-kJ(k 2 +M r 2 )- 1 , (12) 

from which it follows that 

hM $\ AI |a)— (2koyi*n* + +<x-> m il2 M N gAgr~ l . (13) 

Although <0|$x AII |a) is a lepton mass correction, it 
must be retained because mf (k 2 +M ir 2 )~ x is of order 
unity when l is a muon. Other lepton mass corrections 
involve large masses in the denominator and may 
reasonably be neglected. Keeping terms of first order 
in mf in the kinematics gives 7 

k kioko~ l k+bpi , k2— k2oko~ l k+bpi; 

2 k‘p\b= 2 k\'k 2 = — m?k\ok 2 o~ l , k 2 =m?k o^o” 1 - 


parallel to k, by 


< 13TC| 2 ) — 


klok20 




ko 2 


-gA' 



m?h 


2(M v 2 k2o+m 2 ko) 


2 

I \m,(ir++a~*(3)\ 2 



In computing kio, ^ 20 , and ko in Eq. (16), the lepton 
mass should be neglected. Then Eq. (16) will be for¬ 
mally covariant, since ratios of the time components of 
parallel null vectors, such as kio/k 0 and & 20 /& 0 , are 
invariant quantities. 

Corollary 1. Under the hypotheses of the theorem, 
the energy, angle, and polarization distributions of the 
particles in /?, in the reaction v+a — »/“+0, will be 
identical with the distributions in the reaction t + + a —»0 
(for the same invariant mass W of in the two processes). 

Corollary 2. Under the hypotheses of the theorem, 
the lepton differential cross section da/d&i of v+a —> 
/“+0, in the laboratory frame (the rest frame of cc), is 
given by 


If the vector current is conserved, it follows that 

<0 | <9\ r + cJx A | a)<01 c }o V +$o A | a )*?"\a 

— 2k iok 2 oko ~ 2 1 <01 k ■ $ A | a) | 2 

+ 2 b ( k io+W&o -1 Re[<01 k ■ g A | a){/31 pv $ AU | a)*] 

+ 26 2 |<0|^i^ An |a)| 2 

+ 2 mi 2 k 1 O^ 2 O _1 {( 0 1 3\ AU | a)<0| c)x AI1 1 a)* 

+2 Re[(0 \$\ A1 \ a)<01 <<Jx Al1 1 a)*] } . (15) 

We have retained m 2 only in terms where there is one 
factor (k 2 +M v 2 )~ l for each factor m?. No vector-axial- 
vector interference terms are of this form. Substituting 
Eqs. (11) through (14) into Eq. (15) and performing 
algebraic simplification leads to the following theorem: 

Theorem 2. Suppose that CVC and PC AC are true. 
Consider the parallel configuration in v+a —»/“+0, for 
k 0 $ satisfying ko^/MJ^l. Let m? be retained only 
where it occurs in the combination m?(k 2 +M 2 )~ l . 
Then the invariant matrix element 2TC for v+a —» Z~+0, 
squared and averaged over lepton spin, is related to the 
invariant matrix element 8 EDI (ir + +a —> 0) for ir + +a —»/?, 
with the 7T+ of energy k Q and with the ir + momentum 


7 Actually, ki^kioko^k+ak+bpi, k^k^kfT'k+ak+bpi, where 
a is of first order in m\\ We have dropped the term ak in Eq. (14) 
because it does not lead to important lepton mass corrections. 

8 The transition amplitude T(i r + +a—>/3) and the invariant 
matrix element 2flt(ir + -h« —*/3) are related by 

r(^+« -> 0 ) -IJg TO(T++a _ 0 ). 

The factor of proportionality is just the product of the normaliza¬ 
tion factors for the wave functions of ir + , a and of all the particles 
in /3. The S matrix is given in terms of 7 by 

Sfi = pi) T. 


da 

d&i 



(k 0 2 -M r 2 yi 2 


Mi\ 2 k20 2 

- J —■gA 

F / 4tt 3 


2 


r m?k 0 “I 2 

X 1- *(W) , (17) 

L 2 (MJk 2 o+m?ko)J 


where a(W) is the total cross section at total energy W 
in the rest frame, for T + +a —» 0. Also, 

k2o= (M 2 +2M a E- W 2 )/(2W ), 

E is the neutrino energy in the laboratory frame, and 
F— M v g r /(2MN)& 1.0. The formulas for v+a— >/ + +0 
corresponding to those given above are obtained by 
replacing by ir~. 

Use of Corollary 1 to test PCAC does not require 
knowledge of the neutrino energy spectrum, since for 
a given W the energy, angle and polarization distribu¬ 
tions of the particles in are independent of the neu¬ 
trino energy E. Testing PCAC by making a quantitative 
comparison of da/d&i with Eq. (17) of Corollary 2 does 
require a knowledge of the neutrino spectrum. Since all 
dependence on E in Eq. (17) is contained in the factors 
W[l— 2 m?ko(M v 2 k 2 o+rn?ko)- l +, the weighting over 
the spectrum is easy to carry out once the spectrum is 
known. 


IV. EXTRAPOLATION IN fc* 

Theorem 2 requires that kof?/M v 2 be much larger 
than unity. This condition is necessary for it to be 
legitimate to extrapolate from k 2 ~m?kok 2 o~ l to k 2 
= —Mir 2 in the kinematics of the reaction v+a —> T~ +0. 
Since ko^(W-M a )(W+Ma)/(2W ), the condition 
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ko will be satisfied as long as W—Ma>^M v . 
Thus, for most weak multiparticle production reactions, 
Theorem 2 is valid as it stands. 

However, in the interesting case of single-pion pro¬ 
duction in the (3,3) resonance region, W—M a <AM r 
and Theorem 2 must be modified. This is done by re¬ 
placing £nT(7r + +Qf—> t+o/) by £fR c (7r + +a —» t+c/), 
where SfTC c is the invariant matrix element computed 
from the covariant amplitudes for ir + +a—>w+a r by 
using the correct kinematics, with ^—mfkokzo" 1 , for 
the reaction v+a—*l~+ir+ot'. When a is a single 
nucleon TV, 

3II(t++7V -» r+N')= (brW/M N )xSlfi(W,4-k) 

+*■$*■ kMW,4-k)lxi, (18) 

with fi and ft the usual center-of-mass pion-nucleon 
scattering amplitudes, 9 and with £ and k unit vectors, 
in the center of mass, along the momenta of the final 
and initial pion, respectively. Calculation of £HT C shows 
that 10 

2fR c (T++iV -> x+iV r, )= i(W,4-k) 

kg2(W ,$-*)]*, (19) 

where 

gi(W,4-k)^fi(W,x), 

_ gj(w, 4 -k)~ik 0 /(kj-Msy^Mw,x), ( 20 ) 

9 G. F. Chew, M. L. Goldberger, F. E. Low, and Y. Nambu, 
Phys. Rev. 106, 1337 (1957). In Eqs. (18) through (20), isotopic 
spin indices have been suppressed. 

10 The calculation is done as follows: We are considering the 
reaction i»(j6i)+iV(^i) —*/(A*)+iV’ , (s)-hT(g). Let us define the 
variables v and vb by v t =-~\{p\-\'s)‘q/MN and vB=\q m k/Mit. 
The corrected matrix element 2flt c is given by 

2Nl c =u (s) [A (vjVr) -iry-qB vir (v, v B ) ]« (pi), 

where A wir (v 9 v B ) and B wir (VjV B ) are the covariant amplitudes for 
pion-nucleon scattering. |_In pion-nucleon scattering, ir(tfi) 
-h-Y(^i) irfy+N'is), the variables v and v B are defined by 
v*=— \{p\+s)-q/Mit and v B = hq-qi/MN.'] Expressing Silt 6 in 


x=ih/(k 0 *-jf,*) i fl]a.A 

+lh{M*+W)/2W(k<?-MS)~\, 

and where £o= (W 2 —M^+M V 2 )/{2W). Clearly, when 
one finds that gi, 2 (W,£-£)«/i. 2 (W,£- h ), as 

is expected. 

If only the dominant (3,3) partial wave is retained, 
the main effect of Eq. (20) is to replace o- 3i3 (W) in 
Corollary 2 by (r^ i3 (W)ko 2 /(ko 2 — -MV). If, in additino, 
the lepton mass and nucleon recoil effects are neglected, 
Eq. (17) reduces to the result obtained from the static 
model by Bell and Berman. 11 This agreement with the 
static model is not surprising. The (3,3) projections of 
the Bom terms for weak pion production and for pion- 
nucleon scattering can be shown to satisfy the PCAC 
proportionality. In the static model, the entire matrix 
element is determined by the (3,3) projection of the 
Born term and by the experimental (3,3) resonance 
parameters. Hence, in the static model, the weak pion 
production and pion-nucleon scattering matrix elements 
satisfy the PCAC proportionality. 

Note added in proof. The considerations of this paper 
also apply to the decays 2*—» k-\-e ± +(v/v), when the 
electron is relativistic and emerges parallel to the neu¬ 
trino. For example, if CVC and PCAC are true, meas¬ 
urement of the differential decay rate in the parallel 
configuration would determine the strong 2AH coupling 
constant. 
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terms of pion-nucleon center of mass variables, using the kine¬ 
matics appropriate to weak pion production, leads to Eq. (19). 

II J. S. Bell and S. M. Berman, Nuovo Cimento 25, 404 (1962). 
Bell and Berman neglect all lepton mass effects. 


Tests of the Conserved Vector Current and Par¬ 
tially Conserved Axial-Vector Current Hypotheses 
in High-Energy Neutrino Reactions, Stephen L. 
Adler [Phys. Rev. 135, B963 (1964)]. The follow¬ 
ing corrections should be noted: (1) In Sec. II, 
line 5, m% should be nti \ (2) the third and fourth lines 
after Eq. (7) should read "momenta q Xt • q n of 
0i. * • *. Pn "; (3) in Eq. (9), (2£ 0 1/2 ) should be (2£ 0 ) 1/2 , 

and (7T+ \dT\ A /dx\ | 0) should be (t+ d$\ A /&ex|0); 
(4) on p. B966, column 2, line 8, "additino” should 
be "addition”; (5) in the Note added in proof 2AII 
should be 2 Att; (6 ) in Eqs. (14) and (15), p x should 
be p. Delete footnote 7. (1 wish to thank Dr. G. 
von Dardel for pointing out this correction.) 
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Consistency Conditions on the Strong Interactions Implied by a 

Partially Conserved Axial-Vector Current 

Stephen L. Adler* 

Palmer Physical Laboratory , Princeton University, Princeton, New Jersey and 
Bell Telephone Laboratories, Murray Bill, New Jersey 

(Received 15 September 1964) 

It is shown that a partially conserved AS = 0 axial-vector current (d\J\ A ~C<pA implies consistency con¬ 
ditions involving the strong interactions alone. The most interesting of these is a relation among the sym¬ 
metric isotopic-spin pion-nucleon scattering amplitude A wir (.+) y the pionic form factor of the nucleon K NNir , 
and the rationalized, renormalized pion-nucleon coupling constant g r : 

grVM = A ■*<+> (y = 0 ,v B = 0,k 2 = 0) /K™* (k 2 = 0). 

[M is the nucleon mass and —k 2 the (mass) 3 of the initial pion. The final pion is on mass shell; the energy 
and momentum transfer variables v and vq are defined in the text.] By using experimental pion-nucleon 
scattering data, we find that this relation is satisfied to within 10%, Consistency conditions involving the tt 
and the tA scattering amplitudes are stated. 


I N 1958 Goldberger and Treiman 1 proposed a re¬ 
markable formula for the charged pion decay 
amplitude, which agrees with experiment to within 
10%. Subsequently, Nambu, Gell-Mann and others 2 
suggested that the success of the Goldberger-Treiman 
relation could be simply understood if it were postulated 
that the strangeness-conserving axial-vector current is 
partially conserved. The partial-conservation hypothe¬ 
sis leads to a number of relations connecting the weak 
and strong interactions, of which the Goldbeiger- 
Treixnan relation is the simplest. 3 So far, only the rela¬ 
tion for charged pion decay has been tested experi¬ 
mentally. 

We wish to point out in this paper that, in addition to 
giving relations connecting the weak and strong inter¬ 
actions, the partially conserved axial-vector current 
hypothesis leads to consistency conditions involving the 
strong interactions alone} This comes about, as will be 
explained below, because under special circumstances 
only the Born approximation contributes to matrix 
elements of the divergence of the axial-vector current. 
The most interesting consistency condition is a non¬ 
trivial relation among the symmetric isotopic spin pion- 
nucleon scattering amplitude A tN{+ \ the pionic form 
factor of the nucleon K NN *, and the rationalized, 
renormalized pion-nucleon coupling constant g r ; 

g r 2 A rN(+) (v=Q, vb=0, #*=0) 


M K NN *(k 2 = 0) 

* Junior Fellow. Now at Physics Department, Harvard Uni¬ 
versity, Cambridge 38, Massachusetts. 

1 M. L. Goldberger and S. B. Treiman, Phys. Rev. 109, 193 
(1958). 

3 Y. Nambu, Phys. Rev. Letters 4, 380 (1960); J. Bernstein, 
S. Fubini, M. Gell-Mann, and W. Thirring, Nuovo Cimento 17, 
757 (1960); M. Gell-Mann and M. Livy, Nuovo Cimento 16, 705 
(1960); J. Bernstein, M. Gell-Mann, and W. Thirring, Nuovo 
Cimento 16, 560 (1960). 

*J, Bernstein, S. Fubini, M. Gell-Mann, and W. Thirring, 
Nuovo Cimento 17, 757 (1960); S. L. Adler, Phys. Rev. 135, 
B963 (1964). 


[Here M is the nucleon mass and — k 2 is the (mass) 2 of 
the initial pion. The final pion is on mass shell; the 
energy and momentum transfer variables v and v b are 
defined in Eq. (15) below.3 By using experimental pion- 
nucleon scattering data, we find that this relation is 
satisfied to within 10%. 

In Sec. I we define and discuss the concept of a 
partially conserved axial-vector current. In Sec. II, we 
derive the consistency condition relating the pion- 
nucleon scattering amplitude to the pion-nucleon cou¬ 
pling constant. In Sec. Ill, pion-nucleon dispersion rela¬ 
tions and experimental pion-nucleon scattering data are 
used to test whether the consistency condition is 
satisfied. In Sec. IV, other consistency conditions on the 
strong interactions are stated. 

I. DEFINITION OF PARTIALLY CONSERVED 
AXIAL-VECTOR CURRENT 

We assume that the weak interactions between 
leptons and strongly interacting particles are described 
by a current-current effective Lagrangian of the form 

—B e ff=/AWi\W+adjoint, (2a) 

where 

j\(x) = (l/v5)[^7\(l+75)^..+ lf,7\(l+75)^..Il (2b) 

is the weak current of the leptons and where J\ is the 
weak current of the strongly interacting particles. Let 
J\ r and J\ A denote the vector and the axial-vector 
parts of the strangeness-conserving weak current 

A(AS=0)-/ x y +A A . (2c) 

Definition: By partially conserved axial-vector cur¬ 
rent (PCAC) we mean the hypothesis that 

-B^MM r 2 g A (0)/ gr K™*(0n<p r +R. (3) 

Here M is the nucleon mass, M v is the pion mass, £a( 0) 
is the 0-decay axial-vector coupling constant Qu(0) 
= 1.2 -lO^/M 2 ], gr is the rationalized, renormalized 
pion-nucleon coupling constant (g r 2 /47r=14.6), and (p* 


4 Related ideas have been discussed within the framework of a 
model calculation by K. Nishijima, Phys. Rev. 133, B1092 (1964). 
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is the renormalized field operator which creates the ir + . 
The quantity K NNt (0) is the pionic form factor of the 
nucleon evaluated at zero virtual pion mass; K NNv is 
normalized so that K NNr (— M x 2 ) = 1. It is explained 
below how the constant multiplying <p v in Eq. (3) is 
chosen. In order to give content to the definition, we 
must specify properties of the residual operator R. We 
suppose that for states a and for which (5| <pr\a)r^0 f 
and for momentum transfer near the one pion pole at 
—M r 2 [say, for — M* 2 < (pp—p a ) 2 <M T 2 ], the matrix 
element of R is much smaller than the matrix element of 
the pion operator term. In other words, we postulate 
that if (5| (p v |a)^0 and if | (p$—p a ) 2 | <Af x 2 , then 

--«1. (4) 

\yiMMr 2 gA (0 )/grK™* (0)] | (P | <Pr | «> | 

In what follows, we derive equalities which hold 
rigorously if the residual operator R is zero. If R is not 
zero, but satisfies the inequality of Eq. (4), the “equals'’ 
signs should be replaced by “approximately equals” 
signs. The magnitude of the squared momentum trans¬ 
fer \{pp—p a ) 2 \ is understood to be always less than M x 2 . 

It is not actually necessary to specify the constant in 
front of (p r in the definition of PCAC. If we simply 
postulate that 

d>J^=C<pr, (5) 

the constant C may be determined as follows: Let us 
consider the matrix element of d\J\ A between nucleon 
states (N\d\J\ A N). Let p 2 and pi be, respectively, the 
four-momenta of the final and the initial nucleon, and 
let us denote by k the momentum transfer p%—p\. 
According to the usual invariance arguments, (N \ J\ A \ N) 
has the form 

( M M \ 1/2 

<A|A A |A>=(-) u(p2)\jA(k 2 ) 7X71 

'p20 P\W 

—fA(k 2 )tr\ v k v y 6 —ihA(k 2 )hy&y + u(pO , (6) 

where r + =i(ri+ir 2 ) is the isospin raising operator. 
From Eq. (6), we find that 

(N\d^ A \N)\ k >„ 0 

= —ik\(N | | AO | fc a «o 

/ M M \ 1/2 

= 2 Mg A (0)f-J u{p 2 )yf>T+u{px ). (7) 

'^p20 pm 

We also have 

{N\C^\N) 

=TT~i^\(--0+Mr 2 )^\N) 


k 2 +Mr 2 

c c 

■■ - (N | u I - igfflK**' (k 2 ) 

k 2 +Mr 2 k 2 +Mr 2 


MM \ i v 


'p2 0 ^10 


) ir* 

u(p 


where K NNv (k 2 ) is the pionic form factor of the nucleon. 
From Eq. (8), we find 

C 

(N\C<Pr\N)\ =- igrtfK"** (0) 


Mr 2 


M M \ m 


w(/> 2 )75T + «(/>i), (9) 


*p20 pm 

and comparing this with Eq. (7) gives 

C= —iV2MM r 2 gA (0)/g r K NN *( 0). 


( 10 ) 


If we form the matrix element of d\J\ A between the 
one pion state and the vacuum, we find that 

(2& 0 ) 1/2 (t + | d\J\ A 10) 

= -*m£M w *g A (0)/ g JC™'(0), (11) 

which is the Goldberger-Treiman relation for charged 
pion decay. For general states and a, such that 
{f$\ (px |a)^0, we find that 


(8|d\/\ A |a)= — 


iyJlMM 2 gA{ 0) 


x- {ik^n-^+cx -> 0 ). (12) 

k 2 +Mr 2 

Here ^is the transition amplitude for the 

strong reaction ir + +a— >0, where the (mass) 2 of the 
initial ir + is — k 2 — — (pp—p<^) 2 . Thus, we see that PCAC 
leads to a whole class of relations connecting the weak 
and the strong interactions. 

The definition of PCAC which we have given is not 
the same as the definition which would be suggested by 
a polology approach. This would be to define PCAC as 
the hypothesis that the covariant amplitudes contribut- 
Ing to (513\/\ A |a) satisfy unsubtracted dispersion rela¬ 
tions in the variable k 2 , and that these dispersion rela¬ 
tions, for k 2 \<M^ and for all values of the other 
invariants formed from four-momenta in a and /?, are 
dominated by the one pion pole. It is easy to see that if 
(51 d\A A | a) depends on invariants other than k 2 , the 
polology version of PCAC is ambiguous. Suppose that 
A is a covariant amplitude contributing to {/?|d\/\ A |a), 
and that A depends on two invariants, s and k 2 . Then 
the polology version of PCAC implies that 


A(s,k 2 )=A(s)/(k 2 +Mr 2 ), 


(13) 


2 )yhT + u{pi ), (8) 


where A is the residue of A oXk 2 = —M x 2 . Let us now 
define a new variable s*=s—ak 2 and treat A as a func¬ 
tion of independent variables s' and k 2 . To evaluate the 
residue we set every explicit k 2 equal to —Mr 2 - We then 
find from the polology version of PCAC that 

lns-ak 2 )+a(-Mr 2 )l Als’-aM/l 

A(s\k 2 )~ ---. (14) 

k 2 +Mr 2 k 2 +Mr 2 
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Fig. 1. Generalized Born approximation diagrams for 
WV|/x A W. The heavy dot marks the vertex where the operator 
J\ A acts. 


Clearly, Eqs. (13) and (14) differ unless A has no de¬ 
pendence on the variable s to begin with. In other 
words, the polology definition of PCAC is inherently 
ambiguous, since the value of the residue at & 2 — —M v 2 
depends on how the invariants other than k 2 are chosen. 

This ambiguity is not present in the definition of 
PCAC given in Eqs. (3) and (4). The reason is that k 2 
is at no point set equal to —Mf but is kept at whatever 
value it has in the weak matrix element (01 d\J\ A | a). We 
use the unambiguous version of PCAC in the remainder 
of this paper . 6 


II. CONSISTENCY CONDITION ON PION- 
NUCLEON SCATTERING 


In the previous section we saw, in Eq. (12), that 
PCAC leads to relations between the strong and the 
weak interactions. These allow one to predict the weak 
interaction matrix element (ft | 3\/\ A |a), if one knows the 
strong interaction transition amplitude ^(^H-a —> 0 ). 
The principal point we wish to make in this paper is 
that there are cases in which only the Born approxima¬ 
tion contributes to a covariant amplitude of (/3 d\J\ A \ a ), 
for appropriately chosen values of the energy, mo¬ 
mentum transfer and other invariants on which the 
covariant amplitude depends. The Bom approximation, 
in turn, is known in terms of weak and strong inter¬ 
action coupling constants. Using PCAC to eliminate the 
weak interaction coupling constants leaves a con¬ 
sistency condition involving the strong interactions 
alone. In this section, we study the matrix element 
(tN | d\J\ A | N) and derive the consistency condition 
stated in Eq. (1). In Sec. IV, we discuss conditions 
obtained from other matrix elements of d\J\ A . 

We begin by writing down the structure of the 
matrix element (t 2 V|./\ a |.A 0 . Let pi, p 2 , and q be, re¬ 
spectively, the four-momenta of the initial nucleon, the 
final nucleon, and the final pion. The momentum 
transfer k is given by k=p 2 +q—pi. We define invariants 
v and vb by 


v=-(pi+p z )-k/(2M), 

VB=q‘k/(2M). 



The matrix element can be decomposed into eight 


5 In a previous paper p5. L. Adler, Phys. Rev. 135, B963 (1964)] 
we used the polology version of PCAC. If, instead, the definition 
of Eqs. (3) and (4) had been used, 2fllGr + +a —* 0) in Theorem 2 of 
the paper would amply have been the invariant matrix element 
for r + +a —+0, with the initial r + of (mass) 8 * —k*. 


covariant amplitudes A j(v,VB,k 2 ) according to 


/p 10 p2 0 

(-2 k 

\M M 


o 


1/2 

(TiV|A A |iV> 


8 


=U(p 2 )i'H Of Aj(v,v B ,k 2 )u(pi ). 

J'-l 



The quantities Of are given by 6 


Of=\(qy\-y\q), 

0 2 K =(pi+p2)\, 

Of=tMy\, 


Of—ik(pi+p^)\, 

Of=ikq\, 

Of—k\, 

Of^ikkx. 



The amplitudes Afv,VB,k 2 ) have been chosen so that 
they have no kinematic singularities . 7 

The isotopic spin structure of the amplitudes 
Aj(v,VB,k 2 ) is specified by writing 

A j (v,VB,k 2 )=X f *}p a *Aj(v,VB J k 2 ) a $B + Xi , 

Aj(v t VB,k S )aa = Aj (+) (v,VB,k*)BatJ (18) 

+Af- ) (v,VB,k 2 )\[r a ,rf]. 

Here X* and X f are, respectively, the isospinors of the 
initial and final nucleon and yp a is the isotopic spin wave 
function of the final pion. [If the final pion is a t ± , 
2 _ 1 / 2 ( 1 , 0 ) a , while if it is a ir°, ( 0 , 0 ,l) a .j 

The quantity \I/ B + is defined by (l,i,0)p, so that 

Tpa+Tt 3 — r + . The presence of is just a reflection of the 

fact that the weak current J\ A transforms like 1 1 + 1/2 
under isotopic spin rotations. 

Let us split each amplitude Afv,VB,k 2 ) a & into two 
parts, 

Aj(v,VB,k 2 ) a p = Aj P (v,VB,k 2 ) a p+Aj(r,VB,k 2 )aa • (19) 

The part Aj p is defined as the sum of all pole terms 
contributing to A j, while Aj is simply everything that is 
left over when the pole terms are removed from A j. The 
amplitudes Aj p are calculated from the generalized 
Born approximation diagrams shown in Fig. 1. In each 
diagram, the heavy dot marks the vertex where the 
operator J\ A acts. The nucleon vertex of J\ A is given by 

r + [gA (£ 2 )Y\Y5- /a (£ 2 )°-\AY6 -ih A (k 2 )h Ys]. (20) 

Evaluation of the Born diagrams gives 

Oj x x / *i{/ a *Aj a $ p ip$ + x i u(pi) 

/=i 

= u(p 2 )Xf*p a *{ir a Y 6 gr[l/ (ps+q—iM)] 
Xr+\j A (k 2 )y\y6—f A (k 2 )a\Jt,y s —ih A (k 2 )k\yi] 
+r + {jgA{k 2 )y\yt—fA{k 2 )<T ) ,Ji„y fl —ihA{k 2 )k- k yi} 

X[l/(^i-g-iM)>V a Yf£r}X,-«(/>i), ( 21 ) 

from which the Aj p are easily obtained. Since the 
divergence of the terms proportional to /a(£ 2 ) vanishes 

6 The kinematic structure of the matrix element (riV|/x A |iV) 
has been discussed by N. Dombey, Phys. Rev. 127,653 (1962) and 
by P. Dennery, Phys. Rev. 127, 664 (1962). 

7 A simple modification of the argument used by Ball [[J. S. 
Ball, Phys. Rev. 124, 2014 (1961)] can be used to show that the 
amplitudes A y have no kinematical singularities. 
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identically and since the divergence of the terms pro¬ 
portional to h A (k 2 ) vanishes when k 2 =0 y we write down 
only the pole contributions proportional to g A (k 2 ): 




grg A (k 2 ) 

2M 







The amplitudes A 2 and A 4, **•, A a have no pole 
contributions proportional to g A (k 2 ). 

Let us now evaluate 


(ttN I d^ A \N)=- MtN I I N) 

at k 2 =0. Using the decomposition of (tN\J\ a \N) into 
covariants Aj , splitting each A j into parts Aj p and Ay, 
and evaluating the Aj p from Eq. (22), leads to the 
result that 


The amplitudes A vN (v, vb , & 2 =0) andU ,rJV (j', VB y k 2 =0) 
describe pion-nucleon scattering with the initial pion a 
virtual pion of (mass) 2 = — £ 2 =0 and with the final pion 
a real pion of (mass) 2 =Jf T 2 . 8 We have separated off the 
pole terms of B (^4 has no pole terms); B denotes 
everything which is left over after this separation is 
made. 

Comparing Eqs. (24) and (25), we see that the pole 
terms proportional to 



are identical. This is consistent with the requirements of 
PCAC. A remarkable fact emerges when we consider 
the equation for the A amplitudes, 

(l/y!T)[_— 2ify(Ai+A 2 )a/3+2if^A 3 aa+2g r ^4(0)5a/33 
= l^Mg A $)/g r K™*(G)^A* N ^ v Bj k 2 =0) a$ . (27) 

Let us set v—vb — 0. Since the Aj have all pole terms 
removed, and since they have no kinematic singu¬ 
larities, 

limy(Ai+A 2 ) = lim^A 3 — 0. (28) 

v —K) VB -*0 


l(pi*/M)(p 2 ,/M)2hy 2 (TrN\dyJx A \N)\ 

=fl(fr)X,V.*lf^(^ + )X i «(# 1 ) J (23) 

with 

M a p = A (v 9 VB)aQ — ikB (v y VB)aP , 

1 

A (v,VB)aP=* -{ — 2Mv(Ai+A 2 ) a p 

y/2 

-h2MvBAz a p+2grg A (0)d a p} , 

BiVfVBiaP** - 2MAi a p — MAiaB'\'2MvAz a B 

V21 _ (24) 

— 2 MVsAza^grgji ( 0 ) 


X K J -It) 


Hence at v=vB=k 2 =0j all the unknown amplitudes 
drop out. Equation (27) then becomes 



A* N (v= 0, VB = 0, k 2 =0) a p 
K NN *( 0) 



Decomposing A a p rN into symmetric and antisymmetric 
isotopic spin parts, 


A a ^=A^S a? +A (30) 

gives 

g 2 A*"W(v=0, v B =0, k 2 =0) 

-=-, (3D 

M K NN *(ff) 

0 =^^(-)(„ =0> „ b =0,£ 2 =0). (32) 


+ iCT«,Tfl](-1-) 

M ’B—V J’B+J'/- 


According to the PCAC hypothesis, we can also evaluate 
(ttN I d x Jx A | N) as (tN I c<p, I N). This gives 

\3M Sa (0) 

M a p= -;— [A Tli ( V, VB, k 1 = 0) a p 


g r K””*( 0) 

V2Mg A ( 0) 
gr K«^(0) 


— ikB* N (v, v B ,k*= 0).J 
A rN (y, VB, k 2 = 0) a f, 

g<~ 


-ikB* N (v, VB, k 2 =Q) a „-ik — 0 ) 

2 M 


X s a /—-—) 

\vb~V VbA’VI 


+![’•«, tA- -1--)"]} . (25) 

\VB~V VB“hv/-l) 


Equation (32) is automatically satisfied by virtue of the 
odd crossing symmetry of A rN< -~K Equation (31) is a 
nontrivial consistency condition waich must be satisfied 
if PCAC is true. 

We saw above that the pole terms, which are the only 
pion-nucleon scattering terms of second order in the 
coupling constant g r , do not contribute to the amplitude 
A * JV(+) . The leading term in the perturbation series for 
K NNir is 1. Consequently, if A vNi+) /K NNir is expanded 
in a renormalized perturbation series, no term of order 
g r 2 will be present. Thus it is clear that the consistency 
condition is not an identity in the coupling constant. 
This makes it fundamentally different from relations 
obtained from unitarity or from crossing symmetry, 
which are always true order by order in perturbation 
theory. 

8 Pion-nucleon scattering with the initial pion virtual has been 
discussed by E. Ferrari and F. Selleri, Nuovo Cimento 21, 1028 
(1961) and by J. Iizuka and A. Klein, Progr. Theoret. Phys. 
(Kyoto) 25, 1017 (1961). 
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Note that a similar consistency condition cannot be 
derived for the B amplitudes, since the presence of the 
terms 2 MA\—MA 4 in Eq. (24) prevents the elimination 
of the unknown amplitudes A \ and A 4 . 

In the next section, the condition of Eq. (31) is com¬ 
pared with experiment. Before going on to do this, let 
us summarize the properties of J\ A that were actually 
used in the derivation. Nowhere did we use the fact that 
J\ A is the weak axial-vector current which couples to 
the leptons. Clearly, the consistency condition may be 
derived if the following two conditions are met ; 

(i) There exists a local axial-vector current J \, the 
divergence of which is proportional to the pion field, 

dxJx=C<p T ; (33) 

(ii) In the nucleon vertex of J \, which apart from 
isospin is 

(N | J\\N)=u(p 2 )ZG(k 2 )yyy 5 

-F(k?)<r\ n k,y 5 -iH (F)£*y , (34) 

the form factors G, F, and H are finite at & 2 =0, and fur¬ 
thermore, G(0j is nonvanishing. In the matrix element 
(rN |/\| N), the covariant amplitudes Ai^.^iv^sfi 2 ) 
are finite at v=v B =k?=0 once the poles which arise 
from the Born-approximation (one-particle intermedi¬ 
ate state) diagrams are subtracted off. [Except for the 
requirement that G(0) be nonvanishing, these condi¬ 
tions are necessarily satisfied if the form factors and 
covariant amplitudes in the two matrix elements of J\ 
satisfy the usual spectral conditions, that is, if their 
singularities as functions of the complex variables k 2 , v 
and vb arise only from allowed intermediate states.] 

Condition (ii) and the requirement of locality are 
essential for the derivation to go through. They are 
very restrictive conditions, and it is easy to find axial- 
vector currents which do not satisfy them but which 
obey Eq. (33). For instance, the current J\ defined by 


K NNx (k 2 = 0), requiring us to use a model to calculate 
the difference, 

^*(+>(,,=0, y B =0 , ife»=0) 

K™*(k 2-0) 

-A r *<*>(v= 0, VB = 0, k 2 = -M r 2 ). (36) 

We first give several alternative ways of using pion- 
nucleon dispersion relations to calculate the on-mass- 
shell amplitude. We then discuss a model for going off 
mass shell in k 2 , and summarize the final results. In the 
remainder of this section, we take the charged pion mass 
to be unity. In these units the nucleon mass is M = 6.72 
and 9 

g r 2 /M— 27.4zL0.7. (37) 

The equations used in making the calculations de¬ 
scribed in this section are derived in the Appendix. 

A. Evaluation of A rN W (v = 0, v^=0, k 2 = — l) 

We wish to evaluate the on-mass-shell amplitude 
^ 4 *wc+)(o, 0 —1). Since the point V = VB =§ is not a 
physical one, we must use pion-nucleon dispersion 
relations to compute A (0, 0, —1) from scattering 
data. The fixed momentum transfer dispersion relation 
satisfied by A tN(+) (v, vb , —1) is 10 

(^ vb ? _ 1) 

1 r 00 

=- / d/ImA'*<+>(/, VBj -l) 

^ J vo+vjj 

xFtUM ’’0=1+1/ (2M ). (38) 

Lv —v v-jrvJ 

Since the integral in Eq. (38) probably does not con¬ 
verge, it is necessary to introduce a subtraction. 


J\=Cd\ / D(x-x')(pi r (x')d 4 x', 


D(x) = - 



- d*k, 

(lirYk 2 



satisfies d\J\ =C<p T , by construction. But J\ is not 
local, and in the nucleon vertex of J \, G(£ 2 )=0 and 
H(k 2 )a 1/k 2 , so that (ii) is violated. 


ni. DISPERSION RELATIONS TEST OF 
CONSISTENCY CONDITION 

In this section, we use pion-nucleon dispersion rela¬ 
tions and experimental pion-nucleon scattering data to 
test whether Eq. (31) is satisfied in nature. By using 
dispersion relations, the on-mass-shell amplitude 
A tN ^ +) ( y =0, vb=0 , k 2 = —M t 2 ) may be calculated from 
scattering data. However, Eq. (31) involves the off- 
mass-shell combination ^4 xJVC+) (^=0, ^=0, ^ 2 =0)/ 


1, Threshold Subtraction 

The usual procedure is to make a subtraction at 
threshold. This gives 

2 r m dv 

A'^+>(0,0, -l)=^ T ^k0,-1)— / — 

TT J V ' 


X- 


Im A* N M(v\ 0, -l)vo 2 
(v' 2 -vo 2 ) 


(39) 


which has a strongly convergent integral. The integrand 
can be calculated in terms of phase shifts. The integral 


9 The coupling constant g T * is related to the coupling constant J 2 
by g r *«4ir*4Af*/ 1 . We use the value Jf «0.081 ±0.002 quoted by 
W. S. Woolcock, Proceedings of the Aix-en-Provence International 
Conference on Elementary Particles (Centre d’Etudes Nucl^aires 
de Saclay, Seine et Oise, 1961), Vol. I, p. 459. 

10 G. F. Chew, M. L. Goldberger, F. E. Low, and Y. Nambu, 
Phys. Rev. 106, 1337 (1957). 
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was evaluated using the phase shift analysis of Roper 11 
up to a pion laboratory kinetic energy of TV=700 MeV, 
where the integral was truncated. A convergence check 
indicated that the truncation error is small. The result 
is 


2 

T 


»oo 


*0 


dv' w x *<+>(/,o, -iw 

/ v' 2 — v£ 


= 7.4. 



We make no error estimate here since Roper gives no 
error estimate for his phase shifts. 

The threshold subtraction constant can be expressed 
in terms of scattering lengths by 

A'M+Hvo, 0,-1) 


4tt 



eo 


X [W 3/2) +W 1/2) } 

2-0 


(;+i)[2(H-i)]i 
2’«[(*+1)!] 2 


-2itf E {![ai- (3/2) -a^ (3/2) ] 

2—1 

-, (41) 

2 l (ll) 2 


where is the scattering length in the channel with 
isospin /, orbital angular momentum l , and total angular 
momentum 7=/±J. Equation (41) is rapidly conver¬ 
gent and it suffices to keep only the 5-, P-, Z)-, and 
F-wave scattering lengths. Using the S- and P-wave 
scattering lengths quoted by Woolcock 12 and obtaining 
D- and F-wave scattering lengths from Roper’s poly¬ 
nomial and resonance fits to the phase shifts, gives 


A r * M (vo, 0, -1)=37.3±0.7, 
A tNW ( 0, 0, -1) = 29.9±0.7. 


The threshold subtraction constant arises almost en¬ 
tirely from the P-wave scattering lengths. The error 
estimates take into account only the errors in the 5- 
and P-wave scattering lengths quoted by Woolcock. 

Alternatively, we can obtain all scattering lengths 
from the threshold behavior of Roper’s fits to the phase 
shifts, giving 

4'*<+>(* 0 ,0, -1)=40.7, (43) 

-4'*<+>(0,0, -1) = 33.3. 


2. Broad Area Subtraction Method 

There is a fairly large discrepancy between Woolcock’s 
scattering lengths and the threshold behavior of Roper’s 

11 L. D. Roper, Phys. Rev. Letters 12, 340 (1964) and private 
communication. We use Roper’s / m =3 phase shift fit for pion 
laboratory kinetic energy T r in the range 0<P T <700 MeV. In 
terms of v and vb, T r ~v — vb — vq. 

13 W, S. Woolcock^ Ref. 9. Woolcock’s results are quoted in J. 
Hamilton, P. Menotti, G. C. Oades, and L. L. J. Vick, Phys. Rev. 
128,1881 (1962). The slightly different scattering lengths proposed 
by Hamilton et id. give the same result for 0. -1) as do 

Woolcock’s. 


phase shifts. This suggests that it would be desirable to 
perform the subtraction in a manner which does not 
weight threshold behavior so heavily. We give a method 
which effectively smears the subtraction over a finite 
segment of the real axis and has the additional ad¬ 
vantage of containing a built-in consistency check on 
the phase shift data used. Let us consider the function 


4 '*<+>(*, vb= 0 , £ 2 =-l) 

[(?— ^o) (H“ Po) (v— Vrn ) (H- Vm)'] 112 


(44) 


where v m >vo lies on the physical cut. Since F(v) ap¬ 
proaches zero at v— a>, we can write an unsubtracted 
dispersion relation 

1 r 00 AF(^)/ 1 1 \ 

F(w) = -\ dv 1 -—(--h--), (45) 

ir J vo 2 i \v—v v-\-vf 


where A F(v')=F(i/+h)—F(v—ie) is the discontinuity 
of F across the cut from vo to oo. The square root in the 
denominator has opposite signs on the opposite sides of 
its cut from vo to v m and has no cut from v m to oo . 
Consequently, 


AF(/) Re4'*<+>(/, 0, -1) 

2 X L(v— Vo) (v+ Vq) ( V m — v) ( V m + v')'} 12 


V0<V <.Vtn, 


AF(v') 


ImA xJV <+)(/, 0, -1) 


(46) 


2 i [_(v—Vo)(v , -{-Vo)(v— V m ) ( V r + Vm) ~J i2 

Vm<V<<*> , 

giving 


A xvc+) (o, o,-l) 

2 p dv' Re4 xJV( +>(/, 0,-l)vovm 
* J yo V Z(v— Vo)(v +Vo)(v m — v') (v m +v)] 112 
2 r- dv' ImA **<+>(/, 0, -l)vov m 
TfJym V Z(v—Vo)(^+Vo)(v'—V m )(v'+V m )y 12 

(47) 


This equation involves Re4 rN(+) over a segment of 
finite length, not just at threshold. In the limit as v m 
approaches vo, Eq. (47) becomes identical with Eq. (39) 
for the threshold subtraction. The fact that Eq. (47) 
involves no principal value integrals makes numerical 
evaluation easy. 

If the exact values of Re4 xJV(+) and Im4 xJV(+) were 
used to evaluate the integrals, Eq. (47) would clearly 
give the same answer for all values of v m between and 
oo. Thus, by varying v m we can check the consistency of 
the phase shifts used to evaluate ^4 TjV( + ) (i' / , 0, —1). 

Using the phase shift data of Roper and integrating 
up to a pion laboratory kinetic energy of 700 MeV gives 



146 


Adventures in Theoretical Physics 


B 1028 STEPHEN L. ADLER 


Table I. versus length of the square-root cut, as calculated by using fixed momentum transfer dispersion relations. The upper 

end of the square-root cut lies at pion-nucleon center-of-mass energy W m « At threshold, <a n = 1, and at the peak of the (3,3) 

resonance, « m *=2.1. In terms of <a m , v m is given by Vm^VB+um+ttinP/T-M. 



1.7 

1.8 

1.9 

2.0 

2.1 

2.2 

2.3 

mm 

2.5 

^<+>(0,0, -1) 

{v B = 0) 

29.70 

29.43 

29.16 

28.90 

28.66 

28.44 

28.23 

28.02 

27.83 

A tN(+) (o^ — 1/2M, -1) 
(**“-l/2Af) 

26.73 

26.54 

26.36 

26.17 

26.00 

25.83 

25.67 

25.51 

25.36 

D 

2.97 

2.89 

2.80 

2.73 

2.66 

2.61 

2.56 

2.51 

2.47 


the results shown in Table I. It is convenient to intro¬ 
duce a parameter co m , such that the upper end of the 
square-root cut lies at pion-nucleon center-of-mass 
energy In terms of co m , the parameter v m 

is given by v m ~ VB+o} m +ca m 2 /2M t In changing co m from 
1.7 to 2.5, we move the upper end of the square-root cut 
across the peak of the (3,3) resonance, thus considerably 
altering the distribution of the integral between the 
two terms in Eq. (47). Still, the total varies by less than 
10%, indicating that Roper's phase shifts are reasonably 
consistent with dispersion relations in the (3,3) reso¬ 
nance region. The end of the cut was not taken greater 
than co m = 2.5 to avoid introducing a large truncation 
error from extending the integrals only to 700 MeV. A 
convergence check indicated that in all cases shown in 
Table I the truncation error is small. 

The result of this analysis may be stated as 

( +>(0,0, —1) = 28.7 ±0.9, (48) 

where we have taken as the error estimate the variation 
of A xJV(+) as co m is moved across the peak of the (3,3) 
resonance. 

3. Alternative Broad-Area Subtraction Method 

As a further check, we have used an alternative 
method to evaluate ^4 xJVC+) (0, 0, — 1). Let us write 

A xJV(+) (0,0, -1) = D+A '*(« („=o, vb= ~ 1/2 M, -1 ) , 

D=A* N (+)(ir=0, **=0,-1) (49) 

-A r *M(v= 0, vb= — 1/23/, -1). 

In other words, we add and subtract the quantity 
^4 xJVC+) (0, —1/2if, —1). In the difference term Z), we 
evaluate ^ xJV(+) (0, — l/2if, — 1) by using the fixed 
momentum transfer dispersion relation for ^4 xJV(+) . ^See 
E.q. (38)3 with a broad area subtraction. This is just the 
method used above to evaluate ^4 xJV(+) (0,0, —1). The 
results are shown in Table I. Clearly, in forming the 
difference D of the amplitudes for different values of 
momentum transfer vb , much of the variation of the 
result with co m cancels out. This is probably not acci¬ 
dental. If we take as error estimate the variation of D 
as ca m is moved across the (3,3) resonance peak, we find 

Z>= 2.65±0.3. (50) 


We now add back A xJV(+) (0, —1/2 M, — 1) evaluated 
by an independent method. Let us recall that vb = — 1/ 
(2 M) corresponds to forward pion-nucleon scattering. 
Since the even isotopic spin forward scattering ampli¬ 
tude is given by 

F<+>(v)=A r x™(v, -1/2M, -1) 

+ *B x *<+>(r,-l/2Jf, -1), (51) 

we have 

FW(0)=;4 x *«->(0, -1/2M, -1). (52) 

Thus, we can use ordinary forward dispersion relations 13 
to evaluate ^4 xJVC+) (0, —1/2 M, —1). Making a broad 
area subtraction gives 

A xjv(+)(o, -l/2M,-l) 

_gr 2 _ Vm _ 

M Z(v m 2 - 1/AM 2 ) (1—1/4J/ 2 )] 1 ' 2 
2 r Vm dv RcFW(/)f. 

T J1 / Zi.V— 1) (/+1) (v M — v) (Vm+ v)^ 12 

2 f*dv’ Im FW(v')v m 

W, m "7 [(/-D^+D^-rJ^+rJ]^' 

(53) 

We recall that 
4 tt 

ReF c+> (i/)=— [2Mv-\-M 2 -\- l3 i/2 Re(/i (+) +, (54) 

M 

where /i (+) and / 2 (+) are the usual center-of-mass 
[isospin (+)] pion-nucleon scattering amplitudes. 
Furthermore, 13 

ImFW(/)-K^ 2 -l) 1/2 [^(/)+a_(/)], (55) 

where <r+(v) and <r_(i/) are, respectively, the total ir + p 
and ir~p cross sections. To evaluate the integrals, we 
used Roper's phase shifts for laboratory pion kinetic 
energies below 700 MeV. Above 700 MeV, we used the 
tabulation of <r + and given by Amblard et al. u and the 

18 For example, see the article by J. D. Jackson in Dispersion 
Relations , edited by G. R. Screaton (Interscience Publishers, Inc., 
New York, 1961), p. 38. 

14 B. Amblard et at Phys. Letters 10, 138 (1964). 
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Table II. A 9ir W versus length of the square root cut, as calcu¬ 
lated by using forward scattering dispersion relations. 


Wm 1.5 

2.1 

2.7 

3.3 

3.9 

A rf«+) (0, -1/2 M, -1) 26.33 

26.23 

26.15 

26.09 

26.07 


asymptotic region fit of Von Dardel et al. n The results, 
shown in Table II, give 

<+>(0, -1/2 M f — l) = 26.15zh0.2, 

(56) 

A* N ^( 0,0, — 1) —28.8±0.4, 

where we have taken as the error estimate the variation 
of ^4 irJvt+) (0, —1/2 M, —1) as co m is varied from 1.5 to 
3.9. We have not included in the error estimate the 
error in the factor g r 2 /M appearing in Eq. (53), since 
when we divide by g?/M to compare the left- and right- 
hand sides of Eq. (31) this error drops out. 

The values of .4 tWC+ >(0, —1/2 M t —1) obtained by 
using fixed momentum transfer dispersion relations 
(Table I) and forward scattering dispersion relations 
(Table II) are in excellent agreement. When fixed mo¬ 
mentum transfer dispersion relations are used, the total 
result for ^4 irJV(+) (0, —1/2 M, —1) comes from the inte¬ 
gration over the physical cut. By contrast, when forward 
scattering dispersion relations are used, nearly all of the 
total comes from the pole term in the dispersion rela¬ 
tions, which leads to the term (gr 2 /M)v m Z(v m 2 — 1/4M 2 ) 
X(l-l/4Af*)]- 1 '* in Eq. (53). Thus, the two methods 
“sample” pion-nucleon scattering in very different 
ways. Their agreement gives us confidence that the 
numbers obtained from the dispersion relations calcula¬ 
tions are reliable. 


B. Model for Going Off Mass Shell in k 2 

In order to compare the consistency condition with 
experiment we must calculate the difference 

0, -1). (57) 

To motivate the model which we use, let us return for a 
moment to the fixed momentum transfer dispersion 
relation for A vm+) (y t 0, —1), 

A*m+>(v 9 0,—1) = - f dv Im J 4 1rJV(+) (/, 0, — 1) 

X [”-+V^]- (») 


Let us proceed as if no subtractions were necessary. We 
evaluate the integral by keeping only the resonant (3,3) 
state in the integrand and going to the static limit. This 
gives 


32 1 

A**M( 0 , 0 , - 1 )»— Mr- 

3 T 




15 G. von Dardel et al., Phys. Rev. Letters 8, 173 (1962). 


where |q| is the pion center-of-mass momentum and 
where / 3 ,3 is the resonant (3,3) partial wave amplitude. 
According to Chew et a/., 10 in the narrow resonance 
approximation one finds that 


giving 


1 

T 


” Im/3,3 gr 2 

do )—-—~-, 

l | q | 2 12 irM 2 


A (0, 0, -1)«(8/9) (g r 2 /M )« 24.4. 


(60) 

(61) 


This number is in good agreement with those obtained 
above by the proper procedure of using subtracted 
dispersion relations. The fact that a (3,3) dominant, 
unsubtracted dispersion relation calculation gives a 
reasonable result for J 4»^(+)(0, 0, — 1) suggests that 
such a calculation may also give a reasonable estimate 
of the change in j 4 tW(+ > produced by going off mass 
shell. Thus, as our model for going off mass shell in £ 2 , 
we take 


A=lA*^ + ^0,0fi)/KXx*(0)l-A* N M(0, 0 , - 1 ) , 
2 ndv' r^3 l3 irJV(+) (/,0,0) 


2 r 00 dv' r. 
=- / —Im - 

TrJro v L 


0 ) 

-^3,3** (+) (/,0,-l)], (62) 


where the subscript 3, 3 indicates that only the resonant 
partial wave is to be retained. 18 

The integral in Eq. (62) can be evaluated once the 
off-mass-shell partial wave amplitude /^(Z, P = 0) is 
known. It turns out that in the (3,3) resonance region, a 
very good estimate of / 3[3 (/, £ 2 = 0) is given by 

/».»*(/, P=0) 

/».«(/, P-0)«/,..(/, P=-l) ---, (63) 

where / 3 , 3 5 denotes the (3,3) projection of the Born 
approximation. 17 Roughly speaking, the reasons for the 
validity of Eq. (63) are: 

(i) Equation (63) gives / 8l3 (i/, ^=0) the phase of the 
(3,3) on-mass-shell amplitude, as is required by 
unitarity. 

(ii) The left hand, or “potential” singularity of 
/8,3(/,P) nearest to the physical cut is determined 
entirely by / 3 ,3*0'\P). Multiplying / 3 , 3 (/, — 1) by 
/3 1 3 ' b (^,0 )// 3 ,3' b (/, —1) gives the right-hand side of 
Eq. (63) approximately the correct nearly potential 
singularity structure for /s.a(//)). A detailed numerical 
analysis 18 indicates that the error involved in using Eq. 

16 A justification for this model would be provided if one could 
provethat A(i»)»il Tj ' r(+) (i» l 0,0)/ir j ' rj ' rT (P)——1) satis¬ 
fies an unsubtracted dispersion relation in the variable v. Then 
A(0) could be expressed as an integral of ImA(p) over the physical 
cut. Since only the (3,3) phase shift is appreciable at low energy, 
It would be reasonable to keep only the (3,3) partial wave In 
ImA(^). 

17 E. Ferrari and F. Selleri, Nuovo Cimento 21, 1028 (1961). 

18 S. L. Adler (to be published). 
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(63) for 0), in the (3,3) resonance region, may be 
as small as half a percent. 

Since is proportional to K NNv ( 0), the 

pionic form factor of the nucleon drops out of the 
calculation. Substituting Eq. (63) into Eq. (62) and 
doing the integration numerically gives the result 

A*-0.5. (64) 

Hence the model we have used indicates that extrapo¬ 
lation off mass shell has only a small effect, of order 2% 
of g r z /M. This figure corresponds to the fact that the 
two terms in the integrand of Eq. (62) cancel up to 
small terms of order M v 2 /M 2 , which is about 2%. The 
need to use a model is unfortunate, and the extrapola¬ 
tion off mass shell is the least certain aspect of the 
comparison of Eq. (31) with experiment. However, the 
apparent smallness of A indicates that the model would 
have to fail very badly for there to be an appreciable 
effect on the numerical results. 

C. Summary 

Adding the —0.5 obtained from going off mass shell 
to the results of Subsection A gives the final results 
shown in Table III. They indicate that unless the model 


Table III. Final results for A r - y < +) (0,0,0) M/K N ^ (0) g r 3 . The error 
estimates are obtained as indicated in the text. 


Method 

Result 

Error 

estimate 

Threshold subtraction, using Woolcock’s 
S- and P-wave scattering lengths. 

1.07 

±0.04 

Threshold subtraction, using Roper’s phase 
shift fits for all scattering lengths. 

1.20 

4 • • 

Broad area subtraction, using fixed mo¬ 
mentum transfer dispersion relations. 

1.03 

±0.04 

Alternative broad area subtraction method, 
using forward scattering dispersion 
relations. 

1.03 

±0.015 


used for going off mass shell is badly in error, the con¬ 
sistency condition of Eq. (31) is satisfied to within 10%, 
and quite possibly to within 5%. This fact, together 
with the success of the Goldberger-Treiman relation, 
suggests that the PCAC hypothesis deserves further 
study. 

IV. OTHER CONSISTENCY CONDITIONS 

The consistency condition on pion-nucleon scattering 
is not the only condition on the strong interactions 
which is implied by PCAC. In this section, we discuss 
briefly the conditions connected with several other 
scattering amplitudes. 

A. Condition on Pion-Pion Scattering 

Let us consider the pion-pion scattering reaction 
illustrated in Fig. 2. 19 Let (pi fit), (p 2 ,fi) be the four- 

18 G. F. Chew and S. Mandelstam, Fhys. Rev. 119, 467 (1960). 


momenta and isospin indices of the initial pions, and 
(pa,of), (pifl) the four-momenta and isospin indices of 
the final pions. We take all four-momenta to be in¬ 
coming, so that the condition of energy-momentum 
conservation reads 

/>l+/>2+/>3+/>4=0. (65) 

We introduce the standard Mandelstam variables s , t, u 
by 

s ~ (pi+p2) 2 = (pa+pd 2 , 

t= (pl+/»4) 2 = (p2 + /»s) 2 , 

u= (pl+p 3 ) 2 = (p2 + pif , 

s +$+ u=pi -\-pi+P 3 2 -r pi . 

The isospin structure of the pion-pion scattering matrix 
element is 

(16/>io/> 2 o/> top 4 o) 1/2 (frir° ut | tit in ) 

=$a'^p*M vv (s,t,u) afiia 'p\p a \p0' (67) 

From the requirement that the scattering amplitude be 
symmetric under interchange of the pions, we find that 

M" (S f t,u) a fi,a' $ ,== A v *(s | A VV (t \ U,s)8 aa '8$0' 

-\-A rr (u [ t t s)8 a p'8fia', (68) 

where ^4 1r1r (j| i,u) is a symmetric function of t and u. A vr 
also depends on pi 2 , p 2 2 , pi, and pi. It is easy to see that 
at the symmetric point s=t=u= (pi+pi+pi+pi)/3, 
A vv is left invariant by the interchange pi 2 pi, by the 
interchange pi <-► pi, and by the simultaneous inter¬ 
changes pi <-> pi, pi <-> pi. 

Let us now consider the axial-vector matrix element 
(tt | J\ A | t). Let p 2 =k be the momentum transfer and $ 
the isospin index associated with the current J\ A , while 
we take (pi,a), (ph 0 *-), and (ptjfi') to be the four- 
momenta and isospin indices of the three pions. The 
isospin structure of the axial-vector matrix element is 

(8/>lo/»30^4o) 1/2 (7T7r | J\ A | t) 

= (69) 

Defining Mandelstam variables as above, we find that 
the amplitude M(s,t,u) a ff,a'p' x is given by 

= D4ifcU,«) (pz~^pi)'k~^A2(s\t J u) (pa—pdx 
+ ^4 a(s | t,U)p\\]8a$a f $'-\-[_A l(2 [ U,s) (p l+/>3)\ 

+ A 2 (t | U,s) (pi — pi) X+^4 3 (i I U,s)pA\\baa’8M f 
+ LA i (u | tj s) (p i+ pi) x+ A 2 (u | t,s) (p i —p 4 ) x 

+ ^4 3 (u I M^SX^a/S'Sfla' , (70) 

where A\(s\t,u) and Aa(s\t,u) are symmetric functions 
and As(s\t,u) is an antisymmetric function of the 
variables t and u. 

There are no pole terms which contribute to the 
amplitudes At, As, and A$ of Eq, (70). Thus, when 

ps'p\—p2’p%=p2 m p\—^, (71) 
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we have p 2 \M{s,t,u) a ^ l a t ^=^ J in other words, 

(tht [ d\J\ A | t)=0. (72) 

Equation (71) implies that we are at the symmetric 
point 

s=t=u=k 2 —M*. (73) 

Since s-\-l-\-u= — 3Af,r 2 +& 2 , we see that Eq. (73) can be 
satisfied when k 2 — 0, giving the result that (trir \ d\J\ A \ ir) 
vanishes when k 2 =0 and s=/=w= —M*. The PCAC 
hypothesis allows us to write 

(7nr|dxA A |7r) = C(7r7r| <Pw\ir). (74) 

Consequently, PCAC implies that 

A™(s=-M^t=-M*\u=-M v 2 \k?=0)=0, (75) 

where — k* is the (mass) 2 of one of the four pions and 
where the other three pions are on mass shell. 

Comparison of Eq. (75) with experiment will be 
difficult, since the effect of one of the pions being off 
mass shell is very likely to be important. In particular, 
the negative of the pion-pion amplitude at the on-mass- 
shell symmetric point, 

(76) 

is just the effective pion-pion coupling constant 19 and is 
not zero. 


B. Condition on Pion-Lambda Scattering 20 


The derivation in this case closely parallels the 
derivation given in Sec. II for the condition on ttN 
scattering. The generalized Born approximation dia¬ 
grams for (tA || A) are shown in Fig, 3, In the 
derivation of Sec, II, we make the replacements 

igr$Nyb'vl'N ■ igAs(^S-T5^A+^A75^S + )^ir-l- (77) 


to define the A2 t strong vertex 21 ; 


gA^Nj\y5T + ^N —*■ 

gA AS (^S + 7x75^A+^A7X75^S _ )-l- (78) 

to define the AS weak vertex; and 

A af} vN - ikB a p rN —> (A vA - ikB rA )S a p (79) 


Fig. 2. Four-momenta and isospin 
indices for mr scattering. 



\ 




References dealing with irA scattering are given by T. L. 
Trueman, Phys. Rev. 127, 2240 (1962). 

31 M. Gell-Mann, Phys. Rev. 106, 1296 (1957). 



Fig. 3. Generalized Born approximation diagrams for (tA | J\ A | A), 


to define the tA scattering amplitudes. Equation (27) 
becomes 


— 2M r(Ai+^2)+2Jlf r^3+2gAsgA AS (0) 


X 



1 


vb ~ r+cr 


1 


vb~{~ r+<r 



g A A H0)(M A +Mz) 

gA S * AS (0) 


A* A (v, VB, k 2 =0) , 


(80) 


where <r= - (pi+pi) ■ h/(2 Ma), 

VB=q m k/ (2M a ), and where K AX is the form factor of the 
ASt vertex, normalized so that AT AS (— M v 2 )= 1. Setting 
v=vb—0 gives the consistency condition 

0=A* a (v=0 9 vb= 0,£ 2 =0). (81) 

This is a null condition and thus differs greatly from the 
condition derived for irN scattering. The difference 
arises from the fact that the intermediate state baryon 
in the generalized Born approximation for (tA | J\ A | A) 
is a 2, which has a mass unequal to that of the external 
A. This makes the quantity <r in Eq. (80) different from 
zero, with the result that the coefficient of 2g A sg A AS (0) 
vanishes when v and vb are set equal to zero. In the case 
of irN scattering, <r is zero, and a nonnull condition on 
A is obtained. It would be an interesting problem to 
try to determine from a study of tA scattering whether 
Eq. (81) is satisfied. 


C. Other Reactions 

The space-spin structures of (irK\j\ A \K) and 
(t(2 ,E) |/x A | (2,E)) are similar to the space-spin struc¬ 
tures of (7nr[/x A |x) and (irN \ J\ A \ N), respectively. 
Consequently, there will be consistency conditions on 
the wK , the t 2 , and the tE scattering amplitudes. Since 
(t( 2 ,E) |/x a | (2,E)) has a generalized Bom approxima¬ 
tion diagram with an intermediate (2,E), the consist¬ 
ency condition will be a nonnull condition, like Eq. (31) 
for irN scattering, rather than a null condition, like Eq. 
(81) for tA scattering. 

We have not studied reactions with more than two 
particles in the final state. It would be interesting, for 
example, to determine from a study of (inr2V’|/x A |7V’} 
whether PCAC implies a consistency condition in¬ 
volving the amplitudes for t+ N —> t+t+jV. 
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APPENDIX 

We derive here the equations used in the numerical 
calculations described in Sec, III. Let us consider the 
reaction ir(k)-)-N(pi) “^ *"(?)+A (/> 2 ), where the four- 
momenta of the particles are indicated in parentheses. 
We take the nucleons and the final pion to be on mass 
shell, 

~ M \ ? 2 = (Al) 

but keep k 2 arbitrary. Let k= —pi and q= —p 2 be, re¬ 
spectively, the momenta of the initial and final pion in 
the center-of-mass frame of the reaction, and let k 0} pio, 
Qo, P 20 be the center-of-mass particle energies. We 
denote by W the total center-of-mass energy 

W= &0+/>10= <?0+/>20 , 
k^{W 2 -M 2 -k 2 )/2W, 
q 0 = (W*-M*+M r *)/2W , (A2) 

p l0 = (m+AP+k 2 )/2W, 
p2o= (W*+]P-M V 2 )/2W. 

We denote by <p the center-of-mass scattering angle be¬ 
tween the final and initial pion, so that 

y=cos<p—^’k, (A3) 

where £ and k are unit vectors along the directions of the 
final and initial pion, respectively. The magnitudes | q | 
and | k | are clearly given by 

|q| = | k | — (W+* 2 )*/ 2 . (A4) 

The quantities v and vb are related to W and cos^ by 

w-wb=(W % -M*)/2M, 

^=(l/2M)[|q||k|cos<p—^o]- 

The variable co, frequently used in going to the static 
limit, is defined by 

(o=W-M. (A6) 

Let us introduce center-of-mass amplitudes }\ and / 2 
by writing 

u(p 2 )(A irN -ikB^)u(p l ) 

4irW 

= -j (A7) 

M 

where A and B* N are the covariant amplitudes used 
in the text and where and X ai are the nucleon 


spinors, (We suppress isospin structure.) The trans¬ 
formation relating the amplitudes f h / 2 to the ampli¬ 
tudes A vN , B rN is 

/i 1 A* N W~M B rN 

C(/ , 10+-^)(/ , 2o+-M^)3 I/2 2W 47T 2W 47T 

(A8) 

fa -1 A* N W+MB* N 

-=-b-. 

l(j>ia-M)(taa-M)J l * 2W 4tt 2W 4tt 

The partial wave expansion of }\ and / 2 is given by 


00 


00 


h= E UPi+i'(y)~ E fi~PU(y ), 

2=0 1=2 


2-1 

1 r 

/«-=- / dy\J 2 P l+l (y)+f 1 P l (y)1, 

2 J _i 

1 f 

fi~=- / dy\J*P^x{y)+f l P l {y)~], 

2 J-i 


(A9) 


where fi± is the amplitude for the partial wave with 
orbital angular momentum l and total angular mo¬ 
mentum J=lzL%. The symmetric isospin amplitude 
/i ± c+> is given in terms of the isotopic spin f and J 
amplitudes by 

/i± (+) = f/i± (,/2> +i/i± (,/2> . (A10) 

Finally, we need the inverse of Eq. (A8) for the ampli¬ 
tude A 

A** (W+M)fi 

4 tt C(^io+-Af) (p2Q-hM)~] 112 


( W-M)fa 

-. (All) 

l(pio-M)(p 20 -M)yi 2 

A. Equations for Threshold Subtraction 
and Static Limit 

Let us first consider the case when k 2 =—M v 2 and 
derive the equations used in the threshold subtraction 
and the static limit treatments of the dispersion rela¬ 
tions. Below the two-pion threshold, 


/i ± < 7) =exp[V5i ± t7) ] sin5i± c7 V|q| , (A12) 


where 5* ± C7) is the phase shift. The scattering length 
<Zf ± C7> is defined by 


ar± (7) = lim 

|qM> 


fj±™ 

|q 


22 


(A13) 
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Using the facts that 


cos*>= [ {2Mv B +M 2 )l |q| 2 ]+l, (A14) 

and that the leading term of Pi (y) for large y is 

P/(y)~ P(2/)!/2 i (n) 2 > , “ 1 , (A15) 

we find that, at threshold, 

L/i (+) >= £ [W 3/2) +W I/2) ] 


H-l 2 H-1]I 

X- [IMvB+Mjy, 

2 i+, [(2+l)G 2 


q ‘Jt 


]-Z 

Jt 1=1 


IA '"<+>> 


{|[ ai _(3/2)_ ai+ (3»3 

+|[ ai _a/ 2 )>_ ai+ (./ 2 )] } 

1 ( 21 )! 

X-[2 MvB+M/y-', 

2* (l!) 2 


= (l+—)[/,<+>>-2 m[—1 . 

\ 2 Mi L |q 2 Jr 


(A 16) 


When vb= 0, this is just the result stated in Eq. (41) of 
the text. 

The static limit of is easily derived. According 

to Eqs. (A9-A11), when all partial wave amplitudes 
except /a,s=/i+ C8/8) are neglected, A * rJV ’ c+) is given by 

A »jv(+) rW+M2MvB+qo 2 




pw\"M 


(W-M)(p 20 +M)-]2 
+--- -As. (A17) 

la 2 j3 


In the static limit, when vb= 0, this is 

16 M 7TCO 

A** t+ )«-/,... 

3 | q | 2 

Since in the static limit (when vb= 0 ) r«co and vo^l, 
we have 


B. Equations for Extrapolation off Mass Shell 

Now let us consider and derive the equa¬ 

tions used for going off mass shell in k 2 . According to our 
model, we wish to calculate 

ImA (v,&) = [1mA 3 , 3 ^ c+) (vfi,k 2 )/K*x* (£ 2 )] 

-Im^ 3l 3** c+) (*sO,-l) (A19) 

at k 2 =0, From Eqs. (A9-A11) and Eq. (63) of the text, 
ImAivjk 2 ) is given by 

4 tt 2 r 3 (TF+M)? 0 2 


2 r°°dv f 

- / — Im A ***{.+) (^ o, — 1) 

ir J » A v* 


32 1 

— Mir — 
3 ir 




(A18) 


4tt l r 

ImA(v,#)=-Im/ 3l3 

|q| 2 3 L 


^20+3/ 


+ Oj(/>20+3i") 


with 




(A 20) 


L= 


1 /..Ar,* 2 ) I q | 

k*= -M T 2 ) | k| 
3(W+M)q 0 ko 


L(pio+M)(pi 0 +M)Ji* 


3(W+M)q<? 

pzt)-\-M 


+<o[(/>io+ M){p m +M)J* 


+ w(^20+A^) 


(A21) 


The Born approximations are computed by substituting 
the isospin J part of the Born approximation 


= - gl ?K»»*{k 2 )/ 1 qjjk | (y+a), 


(A22) 


a=( 2 /> 2 o£o+£ 2 )/ 2 |q||k| , 

into Eq. (A8) to calculate /i B(3/2) and / 2 B(3/2) . The /=§ 
projection is then done by using Eq. (A9). The result is 

1 B {v&) |q| N 

---, (A23) 

fs,3 B (v,&= —M/) | kl N' 


N=w[_(p i i S +M)(p'm+M)-]' n A (a) 

+ (W+M)t(p l0 -M)(p i0 -M)'] ll2 C(a) , 
N'=a,{p m +M)A (a')+ {W+M){p i0 -M)C{a '), 


where 


<*'= (2pnqo— M T 2 )/2lq| 2 , 


A{a) = 1—ln[(a+l)/(a-l)], 

2 (A24) 


C(a) = — 




Consistency Conditions on the Strong Interactions 
Implied by a Partially Conserved Axial-Vector 
Current, Stephen L. Adler [Phys. Rev. 137, 
B1022 (1965)]. In Eqs. (16) and (23), 

[(Pio/M) (pio/M)2k 0 ] l/2 


L(Pio/M) (pw/M)2q o y 12 . 


should be 


152 


Adventures in Theoretical Physics 


PHYSICAL REVIEW VOLUME 139, NUMBER 6B 20 SEPTEMBER 19 6 5 

Consistency Conditions on the Strong Interactions Implied by a 
Partially Conserved Axial-Vector Current. II 

Stephen L. Adler* 

Lyman Laboratory of Physics, Harvard University, Cambridge , Massachusetts 

(Received 26 March 1965) 

Consequences of the partially conserved axial-vector current (PCAC) hypothesis are explored. A set of 
simple rules is derived which relate the matrix element for any strong interaction process with the matrix 
element for the corresponding process in which an additional zero-mass, zero-energy pion is emitted or 
absorbed. A generalization to include lowest order electromagnetic processes is given. A theorem is stated 
and proved which shows how divergence equations of the form dx/x^D are modified when a minimal 
electromagnetic interaction is switched on. 


INTRODUCTION 


I N an earlier paper 1 it was shown that the hypothesis 
of partially conserved AS=0 axial-vector current 
(PCAC) leads to consistency conditions involving 
solely the strong interactions. One of these conditions, 
relating the pion-nucleon scattering amplitude 
and the pion-nucleon coupling constant g r , was shown 
to agree with experiment to within 10%. In this note 
we give a simplified and generalized derivation of the 
consistency conditions implied by PCAC. We will 
derive a set of simple rules which relate the matrix 
element for any strong interaction or first-order electro¬ 
magnetic process with the matrix element for the 
corresponding process in which an additional zero-mass, 
zero-energy pion is emitted or absorbed. The rules are 
closely connected with the “chirality conservation” 
formulas of Nambu, Lurie, and Shrauner. 

Let us begin by recalling certain definitions from (I). 
We denote by J\ A the strangeness-conserving weak 
axial current. By partially conserved axial-vector 
current we mean the hypothesis that 


d\J\ A = 


-MU£*Mfg A *(Q) 

-—--0 r +R. 

grK***(0) 



Here Ms is the nucleon mass, M v is the pion mass, 
gA N { 0) is the /3-decay axial-vector coupling constant 
[gA ^ (0)«1,2X 10 "~ 5 /Mgr is the rationalized, re¬ 
normalized pion-nucleon coupling constant (g r 2 /4x- 
«14.6), and 0 T is the renormalized field operator which 
creates the ir+. The quantity ^^'(0) is the pionic form 
factor of the nucleon evaluated at zero virtual pion 
mass; K is normalized so that K yyv (—Mn £ ) = l. 
In order to give content to the definition, we must 
specify properties of the residual operator R. We 
suppose that for states (P(Pf)\ and | a (pi)) for which 
(fi 0, | a) 7*^0, and for momentum transfer near the one 
pion pole at — M, 2 [say, for -Mtt 2 < {pF-pi) 2 <M/] y 
the matrix element of R is much smaller than the matrix 
element of the pion operator term. In other words, we 
postulate that if (/3|0»|a)^O and if | {pF—pif\ 

* Junior Fellow, Society of Fellows. 

1 Stephen L. Adler, Phys, Rev. 137, B1022 (1965). We will 
refer to this paper as (I). 


then 

-«i. 

\ym„MSg A " (oj/gx*" (o)] |</3|0 T |a)| 



In what follows we derive equalities which hold 
rigorously if the residual operator R is zero. If R is not 
zero, but satisfies the inequality of Eq. (2), the “equals” 
signs should be replaced by “approximately equals” 
signs. 

It will be helpful to introduce a number of abbrevia¬ 
tions and definitions. We denote by k the momentum 
transfer pF—pi . Let us introduce the isotopic vector 
quantities J\ Aa , 0»° ( a= 1, 2, 3), in terms of which 

Jx A =W* M +iJ * A2 ) > <t>* = (l/^)(0.4-^ 2 ). (3) 

We denote the product g r K NNv ( 0) by gr'^O). Then 
the generalization of Eq. (1) to all three isospin compo¬ 
nents J\ Aa is (neglecting R) 


dxJx A *= - i{2M N MS gA * (0 )/g T *» (0 ))*,•. (4) 


It will be convenient to introduce an isospin notation 
for the 2 and for the E analogous to that for the nucleon 
N. We introduce isospinors and isospin column vectors 
as follows: 



By us or u s we will mean the ordinary Dirac spinor for 
the hyperon, multiplied by the appropriate isospinor 
or isospin column vector. Let r a denote the usual Pauli 
matrices, and let t Ifa J and t 2a be the matrices 
defined by 

t Xa =t Sa =T a 9 (fy 

(7) 

Then we may write the baryon matrix elements of J\ Aa 
and of J r a = (—O+3f r 2 )0, a as follows. (We omit the 
induced pseudoscalar terms in J\ Aa , since these are 
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treated separately in the derivation below. See Refs, 
4 and 6,) 


(B(p F )]J^\B(pi)) 

[Mb Mb\' 12 

= ( --) u B ( pF)gA B y\yit Ba u B (pi ), 

' pFO plO f 

{B{p F )\J x °\B{pi)) 

/MbM b \ 11 - 

= 1 -) (pF)igr XB y^ Ba UB (pi) ■ 

\pF0 pIQ/ 



Here B denotes N, 2, or 3, 

Using these definitions of the coupling constants, and 
Eq. (4), it is an easy matter to see that 

M NgA »(0) M*g A *(0) _ M z g A s(0) 

gr'*( 0) gr' Z (0) " gr'*(0) 

Equation (9) will permit us to eliminate the axial- 
vector coupling constants g A N , g A s , and g A ^ from the 
consistency conditions obtained in the next section. 


I. DERIVATION OF CONSISTENCY 
CONDITIONS 

We take the matrix element of both sides of Eq. (4) 
between states (0(/>r) out | and |a(/>r) in ), where £ and 
a are any systems of strongly interacting particles. 
This gives 

k\(0(pF) out I J x A “ l“(^r) in ) 

= QMxM , 2 gA N (Q)/gr xN m(j3(pFr at !*.“|a (pi)* n ), 


2M N g A N (0) M, 


gr* N (0) 


</3(^0°' 1 M- 7 * o l“(^) !n >. (1°) 


Let us examine what happens in the limit as >0 
(pr~* pi). The right-hand side of Eq. (10) in most 
cases approaches a finite limit, since 

lim (8 (pF)° at | •/,“ | a (j>i)' n ) (11) 

Pjr+Pj 



N P, N Pl p 2 N 

Fig. 1. The sort of situation which is excluded by the re¬ 
quirement that we avoid singularities of </3 0Ut |a l11 ). When pi* 
— (Pi-\~Qi—^) 2e = —Mn 2 , the diagram illustrated is infinite because 
the nucleon propagator joining the two bubbles is infinite. Such 
infinities can arise in general from pole diagrams contributing to 
(j3° ut la in ). (Pole diagrams are those which can be divided into two 
disconnected parts by cutting a single internal line.) We restrict 
ourselves in the text to values of the external four-momenta for 
which all pole diagrams contributing to </3° ut |« in ) are nonsingular. 



a 



a 


fi 


Fig, 2, Ways of attaching the proper vertex of J \ A , represented 
by a heavy dot. The proper vertex can be (a) attached to an 
internal line, (b) attached to a terminating external pion Kne, 
(c) attached to a nonterminating external line. 


is just the matrix element for 

a —> £+ (zero-mass, zero-energy pion), 

and is in general nonzero, 2 Thus, the matrix element 
(P(pF) out \J\ Aa W(pi) il3L ) must contain pole terms which 
go as l/k, in order that the scalar product of k with 
this matrix element have a finite limit. Clearly, if we 
can develop a simple set of rules for calculating these 
pole terms, we can calculate (0(^F) ou ]/ir a |a(/>r) in ) to 
zeroth order in k. 

Calculation of the pole terms in (S(#r) out |/x Aa | 
a(pi) iD ) turns out to be quite easy. Let us restrict 
ourselves to values of the momenta of the particles in 
a and in £ for which the matrix element (j8° ut |a !l1 ) has 
no singularities. (The sort of situation we wish to 
exclude is illustrated in Fig. 1,) The renormalized 
matrix element for (jS(^F) out | A Aa | a (#0 in ) is obtained 
as follows 3 : First we write down a complete set of 
irreducible or “skeleton” diagrams for the matrix 
element. Then we make a series of insertions in the 
skeleton diagrams. We replace each bare propagator by 
the renormalized propagator, each bare strong-interac¬ 
tion vertex by the renormalized proper strong-interac¬ 
tion vertex, and each bare vertex where J\ A acts by the 
renormalized proper vertex of J\ A .* We can divide the 
diagrams so obtained into three categories, according 
to where the proper vertex of J\ A is attached: (a) The 
proper vertex of J X A is attached to an internal line 
[Tig. 2(a)]; (b) the proper vertex of J\ A is attached to 
an external pion line which terminates [Tig. 2(b )]; 
(c) the proper vertex of J\ A is attached to an external 
line which does not terminate [Tig. 2(c)]. 

* Note that the value of the limit depends in general on the 
direction in which k approaches zero. 

8 Let us review some definitions. The skeleton of a diagram is 
obtained by replacing all vertex parts by bare vertices and by 
omitting all self-energy parts from the propagators, so that only 
bare propagators appear. An irreducible or “skeleton" diagram is 
a diagram which is identical with its own skeleton. A proper 
vertex diagram is one which cannot be divided into two discon¬ 
nected diagrams by cutting a single internal Kne. 

4 Note that the dominant part of the induced pseudoscalar 
coupling arises from the diagrams which give the one-pion pole 
term in dispersion theory. These diagrams are improper when 
considered as baryon-/*-' 1 vertices, and thus are not included in 
the proper baryon vertices of J\ A . 


154 


Adventures in Theoretical Physics 


B1640 


STEPHEN L. ADLER 


Corresponding to this division, wc can write 

(P(pF)° Ut \ k\J\ Aa | CL (pi) !n ) 

= <fi(pF)° at | hJx Aa | Ot(pl) in ) INT 
+<0 (pF)° at I kxJx Aa | at ipi ) in ) PI0N 

+ (8(/>r) out |^xA A ‘ I |a(/>/) in ) EXT . (12) 

We now analyze in turn the contribution of each of 
the terms in Eq. (12): 

(a) First let us consider the case where the proper 
vertex of J\ A is attached to an internal line. Each 
diagram contributing to (/?(/>r) out |/x A |a(/>/) in ) 1NT cor¬ 
responds to a diagram for (j8° ut |a l11 ), but has an addi¬ 
tional internal propagator. The requirement that 
</3° ut |a ,B ) be nonsingular means that all internal 
momenta are either integrated over or are off the mass 
shell. Thus the additional propagator cannot give rise 
to an infinity as k —► 0, and we conclude that {/3(^F) 0Ut | 
k>Jx A \<x(Piy»)™'* is of order k. h 

(b) The sum of all diagrams where the proper 
vertex of J\ A is attached to a terminating external pion 
line is proportional to 

<fi(tr) n V*'W(Piy m )lV(P+M r 2 )]<T c |A Aa |0). (13) 

Using Eq. (4) to evaluate <?r e |/x AB |0) gives the result 


-k> 2Mxg A N (0) 


k 2 +M r 2 gs*( 0) 


<P(tr) wt \Js\a(ti) in )- ( 14 ) 


This is of order k 2 and may be neglected. 6 

(c) We next consider diagrams where the proper 
vertex of J X A is attached to a nonterminating external 
line. (We restrict ourselves to external lines of particles 
in the pseudoscalar meson or baryon octets.) These 


diagrams may be divided into two types, according to 
whether J\ A changes or does not change the mass of 
the external particle. 7 The only case where the mass is 
changed is that where J\ A changes an external 2 to a A 
or an external A to a 2. Both of these cases make a 
contribution to {/3(/>F) out |&xA Aa |a(^r) iri ) EXT which is 
of order k t since the propagator which follows the 
proper vertex of J\ A behaves as (Mz*—M a 2 ) -:1 as 
k —> 0, and thus is nonsingular. Finally, we will show 
that the diagrams where J\ A is attached to a non- 
terminating external line, and does not change the mass, 
are of order kr l . Insertion of J\ A into a pseudoscalar 
meson line is forbidden by parity; insertion of J\ A 
into a A line is forbidden by isospin. Thus, we need only 
consider insertions of J\ A into external N, 2, and 2 
lines. The contribution of the insertion of J\ A into the 
line of a final baryon B of four-momentum ps is 

(—) UB{pu)gA B y\yst Ba -;—sm. (15) 

'pBQ' pB — ^—%M B 

Here 9TC is the matrix element for the process a —> 0, 
with the final baryon B virtual. Since ps 2 = — Mb 2 , 
the propagator can be written as 

1 pB—k+iM B 

- ; -=-, (16) 

pB — k — iM B — 2 pB'k-]-k 2 

showing that there is indeed a singularity as k —► 0. To 
lowest order in k t we can neglect k in calculating 3TC 
and can retain only the term of order k~~ l in Eq. (16). 
Thus, the insertion becomes 

/Mb\ 112 Pb+IMb 

(—) &B(pB)gA B y\yit Ba -9TC(£=0). (17) 

\ PbO* —2 pB'k 


& We assume, of course, that none of the proper vertices of 
have a singularity as k —> 0. 

6 These diagrams form the dominant part of the induced pseudo* 
scalar coupling. A statement much stronger than that they are 
of order k 2 can be made. Referring to Eq. (10), we note that the 
right-hand side may be written 

(2 Af)[1 —J8»/ (£*+]09(/»| S -) out I j) *“>- 

The part of this proportional to jfe*/(jfe*+Af r *) exactly cancels the 
contribution, given by Eq. (14), of the diagrams where J\ A is 
attached to a terminating external pion line. Now &/ {k l +M S) 
has the property 

lim lim 1, 

Jb—*0 Afx 2 —»0 

lim lim k 2 / (k^+Mr 3 ) *=0, 

M r 2 —*4 k —*0 

whereas the terms in Eq. (12) labeled INT and EXT are in¬ 
dependent of the order of the limiting operations: 
lim lim 

^^0 M *0 

= lim lim (0 (Pf)° ut | k\J\ Aa \a{pi ) in ) lNT -ext 

Aft *—*0 *-*0 

Hence the exact cancellation of terms proportional to k 2 /(k 2 -\~M r i ) 
means that the limit, as M,* —> 0, oi the consistency conditions 
of Eq. (24) is identical with the consistency conditions which 
would be obtained in a theory in which the pion mass was set 
equal to zero at the outset. Note that by virtue of Eq. (4), in 
such a theory the axial-vector current would be exactly conserved. 


Calculating SfTC with £ — 0 means that we keep the final 
baryon B on the mass shell. Furthermore, b is 

just the positive frequency projection operator for B , 
with the property 

(ps-\-iM b)Pb— {pB~\-iM s)iM b- (18) 

Let us denote by 3TC C the matrix element obtained by 
bringing all ps in 31X(& — 0) to the left and replacing them 
by iM b. Then the insertion becomes, finally, 

Mb \ 1!2 ps+iMs 

—) ub (pB)gA B y\yst Ba -src c . (19) 

pBO* —2 pB'k 

The crucial point is that 

(p**\a^) = 8fr+(2T)ty(pr-pi)m(a^P), (20a) 
/Mb\^ 2 

-m(a ff) = f-) Hb(Pb)^I c (20b) 

_ \pB0* 

7 We are neglecting the electromagnetic interactions, so all 
particles in the same isospin multiplet are of equal mass. 
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is just the matrix element which describes the strong 
process a —► 0, with all particles on the mass sheU. Thus, 
Snt c can be measured experimentally. Similar arguments 
show that the insertion of J\ A into an initial baryon 
line gives 

/M b \ 112 ps^-iMs 

—) sr«- gA B y\ybt Ba UB(ps), (21) 

\pBQt 2 pB’h 

with 

-m(a -► ff) = (M B /pBoy ,2 Wl' e UB(pB ). (22) 

To sum up, we have analyzed the behavior of each 
of the terms in Eq. (12). Let us collect the results 
and write 

(2Jtf*g A (0) )<^ 

= 0(k)+0(k 2 ) 

+ 51 [insertions in—t9H(a—>jS)]+0(^). (23) 

external 

lines 

The three terms on the right-hand side of Eq. (23) refer, 
respectively, to the internal line, the terminating 
external pion line, and the nonterminating external 
line insertions of J\ A . Multiplying through by g r rN (0)/ 
[2jlftfgA W (0)] and using Eq. (9) to eliminate the ratios 
gA 2 (0)/gA W (0) and gA S (ty/gA N (0) in terms of strong- 
interaction coupling constants leads to the following 
set of rules: 

&(pr) m *\JS\<*(PzV n ) 

= 0(k)+ 51 [insertions in—(a—>/?)]. (24) 

external 

lines 

Insertions 

For external t, K , tj, A, the insertion is zero. For 
external N, 2, E, denoted by B , the insertions are 


final B: 

ub(Pb) 
initial B: 
ub(Pb) - 


&b(Pb) 


[ 


Sr T *( 0 ) ypB+iM B 


■ky h t Ba - 


2 Mb -J — 2p B 'k 

p B +iM B rgr irB { 0) "1 

- - hy4 Ba \u B (pB ). 

2p B -k L 2M b J 


(25a) 

(25b) 


These rules are the generalization to arbitrary processes 
of the consistency conditions derived in (I). It is an 
interesting fact that these rules are just what would be 
obtained if the effective pion-baryon coupling for pions 
with four-momentum near zero were pseudovector 
rather than pseudoscalar. This intimate connection 
between PCAC and gradient coupling theories was first 
noted by Feynman. 8 

As an illustration of the above rules, let us consider a 
special case. Let a be a single nucleon of four-momentum 

8 R. P. Feynman, Proceedings of the Aix-en-Provence Inter¬ 
national Conference on Elementary Particles (Centre d’Etudes 
Nud6aires de Saclay, Seine et Oise, 1961), Vol. II, p. 210. I am 
very grateful to Dr. M. Veltman for calling my attention to this 
reference and for emphasizing the connection between PCAC and 
gradient coupling of the pion. 


pi and any number of pions; similarly, let 0 be a single 
nucleon of four-momentum p% and any number of 
pions. Then we may write 


Sfll(a —»j3) = (M tf 2 / piopw) 1,2 UN (p$$Slu>H (pf ). 
According to the rules derived above, 

(P(pF) out \ J r a \a(pi) in ) 


(26) 


/ M$ 2 \ 1/2 ([-£/* 

— — [ -j i«w(/>2)| - 

\^in^2o/ l L. 2ft£fj 


( 0 ) 


X 


piopio 

\~p2+iM N ~] [ pi+iMx~] 

- \m+m\ -- 

L — 2p r k J L 2pi m k J 




X 


[ 


2 P 
gS N ( 0) 
2 M n 


kytT° |w(/»i). (27) 


It is easiLy seen that Eq. (27) is equivalent to the 
“chirality conservation” formula obtained by Nambu 
and Lurie® in a theory in which the pion mass is zero 
and in which the axial-vector current is exactly con¬ 
served. 6 Nambu and Shrauner 10 and Shrauner 11 applied 
Eq. (27) to the case when a, and found possible 

consistency with experiment. A simpler case was studied 
in (I), where we took a=7V, $ — ir b +N. In this case9)1 
is just the pion-nudeon vertex ig r T b y 5 and {(7rW) out | 
J x fl | A ,n ) is the pion-nudeon scattering amplitude. 
Introducing the usual pion-nucleon scattering-energy 
and momentum-transfer variables v and v B) 

pi’k= —M n (v — vb) , 

p2-k= —Mn(v+ vb ), 
we get from Eq. (27) 


(28) 


X 


gr gr 2 f tV 

- &ab~~ik - - 

Mn 2Mn\-vb— 


M n 2 x 1 ' 2 

plQp2Q 
2 r T b^ 


/ M fj* 

( (x»iV)«‘ | Jr a | .V in ) = (- ) K” n *(0)u n (p 2 ) 

'PlQpiQ/ 


PB~\-V J 


u N (pi). (29) 


The term (g r 2 /M ^ )5 a & leads to the consistency condition 
A* n <+Hv*=Q, v b = 0, k 2 = 0) g 2 


K N! **( 0) 

which was discussed in detail in (I). 


M 


(30) 


N 


II. MODIFICATION IN THE PRESENCE OF 
THE ELECTROMAGNETIC INTERACTIONS 

It is interesting to see how the rules derived above are 
modified when the electromagnetic interactions are 
taken into account. Since isotopic spin is not a good 

*Y. Nambu and D. Luri6, Phys. RevA125, 1429 (1962); Y. 
Nambu and E. Shrauner, ibid . 128, 862 (1962). 
w Y. Nambu and E. Shrauner, Ref. 9. 

11 E. Shrauner, Phys. Rev. 131, 1847 (1963). 
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quantum number in the presence of electromagnetism, 
we will work only with fields and currents with definite 
charge transformation properties. Thus, we replace the 
three equations contained in Eq. (4) by the equations 

dx.V (±) = C0 T <±>, 

dxA^ o) =^C0/ o >, (31) 

where 

Jx A (±) = i (Jx A1 =F iJx A2 ), Jx A <°>= Jx A3 ; 

^* (±) = a/V2)(^^»), ^T C0) = ^ir 3 ; 

(32) 

grK» N '( 0) 


[The superscript (=t) refers to the charge destroyed.] 
It is shown in the Appendix that to first order in the 
electric charge e (e>0), the modification of Eqs. (31) 
in the presence of the electromagnetic interactions is 

(5x=Fi^x)/x a(±) = C^ t ^, 

5x/x a(o) = ^C 0 t W. (33) 

As is customary, A\ denotes the electromagnetic field. 
Since all electromagnetic corrections to masses and 
coupling constants are of second order in e , questions 
such as whether to use the charged or neutral pion mass 
in computing C do not arise. 

Equations (33) permit us to state a ample set of 
rules for computing (up to terms linear in the four- 
momentum of the added pion) the matrix elements 
( 0 out |/ T c± °) | (or)* 11 ), where a and 0 are any systems of 
strongly interacting particles and where the initial 
photon 7 may be real or virtual. The terms d\J\ A(±0) 
in Eqs. (33) give rise to insertions into the external 
baryon lines of— tSflI(ory-Hidentical with those of 
Eq. (25), apart from trivial changes in the isospin 
factors arising from the use of fields and currents of 
definite charge. In addition, we must add to (/3 0Ut | / r (±) | 
(ary) 1 Q ) the term 


±egr wN (Q) 

^M N gA N ( 0) 


<fi >ut \A x Jx M±) \(ay) in ) 



and Shrauner , 9 who also discuss a detailed application 
to the reaction e-\-N —» e+iV+ 7 r. 
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APPENDIX 

We give here a fairly general treatment of the way 
in which divergence equations of the form 

dxJx=D (Al) 

are modified in the presence of electromagnetic interac¬ 
tions. We state the result in the form of a theorem . 12 

Theorem. Let be the unrenormalized fields of 
particles of charge ej. Let us consider a strong-interac¬ 
tion theory with the Lagrangian £[{^}, {d *$}'], where 
{fa denotes the set of the Let J\ be a current with 
definite charge transformation properties (charge ej) 
derived by making an infinitesimal gauge transforma¬ 
tion on the fields ^ in the following manner 13 : 

*/=lM-A iW}], 

£-»£'=£[{*'}, {3.*'}:, (A2) 

J \ = [5 £' f /8 (5xA)]a=o* 

Then, 

(1) In the absence of electromagnetic interactions 
the current J\ satisfies 

dxJx=D, (A3) 

with J\ and D both functions of the ^7 and the d 9 \f/j only: 

jx=jxzm,{W)i, 

D = DZW ,{ W ) 1 - (A4) 

(2) Inclusion of the electromagnetic interactions, 

with minimal electromagnetic coupling, changes Eqs. 
(A3) and (A4) to 

(a x -tejA x )/x[{^}, {*-„}]= {*-„}], (A5) 


arising from the term in Eq. (33). Using the 

standard reduction formulas, we find to lowest order 
in e that 

<F' % \AxW* M±) <y)\(«r) lu ) 

exp ( ik' • y ) 

=-(^ out |ex/x Af±) W|a in > ) (35) 

(2£ 0 ') 1/2 

where k' is the four-momentum and ex the polarization 
four-vector of the photon 7 . Equations (33), (34), and 
(35) allow us to calculate the matrix element for the 
emission of a zero-energy, zero-mass pion in photo- and 
electroproduction reactions. They are equivalent to 
the formalism derived for this purpose by Nambu 


where denotes the quantity (d 9 —iejA 9 )$j. 

Proof . We proceed as if the fields were classical 
quantities, ignoring questions of commutation and 
anticommutation. Let us first consider the case when 
there are no electromagnetic interactions. The Lagrange 
equation of motion for the field fa is 

5£ $£ 

—= d 9 -. (A 6 ) 

fyj 8(d 9 fa) 

Under the gauge transformation 
_ fa -+ (A7) 

12 1 am grateful to Professor S. Coleman for assistance in proving 
the theorem. 

,a M. Gell-Mann and M. L6vy, Nuovo Cimento 16, 705 (1960). 
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the derivatives and the Lagrangian £ change 
according to 

» d*ylt/ = djfo+ (d 9 A)Fj+A(d 0 Fj ), 

£^£'=£[{^'},{3^'}] 

= £[{^+AF},{3^+(d ff A)F 

+A(3,F)>]. (A 8 ) 


From Eq. (A 8 ) we find for the first variations, 


5£' 

5A 




«£' 

5(d x A) 



«£' 


5(d x ^/) 



(A9) 


Eq. (A 8 ) also implies that 

r5£'n S£ r S£' n S£ 
Ls^/Ja-q - * L(3x*/)Ja-o «(3xlM 


(A10) 


Together, Eqs. (A 6 ), (A9), and (AlO) imply that 


The first variations are 


5£ em ' r5£ EM ' 5£ EM ' 1 


5A 3 L 8 \f// 

S£ EM ' rS£ EM ' "1 

--=E - t f S • 

6 (dxA) 3 L 6 it;x J 


for// 


(A17) 


Using the Lagrange equation, Eq. (A15), we see that 


|-S£ em '-| r5£ EM ‘~| 

, L 5A J 


1_5(dxA) J 


A=0 


(A18) 


A=0 


Let ns make use of the fact that the current J\ has 
definite charge transformation properties. Since 5£/ 
5(3x^) transforms as a field with charge —ej, Eqs, (A9) 
and (A12) tell us that Fj must transform as a field 
with charge Thus, 

Fj[\p i exp(«i/), $2 exp(ie 2 t), * - •] 

= expp(ej+ej)/]F£^i,^ 2) - * (A19) 



We define 

/x— p£V5 (dxA)] a=o , 

D=l 5£75A]a»o; (A 12) 

these are clearly functions only of the {^} and the 

Let us now turn on the electromagnetic interactions. 
According to the hypothesis of minimal electromagnetic 
coupling, the Lagrangian is modified according to 

£-^£EM =(£ [- Wj{7rff J-] +£ EMO j ( A13 ) 

where £ EM0 is the kinetic Lagrangian of the electro¬ 
magnetic field A„ and where 717 , is {d 9 —iejA 9 )\l/j. The 
new Lagrange equation for the field tfv is 

3 ff (5£ EM /5(d^;)) = 5£ EM /5»A;- (A14) 

Let us henceforth treat 1/7 and tj 9j rather than \frj and 
Wi, as the independent variables in taking the varia¬ 
tion of £ EM . Then the Lagrange equation becomes 

5£ EM 5£ em 5£ em 
d 9 --- iejA, -. (A15) 

for/, for/. 

Now let us make the gauge transformation \f/j— 
=^ 7 +A Fj. The quantity 7 r J9 and the Lagrangian £ EM 
change according to 

7Ty ff —> Tj 9 r =T$ 9 —iejA 9 AFj-\r (d ff A)F,+A (d 9 Fj) , 

£ EM^ £ EM' = { T ,'}]+JEEM0 

= £f{^+AF}, {w 9 —ieA 9 AF 

+ (d ff A)F+A(3 <r F)}]+£ EM0 . (A 16) 


Taking the first derivative with respect to t gives the 
identity 

^lijiF +ej)Fj. (A20) 

Consequently, using d 9 Fj=^i( 8 Fj/ 8 ipi)d 9 \pi, we obtain 

d 9 Ff—i +cj)ri 9 Fj = (8F;/8\p {) (d 9 —ieiA 9 )ifa 

= H.i{S F 3/fyi)niv (A21) 

In other words, d 9 Fj—i(ej-\-ej)A 9 Fj is the same func¬ 
tion of {&}, {tt 9 } as d 9 Fj is of {^}, Hence, by 

comparison of Eq. (A17) with Eq. (A9) it is clear that 

Pjb«7«(3*a)]a^=/xW}, W], 

Z;{p£ EM 75^/] A „oF y +[5£ EM 757r yff '] A „o 

Xld 9 F-i{e^ej)A 9 Fi]} = Dim, (A22) 

Thus, Eq. (A18) can be rewritten as 

(dx-^x)ACW, {t,}]. (A23) 

This completes the proof. 

Equation (A23) involves unrenormalized quantities 
throughout and is exact . In the case of PCAC, as con¬ 
sidered in the text, D=C u (f> r , where the superscript on 
C u denotes that it is unrenormalized. It is trivial to 
pass from Eq. (A23) to Eq. (33) of the text, which 
involves only renormalized quantities, if we work to 
lowest order in the electromagnetic coupling e: All 
electromagnetic renormalization effects are of second 
order in e and may be neglected. All strong interaction 
renormalization effects are contained in the ratio C/C v , 
where C is the renormalized constant appearing in 
Eq. (32) of the text. 
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Excerpt from S. L. Adler and R. F. Dashen, Current Algebras and Applications to Par- 
tide Physics (W. A. Benjamin, New York, 1968). Reprinted by permission of Pearson 
Education, Inc. publishing as Pearson Addison Wesley. 


APPENDIX A 

We give here a detailed discussion of the chirality 
method for deriving pion low energy theorems. We first 
consider a partially-conserved axial-vector current; we 
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then briefly show why the massless pion, conserved axial- 
vector current case considered by Nambu and Lurie 
[Paper 5] gives the same answers. We recall that the 
“chirality” x(t) is defined by 

X.(t) = /d 3 x[$F f° (x) + iff|° (x)] 

= F? (t) + iFf (t) (A.l) 

and from Eqs. (1.93) and (1.100), its time derivative is 


dx(t) 

dt 


g r ( 0 ) J V 


^n m Ua 

g r ( 0 ) 


f d 3 x$ 

J 7 ,- 


out 


If we define \ UUL and \ in by 

X 0ut = lim x(t), x in = lim x(t) 

t— ►oo t— ►- 0 O 


(A.2) 


(A.3) 


integration of Eq. (A.2) from — °° to °° gives 

X°ut - x in - [V3TM N g A /g r (0)] /d 4 xM^77- 

- [V2~M n g A /g r ( 0 )] 

x fd 4 x(n 2 x + Mj)* w - 

- [V2"M N g A /g r ( 0 )] J d 4 x J 77 - (A.4) 

(We have used the fact that f d 4 x = 0.) Taking the 

matrix element of Eq. (A.4) between states (j3(q 2 )| and 
I a (q,)) we get* 


* Again, we suppress the labels “out” and “in” on the states. 
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<(3(q 2 )U OUt -X in |a(q 1 )) = [V 2 T%g A /g r ( 0 )] 

x J d 4 x qi )‘ x 

X 0 (q s ) I Jtt- (0)1 or (q, )) 

= [V2M N g A /g r (0)] 

x (2tt) 4 5 4 (q 2 - qj) 
x <j3(q a )| J w _(0)|a(q 1 )) (A.5) 

Clearly, the right hand side of Eq. (A.5) is the matrix ele¬ 
ment for the emission of a pion of zero four-momentum in 
the process a —- j3. We will now show that the left hand 
side of Eq. (A.5) can be expressed in terms of the S-matrix 
element (j3(q 2 ) | a(q,)) describing the process in the ab¬ 
sence of the soft pion. 

To see this we must determine the effect of \ ou ^ in Eq. 
(A.5). (The discussion for x in will be similar.) We know 

that (i) x° ut is the space integral of a local operator, 

(ii) x° ut is time independent [oscillatory terms in x(t) 
vanish in the limit as t — °°]. Let us suppose that x ou *- 
can be written in the form 

X° Ut = Jd 3 x(P[{$ out }] (A.6) 

where (P is a (possibly infinite) polynomial in the “out” 
fields of all particles present. Consider a term in (P 
coming from the product of N “out” fields. It has the 
time dependence exp(-ifit), with 

N 

n = E (tf + M? ) m (A.7) 

j=l J 


with ej - ±1 and with the momenta Pj constrained by the 


x integration to satisfy 
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N 

Z Pi = o (A.8) 

j=l 

If no zero mass particles are present, it is easy to show 
that the constraint of Eq. (A.8) implies that S2 vanishes 
identically if and only if N = 2, M x = M 2 and e 1 = — e 2 . 

In other words, time independence requires that x ou *- be a 
bilinear expression in the “out” fields and their adjoints 
of the form 



+ Z /d 3 x$? ut(_) (x)t 0 H$o ut( - ) (x) (A 9) 
j ] J J 

The superscripts (+), (-) indicate respectively the positive, 
negative frequency parts of the “out” fields; the sum ex¬ 
tends over all stable particles. 

The matrices are determined by (iii) Lorentz co- 

variance and parity (x° ut is an axial-vector charge), 

(iv) isotopic spin (x° u ^ is the isospin raising member of 
an isotopic triplet) and (v) the asymptotic definition of x ou *- 
as lim x(t). For example, (iii) and (iv) require that the 

t —►°° , 

nucleon term in x have the form 

(+) r _ r OUt(+)/ x o ,OUt(+)/ x 

g v ' J d 3 x? (x) r y 5 tp n '(X) 

= /d*x^ +) WT t / yi ^ +) W (A.10) 

To determine g^ we use (v), which states that 

lim <p(q 2 ) | \(t) | n(q,)) = <p(q 2 ) | X° Ut I n(q,)) 

t—OO 


(A.11) 
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The matrix element (p(q 2 ) | x.(t) | n(qj)) is actually time in¬ 
dependent, because the n and p have equal mass; using 
Eq. (1.92) to evaluate it gives for the left hand side of Eq. 
(A.11) 


/ m nY - 

(qfq?) U P (q2)g A y ° r 5 u n (( li )(27r)3 63 (c *2 “ O ( A12 ) 

while Eq. (A.10) implies that the right hand side of Eq. 
(A.ll) is 


M 


1/2 


N 


q°q? 


u p (q 2 )g 


(+) 


y°y 5 u n (q, )( 2 tt) 3 6 3 (q 2 - q,) 


(A.13) 


Hence g( + ) = g^ , and 


x out = BAJd-xf^WT.r-y^'Ww 

+ negative frequency part 
+ terms for other particles (A.14) 


Clearly, the effect of x out on an outgoing particle is to 
give back the same particle, with the same momentum, but 
with changed spin and isotopic spin. The expression for x. in 
is obtained by replacing all labels “out'’ in Eq. (A.14) by 
"'‘in.” 

Using Eq. (A.14), we can evaluate the matrix element 

<0(q 2 ) I X 0Ut I afai )) in terms of </3(q 2 ) | a (q,)). Eq. (A.14) 
instructs us to form a sum over “insertions” in the lines 
of all outgoing particles in (j3(q z ) |. If we write (j3 | = (£,... 
the contribution of the particle £ is 
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[<0 I x out k>] 




= [<?, • • • I X 0Ut I a)] 


s 

spin, isospin 
of 


J 


d 3 q^, 

"(27t)3 


x <Clx out k , XC , ,...|a) 

and the total matrix element is 

03 I X° Ut I a ) = Zj [<3 I X° ut I ot )] 

£e|3 


(A.15) 




Let us illustrate with the case of a final nucleon line, 
write 


(A.16) 


We 


0 (q 2 ) I O' (q x )) = 6p + (277) 4 i 6 4 (q 2 - q, )T(a - 0) 


T (a - (3) = 


M 


1/2 


N 


i° 

•N 


* U N ^ 


(A.17) 


Then the contribution of the final nucleon N to 


0 (q 2 ) I x out I«(q»)) 


is 


* 


(2tt) 4 i 5 4 (q 2 - q,) 


M v 1/2 
_N 


iu N (^n )g A T +r°r» 


X 



+ m n 



3U 


(A.18) 


*To compare Eq. (A.18) with Paper 6, which uses the Pauli 
metric, replace inEq. (A.18) by — i^ N . See the introductory 
remarks about metrics. 
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that is, the effect of x out is to cause the insertion 

+ M N 


U N ( q N ^ U N ( q N ) g A T + y° ( 2 q^ 


(A.19) 


in Eq. (A. 17). Comparing with Eq. (A.5), we see that the 
contribution of the final nucleon N to 0 | | a) is 



U N ( q N> 


g r (0) 

V2M n 





+ M 

N 



o 

N 


arc (A. 20 ) 


The total expression for (fi | J n _ j a ) will be a sum of terms 
like Eq. (A.20) for all outgoing particles in (j3 | and all in¬ 
coming particles in | a). 

Now let us briefly discuss why Nambu and Luri£ get the 
same insertion rules in the massless pion case. According 
to Eq. (A.2), when - 0 the chirality is conserved, 

dx M n = 0 

-^T— = 0 (A.21) 


out 


in 


_ „ = 0. So the term 


which implies that x^ - ' = 0 = 0 

proportional to / d 3 x in Eq. (A.4) is no longer present. 
However, when the pion is massless, the asymptotic chi¬ 
rality x° ut , in addition to containing the bilinear terms of 
Eq. (A.9), will also have a time independent term propor¬ 
tional to / d 3 x(9/9x°) •f’Out (x). This term expresses the 


fact that emission of a zero four-momentum pion is a 
physical process, not a virtual process, when = 0. 
[The (9/9x°) is necessary in order to have the space in¬ 
tegral of the time component of an axial-vector.] It is 
easy to verify that 


out _ out V2 M n g^ 

*M^=0 ^bilinear g r (0) 

x fd 3 x(d/dx° ) (x) 


(A.22) 
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and similarly for \ 
implies that 


in 

M^O • 


Hence 


out _ in 
=0 =0 


< /3 (q 2 ) | x 


out 

bilinear 


in 

* bilinear 


O' fai )) 


0 


= [V2M N g A /g r (0)] 

x [<P(q 2 ) I J d 3 x(9/9x°) | a (q, )) 

-0(q 2 )l Jd 3 x(9/9x0)$j r n (x)|ff(q 1 )>] (A.23) 

The matrix elements on the right hand side of Eq. (A.23) 
are easily evaluated, 

~<P(q 2 ) I fd 3 x(d/dx°)$™(x) |ff(q,)> 

= (P (q 2 ) I /d 3 x(9/9x° ) (x) | or (q,)) 

= [(0]/d 3 x(9/9x°)^ t (x)|,-(q))] 

x { <tt- (q)/3(q 2 ) |a(q, )>} 

= ^(f^ [(27r)3 6 3 (q)(2q°)- 1/2 (-iq 0 )] 

x {(2 tt) 4 i S 4 (q 2 + q - q,)(2q°)-»« 

x 0(q 2 ) I J^- ( 0 ) | «(q f ))} 

= r( 27r ) 4 6 4 (q 2 - q,)</3(q 2 )| J 7r _(0)|o?(q 1 )> (A.24) 


so that Eq. (A.23) becomes 

(P(q z ) I x. bilinear - x. bilinear I «(q 2 )> 

= [V2'M N g A /g r (0)](2 tt) 4 

x 6 4 (q 2 - q, )0(q 2 ) I J w _ (0) | a (q,)) (A.25) 

This leads to the same insertion rules as were obtained 
from Eq. (A.5). 
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CALCULATION OF THE AXIAL-VECTOR COUPLING CONSTANT RENORMALIZATION IN /3 DECAY 

Stephen L. Adler * 

Lyman Laboratory of Physics, Harvard University, Cambridge, Massachusetts 

(Received 17 May 1965) 


1. Introduction . -We have derived a sum rule 
expressing the axial-vector coupling-constant 
renormalization in & decay in terms of off-mass- 
shell pion-proton total cross sections. This 
Letter briefly describes the derivation and gives 
the numerical results, which agree to within 
five percent with experiment. Full details will 
be published elsewhere. 

The calculation is based on the following as¬ 
sumptions: 

(A) The hadronic current responsible for AS 
= 0 leptonic decays is 


. „ /T VI V2 . A1 . _ A2. 

W J A +lJ \ +J X +lJ \ 


( 1 ) 


where Gy is the Fermi coupling constant (Gy 
= 1.02X 10Here J x Va = :4> N Y X 

•“* is the vector current, which we as¬ 
sume to be the same as the isospin current, 2 
and •••: is the axial-vec¬ 

tor current. Since the vector current is con¬ 
served, the vector coupling constant is unre¬ 
normalized. The renormalized axial-vector 
coupling constant^ is defined by 


A 

-- +g A r x y 5 )T+U N {<}) - (2) 


(B) The axial-vector current is partially con¬ 
served (PC AC), 3 


0 J 
X X 


A -iM \1 2 g . 

Act N 7T A a 


g r K NNir (0) 


<P 


7T 


(3) 


where^ r is the rationalized, renormalized 
pion-nucleon coupling constant (g r 2 / 47T = 14.6), 
kNNtt(Q) i S the pionic form factor of the nucle¬ 
on, normalized so that = 1, and 

(p^ is the renormalized pion field. According 
to Eq. (3), the chiralities x ± V) = fd^x(J^ ± iJ^) 
satisfy 


d 

~~X 

dt 


± 



v 5 j V f .'^ 

fn" m m 


f cf* x (p 




(C) The axial-vector current satisfies the 
equal-time commutation relations 


c Aa. Ab 

l J 4 



. abc T Vc. x 

v _ v =fi ( x -y) |€ J a (*)• 
'' 0-0 ^ 


This implies that the chiralities satisfy 

\x + (t),x~(t)] = 2I 3 , 


(5) 

( 6 ) 


where Z 3 is the third component of the isotopic 
spin. 


Copyright© 1965 by the American Physical Society. Reprinted with permission. 
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The assumptions (A) are the usual ones for 
the leptonic decays. The additional hypotheses 
(B) and (C) are both necessary to obtain the sum 
rule for gj±. The hypotheses (A)-(C) are mutual¬ 
ly consistent, in the sense that there is a renor- 
malizable field theory (the a model of Gell-Mann 
and Levy 4 ) in which they are exactly satisfied. 

2. Sum rule . -There are two essentially equiv¬ 
alent ways to derive the sum rule for # 4 . The 


first is to use a method proposed recently by 
Fubini and Furlan . 5 We take the matrix element 
of Eq. ( 6 ) between single proton states (p(q)\ 
and I p(q')). The right-hand side gives 

(p(q)\2P\p(q')) = (277) 3 6(q-q'). (7) 

In the matrix element of the commutator we 
insert a complete set of intermediate states, 
separating out the one-nucleon term (to which 
only the neutron contributes): 


</>(< 7 )l[x + ( 0 ,X ( 0 ]l/>(< 7 ')> 

* spin J y 1 j*N ) 


The one-neutron term is easily evaluated using Eq. (2), giving 

(2ir) 3 6(q-q’)g A 2 (l-M N 2 /q Q 2 ). 

In the summation over higher intermediate states we make use of Eq. (4), giving 


•J2MM V/ 2 
N it A 

i **""(»>. 


i 

£ 

j*N 


(p(q)\l<Px(p +\j)(j\!d 3 x(p _l p(q*)) 

7T 7T 


-(7T —7T ). 




(9) 


( 10 ) 


From Eqs. (9) and (10), we see that there is a family of sum rules, with q 0 as a parameter. In the 
limit as q 0 approaches infinity, a sum rule for 1 -g^~ 2 is obtained. Let us assume that the limiting 
operation can be taken inside the sum over intermediate states in Eq. (10). It is useful to write this 
sum in the form 

( 11 ) 

INT 



where Qj is the total momentum and where "INT” denotes the internal variables of the system j. The 
invariant mass of the system j is Mj. The integrations over x and qj can be done explicitly, giving a 
factor (2?r) 3 6(q-q'), and constraining q j to be equal to q. Let us write 

I m n m Y 2 

<;i<p 7 r ±( 0 )i/>te)> = (—^-J f*, ( 12 ) 


so that Fj ± is a Lorentz scalar. Then using the facts that q j0 = (<? 0 2 +Mj 2 -Mrf) lf2 and (q^-q^)~ 2 = too 
+ qjtif N 2 ) 2 , the limit of Eq. (10) becomes 






r 



2 

(277) 3 6(q-q') 




dWM N W 

(w 2 -m n 2 ) 2 


k 0 + te 0 2 + M / 2 -M Ar 2 ) I/2 ] 2 


lim { K-[W , (q- q f}-K+[W, fo-*.) 2 ]}, 

q 0 ~ x ‘ 7 1 
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with 

A' ± [W' ( (<7-<7.) 1 2 ]= £ *\F ± l 2 . 

7 i*N 1 * 7 

INT 


(13b) 


The limit of the quantity in boldface curly brackets is 4, and the limit of the momentum transfer (q 
-qj) 2 =-[q 0 -(q 0 2+ Mj 2 -Mtf) V2 ] 2 is 0. It is easy to see that 0) is equal to [(^-M n 2 )/(2ttM j^)) 

xc‘ 0 ± (W r ), where c , 0 ± (W') is the total cross section for scattering of a zero-mass 7r ± on a proton at cen- 
ter-of-mass energy W. Thus we get the simple and exact result 


1 


1- 





WdW 


M +M W 2 ~M 
N 7r 


[o 0 + (W)-o 0 -(W)l 


N 


(14) 


Here o 0 ± (W) is the total cross section for scattering of a zero-mass tt* on a proton, at center-of- 
mass energy W . 

The second method of getting the sum rule parallels the derivation from PCAC of a consistency 
condition on pion-nucleon scattering. 6 Using the identity 


(dM)<AriT[x a (Ox 6 (0)]lN) =<An[x a «),x 6 (0)]6(OW> + <^l7’[(rfM)x a «)x 6 (0)]W>, (15) 


and hypotheses (B) and (C), one obtains the relation 


1 


1- 





G(0,0,0,0), 


(16) 


where 


__ i __ f. 1 .7tAT(—). .. i f. 7tAT(—) i f. 

G(v,v M M ) = -A (v,v M M )+B (v,v M M ). 

Jt> 7T 7T V -0 77 77 -O 77 77 


(17) 


Here A and B are the usual odd-isospin pion- 
nucleon scattering amplitudes, v and vg are 
the energy and momentum transfer variables, 
and and AT^are, respectively, the mass¬ 
es of the initial and final pion. 7 If G(v, **•) is 
assumed to satisfy an unsubtracted dispersion 
relation in the energy variable v, Eq. (14) fol¬ 
lows from Eq. (17). Thus, the assumption that 
the limit {q n ~~ 00 ) may be taken inside the sum 
over intermediate states in the method of Fu- 
bini and Furlan is equivalent to the assumption 
that G(v ,»») obeys an unsubtracted dispersion 
relation. There is evidence that the unsubtract¬ 


ed dispersion relation for G(v, •••) is valid. 8 
Clearly, if a subtraction were required, the 
sum rule for^ would be useless. 

3. Numerical evaluation .-Because Eq. (14) 
involves off-mass-shell pion-proton scattering 
cross sections, a little work is necessary to 
compare it with experiment. Let us split the 
right-hand side of Eq. (14) into the sum of three 
terms: 

1 4Af 2 

1-—1=—4-<Ri+** + *.). (18) 

g A g r 


with 


R 


1 



/ 2 V*V 


M ) 

77 


2tt 



(19a) 


dv 


1 r dv i r °° 

R n =- / —lmG(v, -M 2 /2 Af . Af ,Af )— / — ImG(i/,0,Af ,Af ), 

2 "X " * ^ 77 77 77 77 


(19b) 
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and 



i r 

7T J a m 


dv 

— Im 




G(u,0,M ,MJ- 

7T 7T 


G{v, 0,0,0)" 

/^(O ) 2 J 


(19c) 


The dominant term, R lt involves only the phys¬ 
ical pion-proton total cross sections o ± . Nu¬ 
merical evaluation gives 9 ’ 10 

(4 Mj/g 2 )R, =0.254. (20) 

N r 1 

The term R 2 can be calculated in terms of pion- 
nucleon scattering phase shifts, giving 11 

( 4 Af Ar 2 /^ r 2 )/? 2 = 0 . 155 . ( 21 ) 


The term R 3i which describes corrections aris¬ 
ing from taking the external pion off the mass 
shell, cannot be calculated directly from exper¬ 
imental data. In order to estimate this term, 
we assume that the off-mass-shell partial-wave 
amplitude f ijj(W\MJ y M is given by 




Ml 7T 7T 

f B (W,M M ) 

Ml 7T 7T 


■f (W,M M ). (22) 

MI 7T 7T 


(Here /= orbital angular momentum, J- total 
angular momentum, and/ = isospin.) The su¬ 
perscript B denotes the Born approximation. 
Multiplying the physical fuj by the ratio of the 
Born approximations gives the off-mass-shell 
fljj , the correct threshold behavior, and the 
correct nearby left-hand singularities. Gener¬ 
alized unitarity implies that the off-mass-shell 
and the physical partial-wave amplitudes have 
nearly the same phase; Eq. (22), which gives 
them identical phases, approximately satisfies 
this requirement. Numerical evaluation of R s , 
using Eq. (22), gives 12 

(4 Mj/g 2 )R =-0.061. (23) 

Nr 3 


It is possible that this number for i? 3 is correct 
to within 20%. 13 

Combining the three terms of Eq. (18) yields 


M (24) 

We have not attempted to make a detailed er¬ 
ror estimate. 14 The best experimental value 


for gj^ is 15 

g , expt =1.18±0.02. (25) 

/I 

It is interesting that the region around the 
600- and 900-MeV pion-nucleon resonances 
makes an important contribution to the sum 
rule. If only the contribution of the (3, 3) reso¬ 
nance is retained, we get the result#^ = 1.44. 
Thus, the (3,3) resonance does not exhaust 
the sum rule. 

After completing this work, I learned that 
a similar calculation has been done indepen¬ 
dently by Weisberger. 10 


♦Junior Fellow, Society of Fellows. 

*In the Cabibbo version of universality [N. Cabibbo, 
Phys. Rev. Letters .10, 531 (1963)], Gy is replaced by 
COS 6Gy. 

2 R. P. Feynman and M. Gell-Man, Phys. Rev. 109 , 

193 (1958). 

3 M. Gell-Mann and M. L6vy, Nuovo Cimento 16, 705 
(1960); Y. Nambu, Phys. Rev. Letters 4, 380 (1960); 

S. L. Adler, Phys. Rev. 137 , B1022 (1965). 

4 Gell-Mann and Levy, reference 2. The viewpoint 
that the commutation relations of Eq. (5) may hold ex¬ 
actly is due to Gell-Mann [M. Gell-Mann, Physics 1_, 

63 (1964)1. 

5 S. Fubini and G. Furlan, to be published. 

6 S. L. Adler, reference 3 and to be published; Y. Nam¬ 
bu andD. Lurie, Phys. Rev. 125. 1429 (1962); Y. Nam- 
by and E. Shrauner, Phys. Rev. 128, 862 (1962). 

7 In the scattering reaction Tr{k i )+p{q i )-*TT{k 2 )+p{q 2 ), 
the variables v, vg, and M^/are defined by v 

= -ky {qi + q2)/2Mtf, VB~ k l* k 2/ 2M N* (A/^) 2 = “*1 2 > 

<AV> =-*2 2 - 

8 First of all, the convergence of the sum rule of 
Eq. (11) suggests that an unsubtracted dispersion re¬ 
lation is valid. Secondly, B. Amblard et al., Phys. 
Letters 10, 138 (1964), have shown that the physical 
forward charge-exchange amplitude G{v , 

satisfies an unsubtracted dispersion relation. 

It would be surprising if this result were changed by 
the extrapolation of the external pion mass from 
to 0. 

^Values of <r* from 0 to 110 MeV were taken from 
the smoothed fit of N. P. Klepikov et al,, Joint Insti¬ 
tute for Nuclear Research Report No, D-584, 1960 (un¬ 
published), From 110 to 4950 MeV we used the tabu¬ 
lation of B. Amblard et al., Phys. Letters 16, 138 
(1964) and private communication. Above 4950 MeV, 
we used the asymptotic formula a (r + = 7,73 mb x [fe/ 
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(1 BeV/c)l” 0 ' 7 given by G. von Dardel et al., Phys. Rev. 
Letters 8., 173 (1962). This formula gives a good fit 
to the experimental data up to 20 BeV/c. The contri¬ 
bution to the sum rule of the region beyond 20 Bev/c 
is negligible. 

10 For the pion-nucleon coupling constant we used the 
value f 2 ~ 0.081 ± 0.002, quoted by 

W. S. Woolcock, Proceedings of the Aix-en-Provence 
Conference on Elementary Particles. 1961 (C.E.N., 
Saclay, France, 1961), Vol. I, p. 459. 

u It is convenient to write R 2 as a single integral over 
pion-nucleon center-of-mass energy W, the integrand 
of which is the difference of two terms. This integral 
is sensitive only to low-energy pion-nucleon scatter¬ 
ing data, since the two terms in the integrand cancel 
at high energies. The number quoted in the text was 
obtained using Roper *s l m = 3 phase shifts [L. D. Ro¬ 
per, Phys. Rev. Letters 12, 340 (1964), and private 
communicationl, truncating the integral at IF = 11.20 
Mtt- The integral is dominated by the (3, 3) resonances: 
Extending the integral only over the (3, 3) resonance 
gave {4Mpj 2 /g r 2 )R2 = 0.166. A third calculation, using 
simple Breit-Wigner forms for the (3,3) and the 600- 
and 900-MeV resonances, and neglecting all other par¬ 
tial waves, gave {4M^ 2 /gy 2 )R2 = 0-156. Thus, the val¬ 
ue of R 2 * s insensitive to “controversial” features of 


CALCULATION OF THE AXIAL-VECTOR COU¬ 
PLING CONSTANT RENORMALIZATION IN 0 
DECAY. Stephen L. Adler [Phys. Rev. Letters 
14, 1051 (1965)]. 

Please note the following corrections: (1) De¬ 
lete the redundant sentence immediately fol¬ 
lowing Eq. (14); (2) in Eq. (19a), du/d should 
be du/u\ (3) in reference 7, (M 7] /) = -fe 2 2 should 
be (Mj/) 2 - “& 2 2 ; (4) in reference 8, Eq. (11) 
should be Eq. (14); (5) in reference 13, [( W 2 
-M n 2 +M 7T 2 ) 2 -4W^M 7T 2 ] should be [(W 2 -M N 2 
+M 7 r 2 ) 2 -4lV 2 M 7r 2 ] ( 6 ) in reference 16, 0000 

should be 1047 and R^ should be R,; (7) in ref¬ 
erence 2, Gell-Man should be Gell-Mann, and 
in reference 8 , Phys. Letters 10 should be Phys. 
Letter 10. 


Roper’s phases, such as whether the wave reso¬ 
nates. 

12 This number was obtained using Roper’s phase 
shifts, truncating the integral at W = 11,20 M v , Extend¬ 
ing the integral only over the (3,3) resonance gave 
{4Mtf 2 /g r 2 )R% = -0.066; evaluating the integral with on¬ 
ly Breit-Wigner terms for the low-lying resonances 
gave (4Mpj 2 /g r 2 )Rs =-0.059. 

,3 To estimate the accuracy of the model, we repeated 
the calculation of R 3 with the assumption /jj/OF, 0, 0) 

= fij[(W , , M lr )K NNl, W) 2 W 2 -M n 2 ) 21 [ (W 2 -M N 2 +M t, 2 ) 2 

-4W 2 M 7r 2 ] , which includes only a threshold correc¬ 
tion factor, and a constant factor q) 2 account 

for the change in strength of the nearby left-hand sin¬ 
gularities. The numerical result for {AM^/gy^R^ 
was changed by about 20%, to —0.051. 

14 The variation among different calculations (refer¬ 
ences 11-13) of R 2 andR 3 gives an idea of the uncer¬ 
tainty in the theoretical result. 

,5 C. S. Wu, private communication. 

16 W. I. Weisberger, accompanying Letter [Phys. Rev. 
Letters 14, 0000 (1965)1. In the numerical evaluation 
of Weisberger, gj^ is calculated from the dominant 
term Rj, giving^ - 1.16. 
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Sum Rules for the Axial-Vector Coupling-Constant Renormalization in $ Decay* 

Stephen L. AdleeI 

Lyman Laboratory oj Physics y Harvard University , Cambridge, Massachusetts 

(Received 7 June 1965) 

Starting from the axial-vector current algebra suggested by Gell-Mann and the hypothesis of a partially 
conserved axial-vector current, we derive a sum rule relating 1 — gA* to off-mass-shell pion-proton total 
cross sections. Numerical evaluation gives the theoretical prediction ^*1.24, in good agreement with 
experiment. A similar sum rule for plon-pion scattering can only be satisfied if there is a large low-energy 
7=0, 5-wave pion-pion scattering cross section. We suggest tests, in high-energy neutrino reactions, of an 
algebra suggested by Gell-Mann for the vector and axial-vector current octets. 


INTRODUCTION 

W ITHIN two years after the discovery of parity 
violation in the weak interactions, the main 
features of j8 decay were clarified. 1 It was found that 
only vector and axial-vector couplings are present. The 
vector coupling constant was found to be identical with 
the vector coupling constant in muon decay; the axial- 
vector coupling constant was found to differ by a factor 
gA~ 1.2 from the value expected for a pure V—A inter¬ 
action. The identity of tie vector coupling constants 
in beta and in muon decay was soon explained by the 
hypothesis of a conserved vector current (CVC). 2 The 
value of the axial-vector coupling constant, on the other 
hand, has remained somewhat of a mystery. 3 

We give, in this paper, a theory of the axial-vector 
coupling-constant renormalization gA, based on the 
axial-vector current algebra suggested by Gell-Mann 4 
and on the hypothesis of a partially conserved axial- 
vector current (PCAC). 6 In Sec. I, we discuss the 
assumptions made. In Sec. II, we present two deriva¬ 
tions of a sum rule relating 1— gA~ 2 to off-mass-shell 
pion-proton total cross sections. Numerical evaluation 
of the sum rule, in Sec. Ill, gives the theoretical pre¬ 
diction £^=1.24. In Sec. IV, we derive a sum rule 
relating 2 gjr 1 to pion-pion scattering; we find that this 
sum rule can be satisfied only if there is a large low- 
energy 7=0, 5-wave pion-pion scattering cross section. 
In the final section, we propose tests, in high-energy 


* An abbreviated version of the calculation of gA has appeared 
in Physical Review Letters fiS. L. Adler, Phys. Rev. Letters 14, 
1051 (1965)]. After this calculation was completed, I learned 
of similar work by Weisberger fW. I. Weisberger, Phys. Rev. 
Letters 14, 1047 (1965)]. 

f Junior Fellow, Society of Fellows. 

1 M. Goldhaber, Proceedings of the 1958 Annual International 
Conference on Sigh Energy Physics (CERN, Geneva, 1958), p. 233. 

2 R. P.. Feynman and M. Gell-Mann, Phys. Rev. 109,193 (1958). 

3 Previous papers on the axial-vector coupling constant renor¬ 
malization include: R. J. Blin-Stoyle, Nuovo Cimento 10, 132 
(1958); S. Okubo, ibid. 13, 292 (1959); J. Bernstein, M. Gell- 
Mann, and L. Michel, ibid. 16, 560 (1960); A. P. Balachandran, 
ibid. 23, 428 (1962); H. Banerjee, ibid. 23, 1168 (1962); V. S. 
Mathur, R. Nath, and R. P. Saxena, ibid. 31, 874 (1964); Y. S. 
Kim, ibid. 36, 523 (1965); Y. Nambu and G. Jona-Lasinio. Phys. 
Rev. 124, 246 (1961); Nguyen-Van-Hieu, Nud. Phys. 42, 129 
(1963). 

<M. Gell-Mann, Physics 1, 63 (1964). 

6 M. Gell-Mann and M. L6vy, Nuovo Cimento 16, 705 (1960); 
Y. Nambu, Phys. Rev. Letters 4, 380 (1960); S. L. Adler, Phys. 
Rev. 137, B1022 (1965). 


neutrino experiments, of the algebra proposed by Gell- 
Mann 4 for the vector and the axial-vector current 
octets. The tests make no assumptions about partial 
conservation of the currents. 


I. ASSUMPTIONS 

The sum rules for gA discussed below are derived 
from the following assumptions: 

(A) The hadronic current responsible for A5=0 
leptonic decays is 

7x=GV cos, (1) 

where Gy is the Fermi coupling constant (GV« 1.02 
X10~ 5 /Af at 2 ) and cos0 is the Cabibbo angle. 6 Here 
J\ Va is the vector current, which we assume to be the 
same as the isospin current, and J\ Aa is the axial-vector 
current. In the Fermi theory, we would have had 

J\ Va = i jv7x : , (2a) 

J\ Aa =i -ipirYxybh^N (2b) 


Actually, we know that mesonic and other terms must 
be present. Fortunately, in what follows we will not 
have to assume any specific expressions for J\ v and J\ A 
in terms of particle fields. 

Since the vector current is conserved, the vector 
coupling constant is unrenormalized. The renormalized 
axial-vector coupling constant gA is defined by 

(N(q) \Jx\N(q))- C M N /qo)Gr cos 0* N (q) 

X (y\+gAy\y&)r + U N (q ). (3) 


(B) The axial-vector current is partially conserved 
(PCAC), 




MuM^gA 


- 

grK™'( 0 ) 



Here g r is the rationalized, renormalized pion-nucleon 
coupling constant (g r 2 /47 t«14.6), K NNr (fS) is the pionic 
form factor of the nucleon, normalized so that 
K NNt (—MJ) = 1, and is the renormalized pion field. 


6 N. Cabibbo, Phys. Rev. Letters 10, 531 (1963). 
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According to Eq. (4), the chiralities 


satisfy 



±Ua A2 ) 



UMwMfgA 

grK™*<® 



d 2 x<t> r *. 



(C) The axial-vector current satisfies the equal-time 
commutation relations 

[Ji- 1 ° (x),J* Ai (y) ] | nr-vo = -«(x-y)f 6e / 4 re (*) • (6) 

This implies that the chiralities satisfy 

Cx + (0,^(0]=2 P, (7) 

where P is the third component of the isotopic spin. 

The assumptions (A) are the usual ones for the 
leptonic decays. The vector-axial-vector form of the 
leptonic weak interactions is, of course, well estab¬ 
lished. 1 There is also considerable experimental evi¬ 
dence for the hypothesis 3 that the weak vector current 
J\ v9 is the same as the isospin current. 7 

The hypothesis (B) of a partially conserved axial- 
vector current (PCAC) was introduced by Gell-Mann 
and L6vy 6 and by Nambu 6 to explain the successful 
Goldberger-Treiman relation 8 for charged pion decay. 
In addition to predicting the Goldberger-Treiman rela¬ 
tion, PCAC predicts an experimentally satisfied relation 
between the pion-nucleon scattering amplitude A T ^ (+) 
and the pion-nucleon coupling constant g T 9 
The commutation relations (C) play an essential role 
in the calculation. QNote that Eq. (6) is a somewhat 
stronger assumption than Eq. (7), since even if spatial 
derivatives of the delta function were present on the 
right-hand side of Eq. (6), they would integrate to zero 


in Eq. (7). Only Eq. (7) is actually needed in the 
derivation below.] The hypothesis that Eq. (6) or 
Eq. (7) holds exactly is due to Gell-Mann. 4 Gell-Mann 
and Ne’eman have emphasized 10 that Eq. (7) is the 
most natural way in which one can make meaningful 
the idea of universality of strength between the weak 
couplings of leptons and baryons, without spelling out 
in detail the construction of J\ A from particle fields. 
Gell-Mann has also pointed out 11 that Eq. (7), by fixing 
the scale of the axial-vector current relative to the 
vector current, can, in principle, determine the axial- 
vector renormalization gA- 

To sum up, Eqs. (1), (3), (5), and (7) are the hy¬ 
potheses on which our calculation of gA is based. They 
are mutually consistent, in the sense that there is a 
renormalizable field theory (the <r model of Gell-Mann 
and L6vy 6 ), in which they are exactly satisfied. 

H. DERIVATIONS OF THE SUM RULE 

We give, in this section, two different derivations of 
a sum rule expressing gA in terms of off-mass-shell 
pion-proton total cross sections. A third derivation has 
been given by Weisberger. 13 

A. Method of Fubini and Furlan 

The simplest derivation uses a method proposed 
recently by Fubini and Furlan. 13 We take the matrix 
element of Eq. (7) between single-proton states (p(q) \ 
and | P(q r ))- The right-hand side gives 

(P(q) 1 2/ 3 1 P(<f)) = (2ir ) 3 S (q— q'). (8) 

In the matrix element of the commutator, we insert 
a complete set of intermediate states, separating out 
the one-nucleon term (to which only the neutron 
contributes): 


f <Pk 

(P(q)\[xHt),x-m\P(q'))= E / —Mq) Ix+Mi »(*)><»(*) lx-«)I/>(?')) 

spin J (2 t) z 


+ E (P(q) I x + (t) | j)(j | x~(t) I />(?'))- (x + «-»■ x _ ) • (9) 

ir*N 


The one-neutron term is easily evaluated using Eq. (3), giving 



<Pk 

j~(P(q)\x + (t)\n(k))(n(k) |x’(0 \p(q')) 

(2t) 3 



(27r) 3 $(q—k) (2T) 3 $(k— 


/Mn Mn\ fk-\-iMx\ 

<!')(-— }gA 2 u(q)yiyd - 17475 «(?') 

\ <?o «o / \ 2 iM N / 


= (27r) 3 $(q- q')gA*(l—JtfjvYffo*). 



1 C. S. Wu, Rev. Mod. Phys. 36, 618 (1964). 

fl M. L. Goldberger and S. B. Treiman, Phys. Rev. 109, 193 (1958). 
• S. L. Adler, Ref. 5. 

10 M. Gell-Mann and Y. Ne’eman, Ann. Phys. (N. Y.) 30, 360 (1964). 
U M. Gell-Mann, Phys. Rev. 125, 1067 (1962). 
u W. I. Weisberger, Phys. Rev. Letters 14, 1047 (1965). 
w S. Fubini and G. Furlan, Physics 1, 229 (1965). 


R9 


173 


B 738 


STEPHEN L. ADLER 


In the summation over higher intermediate states we make use of Eq. (5), giving 


^2M N M r *gA 


iw'gAy (p(q)\S<Px<t> T +\j)(j\S<Px<t>T-\p(q)) 

- 2Z-:-:-0r + 

'(0) J * 


g r 2sT™*(0) J 


(qo-qnd* 


IT ) 


(ID 


From Eqs. (10) and (11), we see that there is a family of sum rules, with qo as a parameter. In the limit as q o 
approaches infinity, a sum rule for 1—g ^ -2 is obtained. Let us assume that the limiting operation can be taken 
inside the sum over intermediate states in Eq. (11). It is useful to write this sum in the form 


r'cPqj /•“ 

E-/ —/ dW E BQV-Mt), (12) 

j*iV J (2tt)v Mx+M, ij-N 


where q,- is the total momentum and where “INT” denotes the internal variables of the system j. We have denoted 
by M, the invariant mass of the system j. The integrations over x and q, can be done explicitly, giving a factor 
(2ir) 3 6(q— q') and constraining q,- to be equal to q. Let us write 

01^(0) | p{q)) = {{M N /qo) (kA'o)) 1 '^, (13) 

so that Ff*=■ is a Lorentz scalar. Then we have for the summation over higher intermediate states, 


r\flM N MSg A -f 

(2«-) 3 6(q—q') - / dW E 6 (W - Mi) (M N /q 0 ) (Hfj/fco) (?o - ?yo)- 2 [ | 2 - I Fj+ 1 2 ]. (14) 

L gr X™'(0) JJ Uir+ M w 


Using the equations 


(?o-?yo)- J = {qz+qj*r/{M?-MJT , 


(15a) 

(15b) 


the limit as jo—>°o of Eq. (14) becomes 


r ^M NgA t n , r m, 

- (2ir) 3 5(q— q') / dW - 

Lg,X™'(0)J (TU 2 - 


MnW 


■ lim 
Hat 2 ) 2 «•—» 


Cg>»+ (gw , +W—Af j>*) 1 ^ 2 J 

q^qf+W'-MW* 


X lim l^pF,(?-?,) 2 ]-XWfo-?i)*]], (16) 

flCT+W 

where we have defined ?y) 2 ] by the equation 




3*N 

INT 


(17) 


Note that K* can only depend on the indicated variables because (i) if* is a Lorentz scalar, and (ii) all internal 
variables are summed over . 14 

It is now trivial to take the indicated limits. The limi t of the quantity in curly brackets is 4, and the limit of the 
momentum transfer (q— qj) 2 = — C 0 o— (tfoH-TF*— -MV ) 1/2 ] 2 is 0. Thus we are left with the sum rule 


1 2 

1 -=- 

gA 2 gr*K NN *( 0) 2 


Uf N WdW 




ZK + (W,0)-K-(W,0)]. 


(18) 


To complete the derivation, we must express in terms of pion-proton scattering cross sections. Let 

o'o ± (IF) denote the total cross section for scattering of a zero-mass ir ± on a proton, at center-of-mass energy W. It 
is easiest to calculate <r ^(WO in the center-of-mass frame. If we let k and q be, respectively, the four-momenta of 


14 An average over initial proton spin is understood, but is not indicated explicitly. 
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the initial pion and proton, then we have 14 

<r 0 ± (W)-flux= (2 tt) 4 E 

j*N 


\(j\jr*mpm 

2k n 


&{qj-q-k) 



— E 

(2t)* j i 

INT 


\(j\Jr<0)\Pm 

2k o 


tfift-q-k) 


= 2 t 2 Z 


INT 


\(j\Jr<0)\pm 

2 h 


2 

7o — ko) . 


B 739 



Keeping in mind the fact that the initial pion has zero 
mass (& 2 = 0 ), the following center-of-mass-frame equa¬ 
tions may be derived: 

go+fco=W, g/o=Afy; (20a) 

flux= |k|/A 0 +|k|/go=^/go; ( 20 b) 

h= (IF 2 — Mat 2 )/ {2W) ; (20c) 

(j\JM^)\P(q))^M T Hj\<kMO)\p(q)) 

= MJ(M N /q Q r*Fs±. ( 20 d) 

Combining Eqs. (19) and (20) gives 

j*N 

INT 

= (2TM N /(W-MJ))KHWfi ). ( 21 ) 


Let us take the matrix element of this equation between 
states ( 0(kp)\ and |a(£/)). We get the equation 

— i(kp— kr)\0(kp) | jx(0) |a(A/)) 

-<U(*jp)|d(0)|a(*/)). (24) 

Let us now consider what happens as {kp—ki) —> 0. In 
this limit, only those pole terms of (0(kp) |ix( 0 ) |a(£/)) 
which behave as {k F —ki)~ l will contribute to the left- 
hand side of Eq. (24). It was shown in (II) that these 
singularities arise only from insertions of the vertex of 
j\ on external lines of (5|a). Furthermore, in the limit 
as {kp—ki)— > 0 , these insertions leave the external 
particles on mass shell. Thus we get a “consistency 
condition” expressing 

lim 0(k F )\d(p)\a(kr)) (25) 

(kp — kf)-+Q 


Comparing with Eq. (18), we get the simple and exact 
sum rule 


1 MIn 2 1 

1 -=- 

gA 2 g, 2 K™'(0) 2 


WdM 


Mjf+Mr W*-Mi sr 2 


in terms of the physical matrix element Clearly, 

the same procedure can be applied to the quantities 

j (0 = Jd*xjt(x t t) and d(t) = J d 8 3 cd(x, 0 , 


X&o+(HO-<r n “(IF)]. (22) 

While the derivation just given is straight-forward, 
it suffers from the defect of requiring an additional 
assumption: We must assume that the limit q 0 —can 
be taken inside the sum over interemdiate states in 
Eq. (11). The next derivation which we give clarifies 
the meaning of this assumption. 

B. “PCAC Consistency Condition” Method 

In two previous papers 16 (hereinafter called I and II), 
we showed that the hypothesis of a partially conserved 
axial-vector current leads to consistency conditions in¬ 
volving strong-interaction scattering amplitudes. The 
method used is a general one. Suppose that we have 
local field operators j\{x) and d(x) which satisfy the 
equation 

_ d\j\ (x)=d(x). (23) 

15 S. L. Adler, Phys. Rev. 137, B1022 (1965), hereinafter called 
I; S. L. Adler, Phys. Rev. 139, B1638 (1965), hereinafter called II. 
See also the related papers: Y. Nambu and D. Luri6, ibid. 125, 
1429 (1962); Y. Nambu and E. Shrauner, ibid. 128, 862 (1962). 


which satisfy the equation 

dj(t)/dt=id(t). (26) 

Of course, the resulting formulas will not be manifestly 
covariant. What was done in (II) was to study in detail 
the case when j(t) is simply the chirality x°W* We will 
now apply the same method to a somewhat more 
complicated object, 

j(x a )=jdy a e~ il “ im (N(q) | 7tx a (*°)x*(yo)] | ,v (?)), (27) 

in order to rederive the sum rule for gA. 

Let us consider the quantity T defined by 

7 '= J dxo e il * X0 J dyo e~ iiotfo 

X(A r (g)|T[x«(^)x fr (yo)]|Y(g)> 

= / dx o e il ° xo j (*n) - 


(28) 
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Let us also define P a (x) by the equation 

d\J\ Aa (x)=P a (x), 
so that the chirality x°(#o) satisfies 

d 

dx o 


J d 3 xP*{x). 



We will introduce the assumption that P a (x)°c<l> ir a (x) 
at a later stage of the calculation. 

From time-translation invariance, we know that 



j (#o)=constant. 


Consequently, 



— ikoj(xo)= j(x 0 )= fdy a < r"°''°(A7'(?)|-P[x a (*o)x*(yo)]| N(q)) 

dx o J dx o 

= er^(N (?) | [x“(*b),x s (*o)] | N (?)>+ J dy 0 Jd’x | 7'[/ > “(x)x i, 0'n)] | N(q)). (32) 


Since the second term on the right-hand side of Eq. (32) is proportional to exp(—i£o#o), we can rewrite it as 


H <Px £*“ ,i ovo(— \2 X +MS 


)(N(q)\TZP*(x) X >(y Q )l\N(q)) 


(33) 


We have assumed that we can integrate by parts with respect to the spatial variables x; this can be justified by the 
use of wave packets . 10 Combining Eqs. (28), (32), and (33), and then interchanging the order of the integrations 
over Xq and y 0 , gives 


— ik Q T= J ^o^ ( ^^ )x °(A^fe)|[3c 0 (^),X fr (^o)]|^(?)) 


+ dx o e ikx *- 


1 




dy Q / <Pxe ~ ik •»(- n*+M T *)(N(q)\TZP*(x)x b (yo)l\N(q)) 


= 2T5a 0 -W(A r (?)|[x“(O),x 6 (O)]|iV( ? )H 


1 


with 


M r *-k o 2 . 


dyoe~ ikf>m ji(yo) , 


(34) 


/ d A xe il w(- n*+M r *)(N(q) \ TZP*(x) x b (y Q )3\N(q)) 


_ e iiQvo ^ constant. 


(35) 


Treating ji(yo) in the same manner as we treated 7 (^ 0 ), we get 

d 3 x e <hv °{N(q) | [x fr (yo),i > 0 (x,y 0 )] | N(q)) 


+ 


1 


M r *-h\ 


d*x / <Py CL+M, 2 ) (- Uv+M *){N(q) | r[P‘ W? s (y)] | N(q)). (36) 


To sum up, we have derived the identity 


—ik Q dx 0 e ilt *Q / dy Q e~ ik ^(N(q) | 7'[x a (^o)x 6 (3'o)]| N(q)) 


= 2rf(/n-^n)j^(A'(y) | Cx°(0),x J (0)] I A 7 (?))+ - ^ ^ d>x ( N ^ I Cx l (0),P a (x,0)] | A 7 (?))J 


+ 


1 


1 


(MS-k<?)(MS-h>) ih. 


d'x / d'ye^^i- D.+A/VX- □„+M. 3 )(A 7 ( ? ) | r[P»(x)P 4 (y)]|A 7 ( ? )). (37) 


11 We will never integrate by parts with respect to the time variable. 
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Since we will obtain the sum rule for g A from the part of Eq. (37) which is antisymmetric in a and b 9 let us drop 
all terms which are symmetric. Because Cx 0 (#o),xH*o)] = ^ oftfl / fl , and since dPjdx^ 0, we have <Cx°(*o),x & (#o)D/ 
dx o=0. In other words, 

J (f^[P“(x > a*) ) x*(*<>)!]= J cPx[P b (x,x (l ),x , ‘(x a )2, (38) 

indicating symmetry under interchange of a and b. Thus we can drop the term proportional to 

(A 7 (<?) | [x t (0),-P°(x,0)]| A 7 (?)). 

Let us now consider the antisymmetric part of Eq. (37) for small k <*, At the end of the calculation, we will let 
ko approach 0. On the left-hand side, only diagrams with x a inserted on the external nucleon lines will make a 
contribution of zeroth order in £o, as was shown in (II). This can be seen directly by inserting a complete set of 
intermediate states in the time-ordered product: 

dx o jdy^^ {N (q) | T[x a (*b)x 4 (yo)] | N(q)) 



= j dxt> j dyue' ll,x,, ~ <kon X) C(Af (?) Ix^o) liXj |x s 0yo) | N (?))0(*o—yo) 

+ <a 7 (?) | x s (yo) [ j)(j | x a (xo) I A 7 (?))0(yo-*o)] 

= E C<A 7 (?)| | A 7 (q))i (^ 0 - Ay)- 1 - (N(q) | /*“ ( 0 ) | j)(j \J< A ° ( 0 ) | N(q))i (£ 0 + A;) -1 !] 

J 

X2irt(h-ko) (2ir) o S(0)5(qy- q), (39) 

where 

Ay— (qt+Mf-M^-q*. (40) 

Clearly, only the one-nucleon intermediate state {j=N t Ay=0) gives a singularity behaving as ko~\ Evaluation 
of the spin sum, as in Eq. (10), gives, for the left-hand side of Eq. (37), 


(2t)*5(0)5(/o-£o)^ (41) 

where 0(k o) indicates terms which vanish as ko—> 0 . 

Let us now evaluate the terms of the right-hand side of Eq. (37). The commutator of the chiralities is easily 
evaluated, using Eq. ( 6 ), giving 

2^A(A>—I [x-W^O)] I (2^)V(0)a(2o—(42) 

In the last term of Eq. (37), let us introduce the PCAC hypothesis, 


giving 


P*(x) 


M N M r *g A 

gr*™’(0) 


<£»°(#) , 



/ M , 2 w M* \r M n g A 

W„ 2 - W/U 2 - io 2 JLg r K™*(0) 



d 4 y gikto-ikoito 


X (- U*+Mr 2 ) (- □*+J4V)(jV(?) | n>,‘(*)^>(y)] | N(q)). (44) 

Apart from factors, this is just a pion-nucleon scattering amplitude. In fact, the off-mass-shell pion-nucleon scat¬ 
tering amplitudes 

A rN ^{v t v B ,M r \M/) and B* N <->(r 9 v B9 M w *,M w f ), 
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where M,* and M r f are, respectively, the masses of the initial and final pion, are defined by 17 

J**j d< y «-«*««»(- Dx+M, 2 ) (- Qv+MSXN ( ?1 ) | 7T>,«(z)0.*(y)] | A r (?i)> 

— ~i (2t)^5(^i+ h—q%—F)((M n/ qio) (Mw/ q 2 o) ) V8 

XuN(q i ){\iA 1 ' N <- ) (v t VB t M x i ,M T / )-ikB TN <- ) (v t vB 9 M* i M* / )l$L'r a ,r x \\+isospin symmetric}w^(^i), (45a) 

w B -k-l/(2M N ), v= -k • ( qi +q 2 )/(2M N ). (45b) 

The term B can be separated into pole terms, 17 and a nonpole part which we label B : 

B 9N ^={jg?/2M N )K NN9 l~ (M/) 2 ]^^- (M/n^vB-v^+iv^vy^+S^-K (46) 

The integral in Eq. (44) is identical with Eq. (45), with 

l=(fi 9 ilo) = k=(fi,iko) 9 M/=M/=£ 0 ; wB—kt/(2M w ), v=qoko/M N . (47) 

Combining Eqs. (44), (45), (46), and (47), we find that Eq. (44) becomes 

f 2 Mat 2 1 ) 

(2T)^(0)5(/ 0 -^o)i€ o6fl (ir-) -^W/?o 2 - TJV< “ ) (y,0,0,0)+y5’ rAr <“>(*,0,0,0)] +0(W), (48) 

l g r a lT WW '(0) f V 


with v—q&o/Mif. The term proportional to —g A 2 M] sr 2 / 
arises from the Bom term in Eq. (46) when the sub¬ 
stitutions of Eq. (47) are made, and just cancels the 
similar term in Eq. (41). Thus, in the limit as ko—> 0, 
we obtain from Eq. (37) the Lorentz-invariant identity 

1 -2 Mn* 

1---G(0), (49) 

gA 3 gr 2 K™*(0) 2 

where 

G(p) = p- 1 [A^<->(pA0fi)+pB^- ) (p r 0,0,0)] 

= J r-i[^^i-)( J/ ,o,o,o)+^’ fAr< -)(y, 0,0,0)]. (50) 

We are able to drop the bar on B because the Bom 
term (vs— *)“*+ (**+ *) _1 vanishes identically at vb =0. 

Equation (49), which follows solely from the assump¬ 
tions of Sec. I, is our final result. From the crossing and 
analytidty properties of A 9li ^ and we know 

that G(v ) is an even function of v and is analytic in the 
v plane, apart from cuts running from =b[Af»+M T 2 / 
(2My)] to ±00. Let us assume that G(*) satisfies an 
unsubtracted dispersion relation in the variable v . Then 
we may write 

2 f" dv 

G(0)—/ — ImGOO. (51) 

It is easily verified that 

ImG(*)=§(<ro“— ^o + ) ■ (52) 

Changing the integration variable from v to the center- 
of-mass energy W [y= (TF*—Mw 2 )/(2M Ar )], and com¬ 
bining Eqs. (49), (51), and (52) leads to the sum rule 
of Eq. (22). Thus , the assumption tkat the limit qo—>oo 

17 G. F. Chew, M. L. Goldberger, F. E. Low, and Y. Nambu, 
Phys. Rev. 106, 1337 (1957). 


may be taken inside tke sum over intermediate states in 
the method of Fubini and Furlan is equivalent to the 
assumption that G(v) obeys an unsubtracted dispersion 
relation. 

There is evidence that an unsubtracted dispersion, 
relation for GO) is valid. First of all, provided that the 
Pomeranchuk theorem is valid, the integral in Eq. 
(22) is convergent. Secondly, Amblard et at. and 
Hohler et al . have shown 18 that the forward charge- 
exchange scattering amplitude 

A 9 *<-> O, -M, 2 /(2Mw), M, 9 M T ) 

+ vB*<-> 0, —Mr 2 / (2My), 

satisfies an unsubtracted dispersion relation. It would 
be surprising if this result were changed by the ex¬ 
trapolation of the external pion mass from M r to 0. 
Clearly, if a subtraction were required, the sum rule 
for g A would be useless. 

By writing a dispersion relation for the last term in 
Eq. (37), without assuming the PCAC hypothesis, one 
gets a sum rule relating 1— g A 2 to cross sections measur¬ 
able in high-energy neutrino experiments. This sum 
rule is discussed further in Sec. V. 

m. NUMERICAL EVALUATION 

Because Eq. (22) involves off-mass-shell pion-proton 
scattering cross sections, a little work is necessary to 
compare it with experiment. Let us split the right-hand 
side of Eq. (22) into the sum of three terms: 

l-^“ 2 = (4 Mrf/gf)(Ri+Ri+Ri) , (53) 

10 B. Amblard a al., Phys. Letters 10, 138 (1964); G. Hohler, 
G. Ebel, and J. Giesecke, Z. Phyaik 180, 430 (1964). 
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with 

1 r dv / Mr* \ 

Ri= -/ - ImG r,-, M r} Mr 

irJ M* v \ 2M y / 

= -/* (54a) 

2irJ m* v 1. 


It is interesting that the region around the 600- and 
900-MeV pion-nucleon resonances makes an important 
contribution to the sum rule. If only the contribution of 
the (3,3) resonance is retained, we get the result g A 
= 1.44. In other words, the (3,3) resonance does not 
exhaust the sum rule. 

The remainder of this section deals with the details 
of the numerical evaluation 


1 f* dv / Mr 2 

R*=- I — ImGU — 

1TJ Mr V \ 2 M 


V \ 

-,M„ Mr) 

N ' 


1 


*30 


dv 


lmG(v 9 0,M rj Mr) t (54b) 


Mv+Mt*/(2Mn) V 

l r dv 


Rz=- I — ldiG(vfl 7 M,J£r) 

TJ AfT+AfrV(2Jt/y) V L 

G(y,0, 0,0) -j 

K HlfI (0) 2 J ’ 


(54c) 


(54d) 


There is a definite reason for splitting things up this 
way. Numerically, we find that | Ri \ > \ R 2 | > | R 3 |. The 
dominant term, R u involves only the physical pion- 
proton scattering cross sections and thus can be 
reliably determined. The terms R 2 and Ri are correc¬ 
tions, which take into account the fact that the sum 
rule involves the forward charge-exchange scattering 
amplitude, with both external pions of zero mass. The 
term R 2 can be calculated in terms of pion-nucleon 
scattering phase shifts. Since it is dominated by the 
(3,3) resonance, it can be fairly reliably calculated. The 
term Rz is less well known, because a model is needed 
to calculate the off-mass-shell partial wave amplitudes. 

We get the following numerical results 19 


(4Jf y 2 / g r 2 )Ri= 0.254, 
(4My7g r 2 )2? 3 =0.155, (55) 

{4:M tf t /g r 2 )Rz= —0.061 , 

giving 

^theory =L24> (5 6 ) 


A. Calculation of Ri 

As stated above, Ri is calculated directly from the 
physical pion-proton total cross sections a ± . Values of 
(r ± from 0 to 110 MeV were taken from the smoothed 
fit of Klepikov et a /. 21 From 110 to 4950 MeV, we used 
the tabulation of Amblard et al. n Above 4950 MeV, 
we used the asymptotic formula <r~—e + = 7.73 mb 
XCVCBeV/c )] -0 - 7 given by von Dardel et a /. 23 This 
formula gives a good fit to the experimental data up to 
20 BeV/c. Use of this formula beyond 20 BeV/c repre¬ 
sents an extrapolation from the present experimental 
data, and gives 


4My 2 1 


►so 


dv 


g> 


2r. 


(v 2 -Mr 2 ) 1/2 (e+-o-)= -0.011. (58) 


20 BeV 


Thus, unless the [_k/(B eV/c )] -0 * 7 asymptotic behavior 
is very much in error, the region above 20 BeV/c 
contributes only a few percent of 1 — g A ~ 2 . 

B. Calculation of R 2 

It is convenient to express R 2 as a single integral 
over center-of-mass energy W, the integrand of which 
is the difference of terms referring to vb= 0 and to vb 
— — M-?l(2Mx). The center-of-mass scattering angle <f> 
is given by 

y=cos^= l+if T 2 /|k | 2 at v B =0, 


y=cos^= 1 at v B = —Mr 2 / ( 2 M N ), 


(59) 


where |k| is the center-of-mass frame pion momentum. 
Thus we get 


•SO 


R*=-16l dW\{W), 

Mb+Mw 


A reasonable error estimate, based upon the variations 
among the several calculations of R 2 and R% discussed 
below, is ±0.03. The best experimental value is ? 0 

g A ^= 1.18±0.02, (57) 

Thus, the sum rule agrees with experiment to within 

5%. 


w For the pion-nucleon coupling constant, we used the value 
/ , s*£r 8 M»V(l6TMy*) = 0,081±0.002 quoted by W, S, Woolcock, 
Proceedings of the Aix-en-Provence International Conference on 
Elementary Particles (Centre d'Etudes Nucl6aires de Saclay. Seine 
et Oise, 1961), Vol. I, p. 459. 

“ C. S, Wu (private communication). 


A(ir)= 


w 

{\\n-MJY 



w, H 


M r \ (W+M„y 
|k|V(^+^) 2 -M r 2 



M x \ ( W-Mn? 1 
\V\V(W-M N ?-Mr 2 \ 


W 2 

(m-MY-Mr 2 ) 2 


Zfi(W,l)+MW,l)-], (60) 


21 N. P. Klepikov et al. t Dubna report D-584, 1960 

(unpublished), 

28 B, Amblard et al. t Ref. 18 and private communication. 

” G, von Dardel et al., Phys. Rev. Letters 8, 173 (1962). 
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with fi(W,y) and fziW^y) the usual center-of-mass 
pion-nucleon scattering amplitudes. Since f\ and fz are 
analytic functions of y in an ellipse with foci ± 1 and 
with semimajor axis l+2M x 2 /\ k| 2 , 24 we can legiti¬ 
mately use paxtial-wave expansions in calculating f\ 
and / 2 in both terms of Eq. (60). The integral is rapidly 
convergent, since the two terms in A(W) tend to 
cancel at high energies. 

The number (4if x 2 /g r 2 )R 2 = 0.155 quoted in Eq. (55) 
was obtained by using Roper’s A»=3 phase-shift fit, 26 
truncating the integral at 11.20Af*. (Beyond this 
energy no phase-shift fit is available.) The integral is 
dominated by the (3,3) resonance; extending the inte¬ 
gral only over the (3,3) resonance gave (4ifw 2 /g r 2 )-^2 
= 0.166. A third calculation, using simple Breit-Wigner 
forms for the (3,3) and the 600- and 900-MeV reso¬ 
nances, and neglecting all other partial waves, gave 
(4Afw 2 /g r 2 )i?3=0.156 when the integral was truncated 
at 11.20Af v , and (4Jfw 2 /g r 2 )^2='0.145 when the integral 
was extended to an upper limit of 17M x . The good 

agreement of these numbers indicates that is in¬ 
sensitive to “controversial” features of Roper’s phases, 
such as whether the Pn wave resonates. 

C. Calculation of Rz 

The term Rz, which describes corrections arising from 
taking the external pion off the mass shell, cannot be 
calculated directly from experimental data. In order to 
estimate this term, we must assume a model for the 
off-mass-shell partial wave amplitude fui(W t M x { ,M x f )- 
(Here /= orbital angular momentum, J= total angular 
momentum, and/=isospin.) 

Actually, in order to evaluate 2? 3 , we only need to 
know the imaginary part of furWMJM* 1 )' Below 
the inelastic threshold at W=Mir\-2M XJ generalized 
unitarity tells us that 

= | k\fui(WM w *J£.)f U i(WJ£SJ£.)*. (61) 

The intermediate state pion is, of course, on the mass 
shell. Since only the region around the (3,3) resonance 
is appreciably affected by taking the external pions off 
the mass shell, it suffices to study fui{W y Mr*,M x ) and 
then to use the elastic unitarity relation of Eq. (61) to 
get ImfuiiWtMJMiO- 

In constructing a model, we use the following in¬ 
formation about fur: 

(i) Threshold behavior. From kinematic considera¬ 
tions, we know that near the threshold at W=Mh+M Xi 

0 will be equal to (| k m '||k-^| ) f times 


*This statement assumes the validity of the Mandelstam 
representation. 

“ L. D. Roper, Phys. Rev. Letters 12, 340 (1964) and private 
communication. 


slowly varying factors, with 

|k"| =[(W*0 2 -M, 2 P 2 , 
ko i -f=[W i -M N i + (M x i - f ) 2 3/(2W). 

Here | k*'| and | k^| are the center-of-mass momenta of 
the initial and final pions; when M x i =Q{M x ) ) we de¬ 
note |k’| by | k°| (| k|). 

(ii) Unitarity . Setting either Mr' or M x f equal to 
M x in Eq. (61), we see that fuiiWjM^jMr) has the 
same phase Bui as the true pion-nucleon partial wave 
amplitude fui{WiM*M*\ 

(iii) Left-hand singularities . Changing the external 

pion mass changes the left-hand singularities in the 
partial wave amplitude fur(W J&JJir*). The left- 
hand singularities closest to the physical region come 
from the partial wave projection /ijj B (W r ,Af* , ,Af*0 °f 
the Born approximation (the pole term) in Eq. (46). 
Reference to Eq. (46) shows that fui B {W , 1 M x i 1 M x f ) 
contains a factor (M x *) 2 2^ NNir t~ ( M x -Q 2 ] aris¬ 

ing from the change in strength of the coupling of the 
external pions to nucleons when the external pion mass 
is changed from the physical value. 

A simple model, which takes into account the con¬ 
siderations (i)-(iii), is to take 


fuiiW^r^Mr) 


fuiBQVtMr^Mr) 

fiJl*(WJ£ x J£ 9 ) 


fui{W,M Xl Mr) 



Equation (63) gives fuiOVjMJjMr) the same phase as 
fuiiWjMrjMr). Multiplying the physical fui by the 
ratio of the Born approximations gives the off-mass- 
shell fur the correct threshold behavior and, approxi¬ 
mately, the correct nearby left-hand singularities. A 
second model is to take 


fuiiW^M^Mr) 

- ( | k V I k I (M X *) 2 2flJI(WyMrtMr ). (64) 

Here we have put in only a threshold correction factor 
and a constant factor (Af/ - ) 2 ] to account for 

the change in strength of the nearby left-hand singu¬ 
larities. According to Eq. (61), the first model gives 


ImfuiQVflfl) 

r fuPQVflfl) "i 2 

= - Im furtWMiM*), 

Lfvi*(WJ£ x ,M 9 )J 



while the second model gives 

lmfur(Wfifi) =(|k°|/|k| )*«**■( 0) 2 

XImfur(W t M x1 M x ). (66) 

Although Eq. (61) is valid only below the inelastic 
threshold, we will use Eq. (65) and Eq. (66) above the 
inelastic threshold as well as below. 

Numerical evaluation of Eq. (54c) gives (4 Mtf/gf)Rz 
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= —0.061 when the model of Eq. (65) is used, and 
(4Mn 2 /g r 2 )Rz= —0.051 when we assume Eq. ( 66 ). In 
both cases, Roper’s phase-shift fit was used, and the in¬ 
tegral was truncated at W = 11.2GA7 *. Using Eq. (65) in¬ 
tegrated only over the (3,3) resonance gave (4 MiS/gr)Rz 
— — 0.066. Evaluating the integral with only Breit- 
Wigner terms for the low-lying resonances gave similar 
results. Thus, the quoted value of Rz, while dependent 
on the model used for going off mass shell, is insensitive 
to “controversial” features of the phase shifts. 

D. Remarks 


Now let us make a quantitative analysis. According 
to Eq. (57), the left-hand side of Eq. (69) is 

2/gA 2 =l-43. (70) 

Let us express the right-hand side of Eq. (69) in terms 
of the variable s= W 2 , giving 


1 

g 2 K” N *(0) 2 2ir 


r" ds 

-|Vo*“ (s )— <r 0T + CO ]. 

4AO* S—Mx 2 



As in the proton case, we take account of the fact that 
the external pion in Eq. (71) is of zero mass by writing 


The terms R 2 and Rz, which come largely from the 
(3,3) resonance region, give a combined contribution of 
0.094, as compared with the contribution of 0.254 
coming from Ri. It may at first seem surprising that 
the effect of R 2 and 2 ? 3 is so big, but it is easy to under¬ 
stand this. From Eq. ( 66 ), we can see that the main 
effect of i ?2 and Rz is to multiply 0 - 3 . 3 , the (3,3) reso¬ 
nance contribution to the integrand of 2 ?i, by a factor 

|k°| 2 /|k| 2 . (67) 

At the peak of the (3,3) resonance, this factor is 1.27. 
Since the (3-,3) contribution to 2?i is 0.43, we expect 
Ri to be increased by an amount of order 

0.27X0.43^0.12, ( 68 ) 

in rough agreement with the sum of R 2 and 2? 3 . 

IV. PION-PION SCATTERING SUM RULE 

In Sec. II, we took the matrix element of Eq. (7) 
between proton states and derived a sum rule relating 
gA to pi on-proton scattering. Now let us take the 
matrix element of Eq. (7) between ir + states. The same 
manipulations used in the proton case lead to the sum 
rule 

2 4-Afjv 2 1 f 00 WdW 

gA 2 "gr 2 K™*(0) 2 ir J 2 Mt W*-M r 2 


<ro x l ‘ I (s) = K NN *(0) 2 (\ k°|/| k| ) 2 W' r (s) 

= K NN ^(0) 2 l(s-Mr 2 ) 2 /s(s-mr 2 )lW J (s) , 

(72) 

where 1= orbital angular momentum, 7=isospin, and 
<r x l,l (s) is the on-mass-shell partial wave cross section. 
ThusEq. (71) becomes 

4MV 1 r* ds 


g? IttUmS s-M 2 

00 r (s-M* 2 ) 2 ~] 1 


[ « r (s-MSF y 

X L , 7 . 7 - 

I J-0 Lj(j— 4:M* 2 ) J 

/even 

00 r (s-M* 2 ) 2 1 1 I 

+ E --- ^(s) . 

1-1 b(j-4jf r «)J 



Let us first evaluate the contributions of the two 
well-established 7 nr resonances, the 1=1=1 p and the 
1= 2,7=0 /°. We parametrize o -* 1 * 1 andtr * 2,0 in the form 26 


12*7 P 2 S/(v+MS) 

0‘S(s) =-, 

(*p-*) 2 +YpV/(*+M, 2 ) 

20*7/V/ (v+M* 2 ) 

0‘S >0 (s)= -, 

(Sf— s) 2 +y/S/ (v+M w 2 ) 


(74) 

v=\s—MS. 


XC <r o*“(W r )—<r 0 * + (W)], (69) 

where a'o» ± ( W) is the total cross section for scattering 
of a zero mass on a physical t + , at center-of-mass 
energy W. Equation (69) involves gA~ 2 , rather than 
gA~ s — 1, because the one-pion intermediate state con¬ 
tribution vanishes on account of parity. The factor 2 
on the left-hand side of Eq. (69) comes from the fact 
that (Tr + (q)\2P\ir + (q f ))=2' (27r) 3 S(q—q'). 

While, of course, no direct pion-pion scattering data 
is available, there is enough information on pion-pion 
resonances to compare Eq. (69) with experiment. First 
of all, o-Qr+fW) comes only from 7=2 scattering. While 
there are resonances in the low energy 7= 0 and 7= 1 
pion-pion scattering, the 1=2 scattering seems to be 
small. Thus the right-hand side of Eq. (69) is positive, 
agreeing in sign with the left-hand ride. 


The reduced widths y„ 2 and 7 / are related to the 
experimental full widths at half-maximum T p and Tf by 


7p 2 = 


Vp+M * 2 Vf+MS 

-;—^ p 2 , 7 /=- s/Tf , 


vf 


(75) 


vp, / 4 $p,f 


Using the experimental values 27 s p = 29.7M* 2 , T p 
= 0.755Jf„ j/=80.0Af, 2 , r/=0.716Af,r, we get, for the 
p and /° contributions to Eq. (73), 


(76) 


p contribution=0.42, 
f° contribution = 0 . 11 . 

As a check, we also calculated the p and f° contribu- 

28 L. A. P. Bal&zs, Phys. Rev. 129, 872 (1963). 

27 A. H. Rosenfeld el al. f Rev. Mod. Phys. 36, 977 (1964). 
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tions in the narrow resonance approximation. This gave 
0.35 for the p and 0.09 for the f° contribution, indicating 
that resonance shape corrections will not substantially 
change the numbers of Eq. (76). 

The contribution of 0.53 from the p and the f° is 
only 37% of the total of 1.43 required by the sum rule. 
Since the f° contribution is so small, and since there 
seem to be no resonances with />3 in the low-energy 
region , 27 it should be reasonable to neglect the con¬ 
tribution of terms with />3 in Eq. (73). Rearranging 
Eq. (69), we get 


i 


ds 


gr 2ttJ S— 

4 m n 2 i r ds 

gr 2 2wJ AMr* S—M r 2 


K°’°co 


3 


r (s-Msy -i 2 

L(*-4J/ r 2 )J 


2.2 


(*) 


+ 1.43—0.42—0.11> 0.9. (77) 


Thus, the pion-pion sunt rule can be satisfied only if there 
is a large low-energy 7=0, S-wave pion-pion scattering 
cross section. 

In order to get an idea of how big the 7=0, 5-wave 
scattering cross section would have to be in order to 
satisfy Eq. (77), we evaluated the left-hand side of 
Eq. (77) using a simple scattering-length parametriza- 
tion of the 7=0, 5-wave phase shift , 28 

(v/iv+M* 2 )) 1 * 2 cotS°*°= l/a 0 +H(v ), 

B(v)= {2/n)(v/{v+Mr 2 )Y 2 (78) 

Xln l{v/Mr 2 ) lt2 + (v/Mr*+ 1 ) 1 ' 2 ] , 

which gives 

4irao 2 

o* T 0,0 =-. (79) 

a^v+ (v-\-M r 2 )[_ 1+floff (y)3 2 

We find that Eq. (77) can be satisfied only if a 0 > 1-3 or 
if flo<—0.85. It is interesting that an 7=0, 5-wave 
scattering length of the order of a pion Compton 
wavelength is also suggested by studies of low-energy 
pion-nucleon scattering 29 and of Ku decays . 30 Needless 
to say, there is nothing unique about the parametriza- 
tion of Eq. (78). 


V. TESTS OF THE CURRENT ALGEBRA IN 
HIGH-ENERGY NEUTRINO REACTIONS 

The sum rules discussed in the preceding three sec¬ 
tions are derived from two principal hypotheses: the 


28 G. F. Chew and S. Mandelstam, Phys. Rev. 119, 467 (1960). 
n J. Hamilton, Strong Interactions and High Energy Physics — 
Scottish Universities * Summer School , 1963, edited by R. G. Moor- 
house (Plenum Press, New York, 1964). 

80 C. Kacser, P. Singer, and T. Truong, Phys. Rev. 137, B1605 
(1965). 

Sum Rules for the Axial-Vector Coupling Constant 
Renormalization in (5 Decay, Stephen L. Adler 
[Phys. Rev. 140, B736 (1965)]. In Eqs. (73) and 
(77), the coefficient of the isospin-2 cross section 
vj' 2 should be f rather than |. None of the conclu¬ 
sions of Sec. IV is changed. I wish to thank 
Dr. A. N. Kamal for pointing out this error. 


axial-vector current commutation relations of Eq. (7) 
and the partially conserved axial-vector current hy¬ 
pothesis of Eq. (5). In this section, we discuss a sum 
rule which follows from the axial-vector current algebra 
alone, regardless of whether PCAC is true. We will 
also derive sum rules which follow from a proposed 
algebra of the strangeness-changing currents. 

Let us begin by reviewing the theory of leptonic 
weak interactions of the hadrons. According to Gell- 
Mann 4 and to Cabibbo , 8 the hadronic weak current is 31 

J\ k = ($ix+i$2x+5v?+i$2\*)Gr cose 

+ fSFsx + + i&t\ l )Gv sin0. (80) 

Here Gv is the Fermi coupling constant and 6 is the 
Cabibbo angle. The vector currents 3vx and the axial 
currents Syx 6 ( 7 = 1, ■•■ > 8 ) each form an SU 3 octet. 
The SUz generalization of the conserved-vector-cur- 
rent (CVC) hypothesis is to assume that the vector 
currents 3yx are just the unitary spin currents, with 

ff.x=7 x », a= 1,2,3; (81) 

Ssx=!v3Fx, 

where 7x° is the isotopic spin current and Y\ is the 
hypercharge current. In our new notation, the currents 
defined in Sec. I are 

A Ao =ff 0 x 6 , o=l, 2,3. (82) 

Let us define vector and axial-vector “charges” J7y 
and Ffi according to 

F*=-i J d 3 x 3+ 6 . (83) 

Gell-Mann 4 has postulated that even in the presence of 
the SU 3 symmetry-breaking interaction, the following 
commutation relations hold exactly: 

[F t -,F ? ]= ifijkFk , 

iFiffl-ifakFk*, (84) 

The chirality commutation relation of Eq. (7) is, of 
course, just a special case of Eq. (84): 

lFi*+iF 2 \ 2V-tfV> 2 F*. (85) 

From Eq. (84), we also get the following commutation 
relation for the “charge” associated with the strange¬ 
ness changing part of J\ h : 

IFi+iFt+Ff+iFf, Ft-iFz+Ff-iF^ 

= 2^F,+2F 3 +2^Fa B +2F3 6 - (86) 

Assuming that we can integrate by parts with respect 
to the spatial variables x, we can express the time de¬ 
rivatives of the “charges” in terms of the divergences of 


n In this section, we use the notation of Ref. 4 for the currents. 
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the corresponding currents : 


zero (5=0) leads to the relations, for forward lepton, 




d z x d\5j \, 


d z x . 



Let us now derive sum rules which provide tests of 
the commutation relations of Eq. (85) and Eq. (86), 
considering first the strangeness-conserving case, Eq. 
(85). We proceed exactly as in Sec. II, taking the matrix 
element of Eq. (85) between proton states. The only 
difference is that we do not assume that the divergence 
dxffax 6 is proportional to the pion field. We thus get 
the sum rule 




f 

J Mr 


4 MnWdW 


Mtf+Mr (W*-M**y 


lN 9 -m-N p +mi, 


with 


( 88 ) 


N P HW)=Z 6(TF-M i )|ff / ±| 2 | ( ^ ) ^, 

INT 


p(q)) 

=aMN/gom i /q j o)) li2 ^. (89) 

In other words, ffy* is the matrix element of the di¬ 
vergence of the axial-vector current; the sum rule of 
Eq. (88) involves this matrix element only at zero 
four-momentum transfer (y— qjf. 

The matrix element needed to evaluate the right- 
hand side of Eq. (88) can be directly measured in high- 
energy neutrino reactions. Consider the inelastic 
reaction 

vi+N->l+j, (90) 

with vi a neutrino, l a lepton, N a nucleon, and j a 
system of strongly interacting particles with 
In a previous paper, 82 we showed that when the lepton 
emerges parallel to the incident neutrino direction, and 
when the lepton mass is neglected, the matrix element 
for Eq. (90) depends only on the divergences of the 
hadronic current. Clearly, under these hypotheses the 
momentum transfer (g—gj) 2 is zero, so we are measuring 
just the matrix element needed in Eq. (88). (In the 
A5=0 case, the divergence of the vector current 
vanishes.) Slimming over final states j of strangeness 


« S. L. Adler, Phys. Rev. 135, B963 (1964). 


dV[V+p—+( 5 = 0 )] 


dSlidEi 
<P<x[v+p->l++(S= 0)] 


=Gy* cos*0f(W)N P + (W ), 


(91) 


with 


d&rdEi 


=Gv i cos 2 0f(W)N p -(W), 


1 vM^+IMnE— 


2ir 2 L 


rr. 


(92) 


Here E is the incident-neutrino energy, Ei is the final- 
lepton energy, and 8* is the lepton solid angle (all in 
the laboratory frame, where the initial proton is at 
rest). In terms of W and E , E x is given by 

E t = (Mjy+2MNE-W^)/(2MN) - (93) 

We can apply the same method to the commutator 
of the strangeness-changing currents 83 [Eq. (86)]], giv¬ 
ing the two sum rules 


4= 

2 = 


/ 

/ 


4 M N WdW 

4 M N WdW 

(yp-Mtyy 


is p -(w)-s+mi , 

iSn-m-SnHW)']. 


(94a) 

(94b) 


Equation (94a) has discrete contributions at W=Ma, 
W=Mx and a continuum from W=M,+M a to 
Equation (94b) has a discrete contribution at W=Mz 
and a continuum from W=M x +Ma to «. The func¬ 
tions S Pt n ± are measurable in strangeness-changing 
high-energy neutrino reactions, since for forward lepton, 

dV[>+ l'+ (5= +1)] 


dQjdEi 

=Gv 2 sin 2 0/(IV)S< p , fl) +(WO, (95) 

<P<r[v+ (j>,n) —> Z++ (S= — 1)]] 
dsijdEi 

=Gy2sm*6J(W)S iv .nr(W)- 

Thus, Eqs. (88), (91), (94), and (95) can be used to 
directly test the algebra proposed by Gell-Mann for 
the vector and the axial-vector currents. 
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a u The nucleon matrix element of the axial-vector terms on the 
right-hand side of Eq. (86) vanishes when we average over 
nucleon spin. 


Sum Rules for the Axial-Vector Coupling Constant Renormalization in § Decay, Stephen L. Adler 
[Phys. Rev. 140, B736 (1965); 149, 1294(E) (1966)]. 

1. In the first line of Eq. (62), Mr should read ( M x i,f ) 2 . In Eq. (65), /ijT fl (JV,0,0) should read 
fiji B (Wfi,Mr)‘ I wish to thank G. E. Brown, A. M. Green, B. H. J. McKellar, and R. Rajaraman for 
pointing out these errors. 

2. A factor of |k|/|k°| was omitted in Eqs. (72), (73), and (77). Equation (72) should read 

o*6T ,,r (5) = (|k|/|k°| )i^ JVJVr (0) 2 (|k°|/|k|) 2I o-, I ' 7 (j), 

and Eqs. (73) and (77) are corrected by making the substitution ds —» (|k|/|k°| )ds . Making the correc¬ 
tion increases the magnitude of the scattering length a Q required to saturate the sum rule. 
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We show that the local commutation relations of the vector and the axial-vector current octets can be 
studied in nonforward lepton-neutrino reactions. We do this by using the commutation relations to derive 
sum rules, for fixed q 2 (g 2 *invariant lepton momentum transfer squared), involving the elastic and the 
inelastic form factors measured in high-energy neutrino reactions. 


1. INTRODUCTION 


I T has recently been proposed by Gell-Mann 1 that the 
fourth components of the vector and axial-vector 
current octets satisfy the local equal-time commutation 
relations 


| * 0 — fabc'Fc4(%)8( i X y) , (1&) 

C^'a4(^))9 : &4 B (y)I|| xo—Vo = fabe^ei 5 (x)8(x y) , (lb) 
CSFrf 6 (*),SF»4 5 (y)] | *o-i-o= ~ f°bc5c t (x)S(x-y). (lc) 


Here $ a \ and are, respectively, the octet vector, 
and axial-vector currents, and a , b , c are unitary spin 
indices running from 1 to 8, According to Eq. (1), the 
octet vector and axial-vector charges 





d 9 x ff«4(x,0, 


d*x ff«4 6 (x,0, 



In addition to Eq. (1) for the fourth components of 
the current octets, let us postulate that the space com¬ 
ponents of the octets satisfy the local equal-time com¬ 
mutation relations 

l> an *o—vo 

= 8 nm f a bcVe4 l (x)8(x—y)+Sab l , (4a) 

{E^a«0*')>^&»n B (y)II| *o—Vo‘^“C^ r an B ( ; ^) > fi : 6t»(y)I| | Xq—Vo} 

= —28 nm fabeGLu(x)8(x—y)+S a b 2 j (4b) 

0^®n B (#)»^6» B (y)] I xo—VO 

= 8nmfobcVrt 2 (x)8(x—y)+S a b 3 - (4c) 

Here and V c a z are the fourth components of vector- 
current octets, and Qa is similarly the fourth component 
of an axial-vector octet. The quantities S a b 1,2,s are sym¬ 
metric in the unitary spin indices a and b . If the simple 
quark-model commutation relations proposed by Dashen 
and Gell-Mann 3 and by Lee 4 are valid, we have 

•Uc4 1 ='0c 4 2 =^4, a e 4=0W*. (5) 


satisfy the equal-time commutation relations 

lF a (t),F h (t)>ifabJ?c(t), 

lF*(t),F b *m=if a bcFc 5 (t), (3) 

The commutation relations of Eq. (1) are considerably 
more restrictive than those of Eq, (3), since even if 
derivatives of the delta function were present on the 
right-hand side of Eq. (1), Eq, (3) would still be valid. 
In an earlier paper 2 [hereafter referred to as (I)] we 
showed that the commutation relations of Eq, (3) can 
be tested in high-energy inelastic neutrino reactions, in 
which the lepton (which is regarded as massless) emerges 
moving parallel to the direction of the incident neutrino. 
In other words, Eq. (3) may be tested in g 2 =0 neutrino 
reactions, where q 2 is the invariant momentum transfer 
between the neutrino and the outgoing lepton. In this 
paper we generalize the results of (I), by showing that 
the local commutation relations of Eq. (1) can be tested 
in g 2 >0 (nonforward lepton) neutrino reactions. We do 
this by deriving from Eq. (1) a sum rule, valid for each 
fixed q 2 , involving quantities measurable in high-energy 
neutrino reactions. 

* Junior Fellow, Society of Fellows, 

1 M. Gell-Mann, Physics I, 63 (1964). 

2 S. L. Adler, Phys. Rev. 140, B736 (1965). 
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However, Eq. (5) is not valid in theories in which meson 
fields are explicitly included in the currents, whereas, in 
many of these field theories, Eq. (4) still holds. We will 
derive sum rules which provide tests of Eq. (4) in g 2 >0 
neutrino reactions. 

Each of the sum rules discussed in this paper requires 
for its derivation, in addition to a local equal-time com¬ 
mutation relation, the assumption that a certain scatter¬ 
ing amplitude obeys an unsubtracted dispersion relation 
in the energy variable, for fixed g 2 . No attempt will be 
made in this paper to justify the assumption of unsub¬ 
tracted dispersion relations. Thus, the statement made 
in this paper is that if the assumption of unsubtracted 
dispersion relations is valid, the sum rules derived 
provide a direct experimental test of local equal-time 
commutation relations. 

In Sec. 2 we state in detail the results of the paper. 
The next two sections comprise the derivation. In Sec. 3 
we analyze the kinematics of high-energy neutrino re¬ 
actions. In Sec. 4 we derive, from local commutation 
relations, sum rules which involve the quantities defined 
in the kinematic analysis of Sec. 3. In an Appendix we 
give lepton-mass corrections to the results stated in 
Sec, 2. 

* R. F. Dashen and M. Gell-Mann, Phys. Letters 17, 142 (1965). 

4 B. W. Lee, Phys. Rev. Letters 14, 676 (1965). 
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2. RESULTS 

We consider the high-energy neutrino reaction 

v+N->l+(3, (6) 

where v is a neutrino, N is a nucleon (neutron or proton), 
l is an electron or muon, and 0 is a system of strongly 
interacting particles. Throughout the text of this paper, 
we will neglect the final lepton mass, i.e., we take 

tni~ 0. (7) 

The results stated below are only slightly modified when 
all lepton-mass terms are included. (See the Appendix.) 
We define all noninvariant quantities referring to the 
reaction of Eq. (6) in the laboratory frame , in which the 
nucleon N is at rest: 

Ep =neutrino energy, 

E\= lepton energy, 

lepton-neutrino scattering angle, 

Sit=lepton solid angle, (8a) 

k ,=neutrino four-momentum, 
ki =lepton four-momentum, 
q=k,—kt =lepton four-momentum transfer, 


We denote by W the invariant mass of the system 0, 
by Mn the nucleon mass, and by q 2 the invariant 
momentum transfer between the leptons: 

q 2 = (ky-h) 2 = 4 EyEt sin 2 (tf>/2), 

( 8 b) 

W= [2M N (Ey-Ei)+M N 2 -q 2 2 112 . 

We assume that the semileptonic weak interactions 
are described by the current-current effective Lagrangian 
density 

£(x) = (G/vZ)./x(*)A(*)+arijoint, 

G= 1.023 X10~ 6 /M;y 2 , 

/*(*)“#i(*)7x(l+T5)lM*)» (9) 

/ x (^)=(cos0c)[^ixW+i9 : 2x(^)+9 : ix 6 (^)+j9 : 2x 6 (^)I| 

+ (sin0c)C^4x(^)+i9 : 5x(^)+9 : 4 x 6 (^)+j9 : 5X 6 (^) ], 

0c=Cabibbo angle. 

We define the form factors Ei v (q 2 ) t F 2 v (q 2 ) t gv(q 2 ), 
gA(q 2 ) , and /u(? 2 ), which describe elastic neutrino 
reactions, as follows: 


(N(p 2 ) \ 5i\(0)+i3i\(0)\N(Pi)) = ((MN/p2o)(MN/pio)y i2 iuN(p2)T + ZFi v (q 2 )yx—F 2 v (q 2 )o'\ v q v }uN(Pi) 

= {(M N /p2o)(MN/pio)y i2 iuN(p2)r + ^gv(q 2 )y\+iF 2 v (q 2 )(pi+p 2 )f}^N(pi) , 
q=p 2 —pi, gv(q 2 )—Fi v (q 2 )+2MNF 2 v (q 2 ), 

(N(p 2 ) | ffix 6 (0)+iff2x 6 (0)| N(pi)) = ((MN/p 2 <i)(MN/piQ)y i2 iuN(j> 2 )T + [gA(q 2 )y\—ihA(q 2 )q\]yjfliN(pi) • 


( 10 ) 


Here r + denotes i(r l +ir s ), with |r c (c=l, 2, 3) the nucleon isotopic spin matrices. 

Finally, we define the diagonal one-nucleon matrix elements of the operators VeA 1,2 appearing in Eq. (4) as follows 


W|U C 4 1 ‘ 2 (0)|^))=.C / L2(i r c) > c =i, 2 , 3; 

(N(p)\Vu l ' 2 (0)\N(p))=iC Y l > 2 . 

If the quark-model commutation relations hold, so that Eq. (5) is valid, then 

Cr 1,2 = 1, Cf 1 - 2 =|^. 


(ID 


( 12 ) 


If the quark-model commutation relations are not valid, the values of C/ 1,2 and Cy 1 * 2 are not at present known. 
We may now state the results of this paper, as follows. 

Strangeness-Conserving Case 

The kinematic analysis of Sec. 3 shows that we may write the reaction differential cross section in the form 

G 2 cos 2 0c Ei 


d 2 <r 


( 0 + ^ 0 +ws - o) )/ 


d£lvdEi= 


(2t) 2 Ey 

XC^« <±> (^W0+ 2 ^Ei cos a (^)/J<±>( ff a ,RO^(^+-Ei)ff*Y <±) (ff a ,W r )D - (13) 

By measuring d 2 <x/dQidEi for various values of the neutrino energy E Vf the lepton energy E t) and the lepton- 
neutrino angle we can determine the form factors a (±) , j0 (±) , and ? (±) for all ^ 2 >0 and for all W above threshold. 
In Sec. 4 we prove that: 

(i) the local commutation relations of Eq. (la) and Eq. (lc) imply 

r W 

2=gA(?T+^iV) 2 +? 2 ^(? 2 ) 2 + / — dw[_a^(q\w)-^)( q \w)y, 


(14) 




RIO 
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(ii) the local commutation relations of Eq. (4a) and Eq. (4c) imply 


W 


C^+C/Ml+gV^* 2 ^^ f - dW[a<-)(q*,W)-aW(q* t W)l; (15) 

J MN+Mt Mn 


(iii) the local commutation relation of Eq. (4b) implies 

gv(q 2 )gA(q 2 ) r W 


Mn 


r w 

= / - dWl^~\q\W)-y^{q\W)']. 

J Mji+Mr Mn 


(16) 


We write 


Strangeness-Changing Case 


MO“-ocj)/«=Tr' 


(2t) 2 E, 

XC? 2 “(p.») (±) (q\W)+2E,Ei cos 2 (^)/3 (j> .„) (±) (q\W)^(E,+Ei)cfy^ n) <±>( ? 2 ,^)] . 


Then, 

(i) the local commutation relations of Eq. (la) and Eq. (1c) imply 

W 


(17) 


f W 

(4,2)= / —^D3 CP , n) ^(?W-0( P , n /+)( ? 2 ,^)]; 
J Mn 


(18) 


(ii) the local commutation relations of Eq. (4a) and Eq. (4c) imply 

W 
M n 


r w 

C^(Cy 1 +Cy 2 )+i(Cr 1 +C^),^(Cy 1 +Cy 2 )-KCz 1 +Cz 2 )]= / -d^[a Cp , n )^( ? W-a Cp , n) ^)( ? V^)]; (19) 

J Mn 


(iii) the local commutation relation of Eq. (4b) implies 

f W 

(0,0)= / -^C7c P( .) t “ ) (? 2 ) ^)-7c P .n ) < + )( ? W]. (20) 

J Mn 

The integrals of Eqs. (18)-(20) have discrete contributions at W=M a and/or M s and a continuum extending from 
W=Ma+M, or from W=Ms+M T toW=<x>. We have not explicitly separated off the discrete contributions to 
the integrals, as was done in Eqs. (14)-(16) for the strangeness-conserving case. It would, of course, be straight¬ 
forward to do this. 

The sum rules of Eqs. (14)-(16) and (18)-(20) hold for each fixed q 2 , provided, as was stated in Sec. 1, that the 
assumption of an unsubtracted dispersion relation needed to derive each sum rule is valid. When q 2 = 0, Eqs. (41) 
and (43) of the next section show that 

P(0,W) = m N 2 /(W*-MN*y) E E &(k(to+Ei-E y -MN){\(P\dxJx v \N)\ 2 +\{p\dxJx A \N)\ 2 }, (21) 

0,INT $ 


where J\ v and J\ A are the vector and axial-vector weak currents appropriate to the AS=0 or \AS\ =1 cases 
(e.g., J\ v = JFix+i^x or S^x+i^sx). Thus, at q 2 =0 Eqs. (14) and (18) are just the forward lepton sum rules derived 
in (I). 

The sum rule on 0 has an interesting consequence for the behavior of neutrino cross sections in the limit of very 
large neutrino energy E v . With the aid of Eq. (8), let us write Eqs. (13) and (14) in the form 


dV 



+0(5=0) 



G 2 cos 2 6c 

-[? 2 a (±) +(2E, 2 -2^ o -k 2 )0 (±) ^(2^'?oV7 (±) ], 

4t rE, 2 


2= d? o (0<->-0< +) ). 

J (q*/2 ATjv)- 


The differential cross section da/d(q 2 ) is given by 


d<r 

dq 2 


dqo - 

d(q 2 )dqo 


( 22 ) 

(23) 


( 24 ) 
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The upper limi t of integration is fixed by the requirement that sin 2 (tf>/2) lie between 0 and 1. Using Eqs. (22)-(24), 
it is straightforward to prove the following theorem: 

Theorem . Suppose that the integrals 


r° dq 0 r r 

/ -( a H-aW), / H7 w +7 (+) ), / dq o (0^-0^) 
J On 2 J On J 


(25) 


qo* J qo 

are convergent. Then 

lim [dafy+P -> l+P(S=Q))/d(q*) -do{v+p-> l+p(S= 0))/d( ? 2 )} 
e 9 -*°o 

G 2 cos 2 0c 

2t 


f 


G 2 cos 2 6c 
d?oD3<->-/? (+) ]=- 


(26) 


Similar results hold in the strangeness-changing case. Adding the cross sections for the AS=0 and the | AS \ = 1 
cases to obtain the total cross section, we find 

lim C do- T (v+p)/d(q 2 )—do-T(v+p)/d(q 2 )2= (G 2 /t)(cos 2 0 c +2 sin 2 0 c ), 

(27) 

lim [ do-T(v+n)/d(q 2 )—do-T(v+n)/d(q 2 )2=(G 2 /ir)(— cos 2 0 c +sin 2 0 c )• 

Ey+9 0 

Equation (27) is the somewhat surprising statement that, in the limit of large neutrino energy, dar(v+N)/d(q 2 ) 
— d<r T (v+N)/d(q 2 ) becomes independent of q 2 * This result is unchanged by the lepton-mass corrections. 


3. KINEMATIC ANALYSIS OF HIGH-ENERGY NEUTRINO REACTIONS 


In this Section we derive Eq, (13) ,which gives the general form for the neutrino reaction leptonic differential 
cross section, d 2 <x/dQidEi . 6 In particular, we find explicit expressions for the form factors a(q 2 ,W) f 0(q 2 ,W) f and 
y(? 2 »^)» in terms of matrix elements of the vector and the axial-vector currents. 

According to the effective Lagrangian of Eq. (9), the matrix element 3ft for the process v+N —> 1+0 is given by 

31 l=gm , m=w I (^ I )7x(l+75K(W2“ 1 / 2 (/S ont (^^) \Jx v +Jx A \N(kx )). (28) 

Hereg=(G cos0c, G sin0 c ) in (AS=0, | A5| = 1) reactions, J\ v and/x A are the appropriate vector and axial-vector 
currents, and kp and kx are, respectively, the four-momenta of 0 and of N. In the frame in which the initial nucleon 
N is at rest, the reaction cross section is given by 


<r 



d'kp 

- E 

(2t) 3 /mnt 


E 


b(kfr\"ki — k v —kx) 





In Eq. (29), E/s.int is a sum over the internal variables of the system 0, E* is an average over the initial nucleon 
spin, and (| m | 2 ) is the sum of | m \ 2 over the lepton spin states. From Eq. (29) we get 


with 


dfydQMi= & 2 /(2tt) 2 ](£ i /E,)^ , 

k= E E Bikpo+Ei-E,-M N )tnitn,(\tn \ 2 )\^. 
0.INT • 


(30) 

(31) 


Let us now study the quantity k. We introduce the abbreviated notation 

e\ = 2- l/2 ui(ki)y\(l+y&)uy(k v ), 

Vy ?= i(qo+Mx)2\Jx v \N(fl,Ufx )), 

A =flfl«*[q, i(qo+Mx )]| Jx A \ N(0,iMx)), 
E/j= E E 5(^o+E/— E r —Mx) - 

0,INT • 


Let us further denote by Vjf and by Arf the matrix elements of the divergences of the vector and the axial-vector 
currents, 


PV= - iqxVx*- <^° ut [q, i(qo+M N )l | d x JS\N(Q t iMx)) , 
A d » -(^° ut [q, »(?o+^)] I dxJx* | (0,Uf »)>. 


6 Locality theorems of this type are, of course, well known. See, for example, T. D. Lee and C. N. Yang, Phys. Rev. 126, 2239 
(1962) j A. Pais, Phys. Rev. Letters 9, 117 (1962). 
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Since the final lepton mass is neglected, we have 
Using Eqs. (33) and (34), we may write 


?xtfx=0. 


m=ei(V\ t3 +A\P) = e n (5 nk —q n q k /q o 2 )(Vk f *+A k ll )+i(q-e/qo 2 )(VD (i +A D 0) t 
where the repeated indices n and k are summed from 1 to 3. Defining t nm by 

tnm = {enem*)yUM'l=(kp)n(kl) m -)-(kl)„(kp.) m — k l ,*kl& nm -)-€ nm t 1 t (kr)t(kl ) n9 

we find that 


(34) 

(35) 

(36) 


K = E/9 WiWv(|w| 2 )| kg _ q 

+Mm«/?o 4 ){E/j| VuP\ s +EpMd'I 2 +Hel(A D ^V D p +(V^)*A D ^} 

+W5 n *-?™?*/?o 2 )(W?o 2 ){Ep[(W^^ 

The next step is to use the transformation properties of the currents under time reversal and parity to determine 
the form of the various Ep terms in Eq. (37). Denoting by T and by P the time-reversal and parity operators, 
respectively, we have 


TJ k v (0)T- l =-J k v (0), TJ k A (0)T- l =-J k A (0) t PJ k v (0)P~ 1 =~Jk v (0 ), PJ k A (0)I>-'=J k A (0) , (38) 

and similarly for the divergences of the currents. Under the assumption that the “in” and “out” states of definite 
total energy each form a complete basis for states of that energy, we have 


E S(k&+E l -E,-M lf ) | /S-WXr'W | = E Kktn+Ei-E,-M N )\PTp™\k e )){PT0°*'{k i! )\, (39a) 

/9.INT /S.INT 


and 


EI N(k N ))(N(k N ) I =E \PTN(k N ))(PTN(kx) I. (39b) 


Using Eqs. (38) and (39) we find that 

/9,INT » 

/9,INT » 

=E* Vffy/r =[Ep MW?. (40) 


Thus, the tensor Ep F/(F/)* is rca/, and hence sym¬ 
metric, Using P alone shows that this tensor is an even 
function of q. A similar analysis can be carried through 
for each of the Ep terms in Eq. (37), with the following 
results: 

(i) 'EffVfiVf)* and E^**W)* are real sym¬ 
metric tensors (even under q— ► — q); 

(ii) Ee [V#(Af)*+A /(F/)*] is an imaginary, anti¬ 
symmetric pseudotensor (odd under q— > — q); 

(iii) E^l Vd 0 \ 2 and EpMd*I 2 are real scalars; 

(iv) HeLVDKAD p )*+A D KVi> p )*l is ** imaginary 
pseudoscalar; 

(v) Ed [^(F fl 3)*- (F^)*F fl 3] and Ed 

— (Ak p )*AD p l are imaginary vectors; 

(v0 Ed [VAAd?)*-(Y/)*A D r\ and Ed [A AVd?)* 

— (^/)*Ffl fl ] are imaginary pseudo vectors. 

All of these quantities must be formed from the one 
vector available, q. Thus the only possible tensors are 
& tj and q t qj and the only pseudotensor is No 


pseudovectors or pseudoscalars can be formed. Con¬ 
sequently, the most general from of the quantities 
appearing in Eq. (37) is 

Ed( WF/= hA^m+q^V^W) , 

Ed(^/) *A k *= &ikAl(q*,W) +qjq k A,(q\W) , 

Ed UAWVf+QrWA^utivjfaW) , 

'L'W\*-Dt(?,W), 

'Lfs\A D ‘ , \*=D/i(q 2 ,W), 

Ed UyWVrf-WDWft-iqJytfJV) , C ) 

Ed lUWAo! 1 - (AA)*A k ^=iq k I A {q\W ), 

Ed [(^i/)*lV+(Fz/)Mi/] =0, 

Ed L(V k e)*A D i>-(A D e)*V k ei = 0, 

Ed L(AA*VA-(VA)*AA=0, 

with all the structure functions [Fj, F 2 eta] in Eq. (41) 
real. 
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All that remains now is to evaluate the tensor con¬ 
tractions contained in Eq. (37). Using the equations 

q n (8 n k-q n qk/qo 2 ) = — (gVtfo 2 )?*, 

8 nm t nm =q 2 +2E,Ei cos s (*/2), 
c n miqitnm=iq 2 (Ey+Ei ), 

we get, by some straightforward algebra, the result 

dfydQ4Ei=\j>*/(2*)*}(Ei/E,)K , 

K=q 2 a(q 2 f W)+2E v E l cos 

-qHE'+EfiyifrW) , 
a(q 2 i W)=Vi(q\W)+A l (q\W) t 

(43) 

+ (q 2 ) 2 LV 2 (q*,W)+A*(q*,W)l 
+q t Uv(f,W)+I A (q\W)l+Dv(q t ,W) 
+D A (q^W)}*M N 2 /(W 2 -M N 2 +q 2 ) 2 , 
y(q\W) = I( Q \W). 

The formula for antineutrino-induced reactions is the 
same, except that the final term in k is changed to 
+q 2 (E*+Ei}y(q 2 y W) [and, of course, in Eq. (32) de¬ 
fining Vi and Ai t the currents J k v and Jk A are replaced 
by their adjoints]. 

The simplest illustration of our result is the elastic 
reaction v+N Explicit calculation shows that 

dV(P+/>— >l-\-n)/dQvdEi has the form of Eq. (13), 
with 


4. DERIVATION OF THE SUM RULES 

In this Section we derive the sum rules of Sec. 2. In 
the first subsection we state and discuss the fundamental 
identity used in the derivations. In subsequent sub¬ 
sections we derive Eqs. (14), (15), and (16). The deriva¬ 
tions for the strangeness-changing case are identical to 
those for the strangeness-conserving case, and are 
omitted. 


(A) Fundamental Identity 

The starting point of the derivations is the identity 6 


- [ dt e i ^V|[4( i ) ) ^(0)]|Ar> 
qo Jo 

- i(N | \_A (0),fi(0)] | N) 

+(2?o)- 1 (tf | [A(0),.B(0)]+[i)(0)„4 (0)] | N) 


+qo[ dte«*‘(X\lA(t),B(0)l\N), 
Jo 


where 


dA(t) dBU) 

A(t)= -, B(t) = 


dt 


dt 


(46) 

(47) 


are the time derivatives of ^4(0 and B(t). Equation (46) 
is easily derived by repeated integration by parts, and 
holds for all q 0 in the upper half of the complex plane. 
In this paper, the operators A(t) and B(t) will always 
be of the form 


a <~Kq\w) = liw-M^ux+qyw^g^y 

+(?V4ilf*V(? 2 ) 2 ], 

fH-\q\W) = l{W-M li )[ jU (q*y (44) 

+7W) 8 +? 2 F*W], 

T W (flW = KW-M N )l-g A (q^ v (q^/M N -] . 

We have also computed, for this reaction, the individual 
structure functions appearing in Eq. (41). They are 

VS->(q*,W) = S(W-Ms)(q s /4Ms s )gv(q 2 ) 2 , 

Vy~Hq\W) = S(W- Mw){[1+? 2 /4Mw 2 ]/v(<7 2 ) 2 

-gv(? 2 )Mg*)/AM, 

Ay-Kq 2 ,W) = S(W-Ms)(l+q 2 /^Ms s )g A (q 2 ) 2 , 
Ay-)( q \w)=b(w- m n ) i( q yw N i)h A (?*) 2 

-h A (q 2 )g A (q 2 )/M N '], 

/(-)( ? 2 ,W) = 5 (W-Mw)[-gA(? 2 )gv(? 2 )/M K ], ( ' 

IS~) (q 2 ,W) = 8(W- Ms) (-1/2 Ms 2 ) 

X[2 MsgJi(q 2 )—q 2 h A (q 2 )y , 
D A ^(q\W) = b(W-M N ){ q y±M N 2 ) 

X [2 Msg^q 2 ) — q % h A (? 2 )] 2 , 
lv ( -> (q 2 ,W) =ZV~> (q 2 ,W)= 0. 


-•/ 


d 3 x , 


/ d s y e iB '7$ B (y,t); 



$a = $ a \ or ffax 6 , $ B = $b a or . 

In (I) we studied Eq. (46) with s=0; this led, in the 
limit q<>—> 0, to sum rules at q 2 = 0. In this paper we will 
study the case when s^O, and will find, in the limit as 
q 0 —» 0, sum rules for fixed q 2 (with q 2 = | s 2 ). 

There are a number of features which all of the 
derivations given below have in common. First of all, 
we will always use Eq. (46) with the nucleon N at rest, 
and with the nucleon spin averaged over. Secondly, 
each term of Eq. (46) can be divided into a part which 
is symmetric and a part which is antisymmetric in the 
unitary spin indices a and b . We will only study the 
identity for the antisymmetric parts . In each case below, 
we will find that the term 

U = (2q a y\N I [i (0), B (0) ]+ £8(0),A (0)] [ N) (49) 

8 Equation (46) is a more symmetrical version of Eq. (37) of 
Ref. 2. Equation (46) remains valid if (iV| and \N) are replaced 
by any two states of equal four-momentum. * 
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is purely symmetric in the unitary spin indices, and 
thus makes no contribution. Thirdly, since we have 

goj dte^‘(N\iA(t),B(0)3\lV) 


= -iqo E -—--—-- 

AINT iqo+MN—d&li+Mfi 2 ) 1 ** 
W0>Mn) 11 p/ 


(N\S B \0mSA\N) 

qo+Q&^+MflW-Mx 


(2tt) 3 5(0) , (50) 


the limit as q 0 0 of Eq. (50) is zero for all [ s [ 2 > 0. 
As a result, the third term on the right-hand side of 
Eq. (46) makes no contribution to the sum rules. 7 

Finally, we will always find that the unitary spin- 
antisymmetric part of 

f dt *(N | [A(0,B(0)] | N) (51) 

J o 

is an odd function of qo, O(qo,q 2 )- Thus, in the limit as 
qo —* 0 the identity of Eq. (46) will become the equation 


- O(qo,q 2 ) 

dqo 


(52) 


flO=*0 


where C is the unitary spin-antisymmetric part of the 
commutator — | [[^4(0),^(0)] 12V^). Equation (52) 
states that the commutator of A and B is related to the 
energy derivative of a forward scattering amplitude, 
evaluated at zero energy. Up to this point the derivation 
is rigorous . Now, in order to relate the left-hand side 
of Eq. (52) to physically measurable quantities, we will 
assume that the energy derivative (d/dqo)0(qo,q 2 ) 
satisfies an unsubtracted dispersion relation in the energy 
variable qo, for fixed q 2 . The discontinuity of (d/dqo) 
XO(qojq 2 ) across its cuts will, in each case considered, 
be related to the structure functions defined in Eq. (41). 

(B) Sum Rule for $ (± > 

The sum rule on jS (±) of Eq. (14) is obtained by adding 
together two separately derived sum rules on the axial- 
vector and the vector parts of /3 (±) , fri c±) , and £y (±) : 


■=gA(q 2 ) 2 + J 


-dW 


Mn+Mr Mn 

XDfc w (^-fr< +) (3W], (53a) 


r w 

Fx v (q 2 Y+q 2 Fi v (q 2 ) 2 + / - dW 

J Af»+Af, Mn 


X|><->(?W-/M +) (?W]. (53b) 

In terms of the structure functions defined in Eq. (41), 

/3/±> i (± >(? 2 ,W0+( ? 2 )M 2 <±> (q\W) 

+jV* (± > + D a <±> (q\ W)3 

X42/JV 2 / (IF 2 —MjvH-j 2 ) 2 , (54) 
/3v (±i (? 2 ,^) = ? 2 [Ei (± > (q\W)+qWY ±) (q 2 ,W)~] 

XAM^/iW^-M^+q 2 ) 2 . 

[The structure functions /y (±) (j 2 ,PE) and D v ^(q 2 ,w) 
vanish identically in the strangeness-conserving case, 
because of conservation of the vector current.] Since the 
derivations of Eqs. (53a) and (53b) are identical, we 
will treat explicitly only the axial-vector case, Eq. (53a). 

We start from the fundamental identity of Eq. (46), 
taking 


A(t) 


B(t) 


— — i J d z x e~ ix 
= — i J d z y e iut 


d z x ^~* s * x 9' 0 4 6 (x,/) , 


(55) 


7 SFM 5 (y,0 - 


Defining D a (x) = d\5 a \ b (x) we find, by spatial integra¬ 
tion by parts, that 


A(t) = jd z x tf“ ft,x [Z)fl(x,/)—w»SF flIl 5 (x,f)], 


(56) 


B(t)=Jd 3 y e is -^D b (y,l)+is^b^(y,l)3, 


where the repeated index n is summed over. With A and 
B as shown in Eq. (55), the first term on the right-hand 
side of Eq. (46) becomes, using the local commutation 
relation of Eq. (lc), 

-tE(^|[^(0),5(0)]|^)= €a&c (M(27r) 3 5(0). (57) 

a 

Thus this term is purely antisymmetric in the isospin 
indices a and b. [Note that the validity of Eq. (57) 
depends on the correctness of the local commutation 
relation. If Eq. (lc) were modified by the addition of a 
term proportional to V 2 5(x—y), a term proportional to 
| s | 2 would be added to Eq. (57).] The second term on 
the right-hand side of Eq. (46) becomes 


f d z x 

2 On l ® U 


asF«4 5 (x,f) 


, J <fyc‘-TSF M '(y,o]|fV) 

+EM U d z y e is * 


. d&M s (y,t) 


■/ 


d z x tf^* ,x JFo4 6 (x 


, 0 >>) 


(58a) 


7 Only when |s| 2 =0 does the one-nucleon intermediate state (Afp-Afw) make a contribution to the limit. This is the case con¬ 
sidered in Ref. 2. 
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-1 

2?o 


£< 4 / 


d5 a f(x,t) r “I 

d z xe~ iBtx -, I d?y e x *' y $uK y,0 | N) 


dt 






d$b4 6 (x,t) 

dt 


, (58b) 


where we have obtained Eq. (58b) by setting — y<-> x in the second term of Eq. (58a) and by using the parity 
transformation properties of the axial-vector current Clearly Ui ah is explicitly symmetric in a and b. Thus, if we 
agree to keep only the part of Eq. (46) which is antisymmetric in a and b, the second term on the right-hand 
side of Eq. (46), which involves the unknown commutator of dtfa^/dt with JF& 4 6 , drops out. As discussed above, 
the limit as £ 0 —» 0 of the third term on the right-hand side of Eq. (46) vanishes. 

Now let us turn to the left-hand ride of Eq. (46). Using translational invariance, the integral over y can be done 
explicitly, giving an over-all factor of (2 tt) 3 $(0). We cancel this against the identical factor in Eq. (57). Taking N 
to be a proton at rest, and multiplying Eq. (46) by an over-all factor e a &s gives 


1 


_- fd 4 xexp(—i^x)e(x 0 )J^(p\ZDa(x)—is n ^ an 6 (x) 1 Z)&(0)+^ m 3 : &m 6 (0)]|/>)+o(^o), (59a) 

qo J * 

i=(V?o)) (59b) 

where o(q 0 ) indicates terms which vanish as qo — >0. Let us define the amplitudes d(q^q 2 )j ai(qo 3 q 2 )j ^(tfo,? 2 ), and 
Li(<7o,£ 2 ) by the equations 

d(qojq 2 )=eabs j d*x e- iq ' x d(x 0 )Y,(p\[p a (x),D b (O)]ip) , 

^i(qo 1 q 2 )b nm +a 2 (qo J q 2 )q n q m =^ab 3 jd*x e~ iq ' x 0(x^£,^p | [9\ ln 6 (:r),9^, B 6 (0)]|/>) , (60) 

iq n iA(qo,q 2 ) = eab3 j d*x €~ iq ' x B(xo)YL{p | [JFan 6 ^),/)^)]— [Z) a (#),9^ n 6 (0)]|/>)- 

We will prove below that these are all odd functions of qo. Thus Eq. (59), in the limit as q 0 — >0, becomes 
the statement 

d 

1 =— \(q 0j q 2 ) , 

oqo flo=o (61) 

x (?o, ? 2 )=%o, ? 2 )+? 2 ai (^ 

with q 2 fixed at | s | 2 . 

Let us now study the properties of the functions d 7 a\, a 2j and i A . From their definitions as retarded commutators, 
it follows by the standard methods of forward dispersion relations 8 that they are analytic functions of qo in the 
upper half q 0 plane, for fixed q 2 . Thus if we assume that the amplitude (d/ dqo)\(q 0 ,q 2 ) approaches zero as qo —■► <» 
in the upper half plane, we can write the unsubtracted dispersion relation 

dqo 


-A(?o,? 2 ) 


1 r 00 

7T ./ —oo 


dqo " " 7r./_« (qo—qo) 2 

where the absorptive parts d\ at, a 2l u are defined by 


{^'(^0 , ,^ 2 ) + ^ 2 ^l'(^0 , ,^ 2 )+(^ 2 ) 2 a2 , (^0',^ 2 )+^ 2 tU , (^0',^ 2 )} , 


(62) 


id'(q a ,q 2 )=h*H Jd*x lD a (x),D b (0)l I P), 

»[ a l , (?0,? 2 )5nm+a2 , (?0,? 2 )?r l ?m]=|ea63 jd*X <r»V* ^{p \ [5av. 6 (x), ff!m> 6 (0)] | p) , 

iZiqJAqo,q 2 )l= J d*x T.(p | [ff^^ZMO)]- [_D a (x), ff ln 5 (0)] | p). 


(63) 


8 J- D. Jackson, Dispersion Relations, edited by G. R. Screaton (Interscience Publishers, Inc., New York, 1961), pp. 1-32. 
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The next step is to evaluate the absorptive parts. Let us consider explicitly the case of d\ Let k p =(QjiMx) be 
the proton four-momentum. Inserting a complete set of intermediate states, we find that 

f d*kp 

^W) = K>3(27T)* EE/ - 

0 ,INI « J ( 2 tt ) 3 

X|I<^[^(0) |^(^)K/3(^) |Z? & (0)|^>5(^——<^|Z? & (0) |^(^)><^(^) |Z? tt (0) |^>5(^+^— 

= 7 Te a &3 E E {C(^l-^a(O)l^(^ 0 )){^(^)|-^&(O)|^)]|kp«q 5 (^po — Qf)~ M tf) 

0,INT « 

-L(p I A(0) \Kh))(fi(h) I 0.(0) |*>] I k( =_,5(^o+?o-M w )}. (64) 

Parity invariance tells us that 

E E C(pl-^*( 0 )I-^a( 0 ) |/>)] | kp«-q 5 (^o+? 0 “ Mn) 

/ 9 ,INX « 

= E E [(/>l £&( 0 ) | P(k p ))(fl(k p ) I-°a( 0 ) I p)]I ^-q^o+tfo-M*). ( 65 ) 

0 ,INI « 

Thus Eq. ( 64 ) can be written, using the antisymmetry of e a &3j as 

^ , (? 0 j? 2 )=?T€ o fc 3 E E C(/ , |-^a( 0 )|^(^^))(/ 3 (^^)|Z>&( 0 )|/>)]|k tf =qC 5 (^ 0 — £o— 3 /#)+ 5 (^o+£o—Afw)]. ( 66 ) 

0 . 1 NI 8 


We see that d' is an even function of q 0 ; hence d is an odd function of go. Since 

ea i zD*D i =D 1 *D 2 -D i *D 1 =iil(D 1 *+iD 2 *)(D 1 -iD 2 )-(D 1 *-iD 2 *)(D 1 +iD 2 )^, 

we obtain finally the result that 

d'(qo,q*) = fr[.D^-D™ 3 , q 0 > 0 3 


with 


Z> ( ->= E EI 0&M(£o+-^w)] | Z>i(0)— iD 2 (0) | p) 1 2 $(&0o — qo—Mx) , 

0,INT a 

Z><+> = E EI</3[q,i(?o +M„)J\D 1 (0)+iD 2 (0)\p)\*8(kpo-go-M„). 

0,INI a 


(67) 


( 68 ) 


( 69 ) 


Clearly Eq. ( 69 ) is identical with Eqs. ( 41 ), ( 32 ), and ( 33 ), defining the structure function D , with q 0 given by 

q 0 =E P -E l =(W 2 -M N i +q 2 )/ 2 M N . ( 70 ) 

In a similar manner we find that ai, a 2j and i a' are even functions of q Q (which implies that <zi, a 2) and %a are 
odd functions of qo). Also, we find that for £ 0 > 0 , 


ai(qo>q 2 ) = &rZAi < ‘- ) — a2 , (qo J q i )=hir[A 2 ( - > — ^4 2 c+) ], iY(?o,? 2 ) = M7,t c >—/Y +) ], 


(71) 


where the structure functions ^ 4 i c±> , A 2 {± \ and Ia (±) are those defined in Eq. ( 41 ). Combining Eqs. ( 43 ), ( 61 ), 
( 62 ), and ( 68 )-( 7 l), we see that we have derived the sum rule 

W 


1 


-/ 


dq*[fi. 


(“) 


f W 

-Ai c+) ]= / - dW\$A^(q 2 ,W)-p A ™(q 2 ,W)l. 

J Mn 


( 72 ) 


Using Eq. ( 44 ), the pole contribution to Eq. ( 72 ) can be explicitly evaluated, giving Eq. ( 53 a). 

(C) Sum Rule for « (±) 

The sum rule on a c±) of Eq. ( 15 ) is obtained by adding together the two identities 

Cr 2 =fH 


/ q 2 \ r°° W 

= ( 1+-W) 2 + / - dWlaS-\q\W)- a S+Kq\W)-], 

\ ^Mn 2 ' J mn+m, Mn 

\ r w 

)gv(q*y+ / -^[ay(-)(? 2 ,W0-ay ( +)(?W]. 

/ J mn+m, Mn 


cy=\ — 

\4lMn 2 ' 

Here cu (±) and ay (±) are, respectively, the axial-vector and the vector parts of a (±) , 

a A <±> = Ay±> (q\W) , ay<±> = Fi<±> (q 2 ,W). 


( 73 a) 


( 73 b) 


( 74 ) 
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We will sketch the derivation of Eq. (73a); the derivation of Eq. (73b) is identical. 
In order to derive Eq. (73a), we use the fundamental identity, with 


A(t) 


-I 


d 3 x e~ ia,I ^ an b (x,t), B(t) 


-•/ 


d?y e is - y ff6„ 6 (y,0 


(75) 


Using Eqs. (4a) and (11), the first term on the right-hand side of Eq. (46) becomes 

— »XXplD4(0),-B(0)]|/>)=— e o6<; 5„mCr J (|T ,: )(25r) 3 5(0)+(symmetric in ab). 

a 

The second term is 


(76) 


U nm,ab— 


— l f f i rdffon 6 ^,^) “I r dSbm 5 (y 3 t) 

=~H(p\J <Px e- iz - x J d*y e' s ' T j|^---, SbJiyd) j+|---, 5 a * s (x 


2q 0 • J J IL dt 

which, by using the parity transformation properties of £F 5 , is equal to 


:,0]J \p), 


(77) 


If f i rd$ an 6 (x } t) “] 

-Z(p\J <Px<r* *j —-—, n 6 (y,oJ+[^ 


2 go 


dt 


} 9 : on 6 ( y 9 t) 


■a]}\*>. 


(78) 


The expression in Eq. (78) is explicitly symmetric under the simultaneous interchanges n*-*m y a*-*b. Since 
parity invariance requires that U$ be of the form 

U 2 nm ’ ab =ni ab 8n m +to ah SnS m) (79) 

U 2 is symmetric under the interchange a <-> b. Thus, if we keep only terms which are antisymmetric in a and b t 
the unwanted terms drop out. 

As a result, we are left with the identity 


5 nm Cr 2 = — y(qo } q 2 ) 
dqo 


oqo Iqo-o 

v(Qo,q 2 ) = di(qo,q 2 )8 nm +d 2 (qo,q 2 )q n qm= tabzj d*x er^' x d(x^(p 1 £ 


fd$a n *(x) 


dxi 


dt 


]l p)- 


(80) 


[Here d$bm 5 (0)/dt denotes 39 : fr m 6 ( y,t)/dt evaluated at y=0, /=0.] Let us now postulate that 

d$i(qo,q 2 )/dqo 


( 81 ) 


satisfies an imsubtracted dispersion relation. It is easy to see that the absorptive part of di(qo,q 2 ) is just q 0 2 times 
the absorptive part of the amplitude ai(qo 3 q 2 ) defined in Eq. (60). Thus, the 8 nm term in Eq. (80) becomes 


C/ 2 = fdq^-A^= [ -dWlA 1 ^(q 2 i W)-A^)(q\W)'\ i 
J J Mn 


( 82 ) 


which is the result to be proved. 


(D) Sum Rule for y (±) 

The sum rule on y (±) of Eq. (16) is derived by adding the fundamental identity, with 


Ai(f) 


~<I 


d*X an 6 (x,0, Bi(t) 


= —i Jd 3 y e ia ' 


y &bm(y 7 t) , 


(83) 


to the same identity, with 


Ai(t) = -i Jd 3 x <r« iff an (x,0 , B t (t)= ~i jd?y ■ 

Using Eq. (4b), the first term on the right-hand side of Eq. (46) is 

—^£(/ , |C^i(0)j5i(0)]+[^4 2 (0),B 2 (0)j|/>)=(symmetric in ab). 


(84) 


( 85 ) 
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since 


H(p\a ci (0)\p)=0 (86) 


for nucleon states at rest. The second term, using the parity transformation properties of the currents, becomes 



dt 


3 «Fbm(y>0 



dSF&m(x,/) 

dt 



=M3 a& e n »i5i. 


+ 


[ 


d 9 * an (X,/) 

dt 


, SF&» 5 (y,0 


■»]- 


r d 9'6t» 6 (x,/) 


dt 




Clearly, Ms®* is symmetric in a and b . If we keep only the antisymmetric part of the identity, the [d^/dt,#] and 
the [d$/dt i $ 6 ~\ terms drop out 
Thus, we get the identity 

0 =— i(q*,q*) 3 

dqo ao—o 


d$m?(x) d$b m (0)“] rd$*n(x) dS bn *(0) 


L dxt 


dt 


n r 
- + 


dxt 


i(q(hq 2 ) = *obzJd 4 xe iq ' x 6(xo)Yl(p 
The postulate that 

dq 0 

satisfies an unsubtracted dispersion relation in q Q leads immediately to Eq. (16). 


dt 


]i p) 


( 88 ) 

(89) 
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APPENDIX 

In this Appendix we give the generalization of the results stated in Sec. 2 to the case when all lepton-mass 
terms are included. In order to calculate lepton-mass corrections, it is easier to work covariantly, rather than to 
eliminate the fourth components of currents in terms of spatial components and divergences. Thus we write 

Tx,= E E $(k{jo-kNo-qo)(N(k N ) | (Ja V +Ja A ) ir \P(k N +q))<fi(k N +q) \ J\ V +J\ A \N(k N )) 

0,1NT « 


-- [Ab\ a +BkN$No^Ge\ 0 7iq 1 kNl^Dq\q<r'\-E(jq>JlN<r'\-q<rkN-))~], 

^N0 


(Al) 


(A2) 


with A } •♦*,£? functions of q i and W. Time reversal and parity invariance rule out the presence of a term propor¬ 
tional to qyJtNa — qirk]f\ in Eq. (Al). Comparing Eq. (Al) with Eq. (41), in the laboratory frame, shows that 

A=a(q\W), Mx t B=l3(q t ,W), M N C=y{q\W), D=5{q\W)= V*{q\W)+A t (tf,W ), 
M^ i (q\W) = q^{V l (q\W)+A 1 (^,W)+q^V i (q\W)+A i (q\W)^+hUv(q 2 ,W)+I JL (q\Wy]}. 

It is straightforward to calculate the contraction of T\„ with the leptonic trace. We find that Eq. (13) and Eq. (22) 
for the strangeness-conserving case are replaced by 

/ fv\ /l\ \ / G 2 cos 2 0c[(£,-? o ) 2 -mz 2 ] 1/2 

({-)+p^ (l) +fi(s=0) ) / dQ ^=— -=-* c±) > ( A3 > 


dVl 


dV 


(Q+p-» Qw=°)) / <w?o= 


(2jt) j 
G 2 cos 2 0c 


E r 


><±) 


(A4) 
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with 


K t ±)=( 3 2 +m I 2 )a<±)( 3 2 ,IV)+[2E > 2 -2£, 3 o-i(? 2 +w, 2 )]^t±)( ? 2 ,lV) 

^[(2£,-? 0 V-m, 2 ?o]Y c±) ^^ (A5) 

Inspection of Eq. (A5) and its analog for a neutron target shows that jS c±) , y c±) , e c±) , and a (±) +§m* 2 5 c±) are 
independently measurable. Since the derivation of the sum rule on a (±) given in Sec. 4 shows that 

0= jdqoiS^ — 5<+>), (A6) 

we may modify Eq. (15) to read 

Ci'+Ci*=(\+qyWx*)g A (q*y+W 

+^tni 2 Z(l+q 2 /4MN i )fv(^) i -gv(q 2 )fv(q 2 )/MN+(q 2 /Uf N i )h A (^) 2 -k A (q z )g A (q i )/M^ 

r 00 W 

+ / -rfTF[a<“)(^lF)+im, 2 5 c - ) (^,lF)-a t+) (7 2 ,lF)-^m, 2 5< + )( ? 2 ,lF)]. (A7) 

J Mti+M, Mn 


Thus, in the strangeness-conserving case, when lepton-mass terms are included there are still three sum rules which 
may be directly compared with experiment. 

In the strangeness-changing case, equations similar to Eqs. (A3)-(A5) hold, and jQ( P ,«) (±) , wi 2 e( Pr „ ) (± >=b^ 2 7 ( 1 ,,n) c±) , 
and a Cp.n) c±) +2Wz 2 5( 1 , t n) c±) =b7o7(p 1 n) c±) are independently measurable. We see that in this case, when lepton-mass 
terms are included, only the sum rules on jS( P ,n) (±) can be directly compared with experiment. 

It is easy to verify that the results of Eq. (26) and Eq. (27), referring to the high neutrino-energy behavior of 
neutrino cross sections, are unchanged by adding the lepton-mass terms. Equation (24) becomes 

do- rW-LHE,*) fa 

- = 1 dq 0 -, L=q*+tni z - 1-. (A8) 

dq 2 J (g a / 2 A/jy)— d(q 2 )dq 0 qt+m? 

If, in addition to Eq. (25), we postulate that 

f°° dq 0 f°° dq 0 

/ —(«<->-«<+>), / —( e c-)- e c+)) (A9) 

J qo 2 J qo 

are convergent (and similarly in the strangeness-changing case), then we immediately obtain Eqs. (26) and (27). 


Sum Rules Giving Tests of Local Current Commutation Relations in High-Energy Neutrino Reactions, 
Stephen L. Adler [Phys. Rev. 143, 1144 (1966)]. In Eq. (45) for Vt (r ~ ) (q i 9 W) t in both terms on the 
right-hand side,/^(g 2 ) should be Fi v (q 2 ), 
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Neutrino or Electron Energy Needed for Testing Current 

Commutation Relations* 

Stephen L. Adler! 

Institute for Advanced Study , Princeton, New Jersey 

AND 

Frederick J. Gilman! 

California Institute of Technology , Pasadena , California 
(Received 19 October 1966) 

For small ]eptonic invariant 4-momentum transfer we investigate what minimum neutrino or electron 
energy is needed to test current commutation relations. We find that at laboratory energies of order 5 BeV, 
it is reasonable to start trying to check the equalities or inequalities implied by the current algebra. 


R ECENTLY Gell-Mann 1 has postulated that the 
fourth components of the vector and axial-vector 
weak-current octets satisfy the local equal-time com¬ 
mutation relations 

Z$at(x),$h4(y)l\zr*t= ~fabc^c*(x)8(x- y), (la) 
[^9 : a 4(^)jSFM 6 (y)J | lo— ~~fabc*?c4?(%)8(x y) j (lb) 

Z$a*K x )>$u*(y)l\*9r**= -fabc$c*(x)8(x-y). (lc) 

It has been shown by Adler 2 that Eqs. (la) and (lc) 
can be tested in high-energy neutrino reactions, where 
they imply that, in the limit of infinite incident neutrino 
energy, the difference d<TT{v-\-N)/d(f ~~d<XT{v-\-N)/dcff 
approaches a constant which is independent of the 
leptonic invariant 4-momentum transfer <f. Bjorken, 3 
by an isospin rotation, has transformed the neutrino- 
reaction results into inequalities on electron-nucleon 
(or muon-nudeon) scattering which hold in the limit 
of infinite incident electron energy; these inequalities 
may make it feasible to test Eq. (la) in the near future. 

However, before proceeding to experiments, one must 
answer the question, what is effectively an infinite inci¬ 
dent neutrino or electron energy E? More precisely, 
what is the energy E(q\8) such that for E>E((f,8) 
the neutrino equalities hold to within fractional error 
5? No general answer to this question can be given. 
However, we will show, in this paper, that the experi¬ 
mental information used in evaluating the sum rules 
for the axial-vector coupling constant and for the 
nudeon isovector radius and magnetic moment suffice 
to determine 0, 5) in two cases. 

The results indicate that at incident energies of order 
5 BeV it is reasonable to start trying to check the equali¬ 
ties or inequalities implied by the current algebra, at 
least for small g 2 . It is encouraging that this energy 
range is accessible to the Stanford Linear Accelerator. 

* Work supported in part by the U. S. Atomic Energy Com¬ 
mission. Prepared under Contract No. AT(ll-l)-68 for the San 
Francisco Operations Office, U. S. Atomic Energy Commission, 
t Junior Fellow, Sodety of Fellows, 1964-66. 
t National Science Foundation Postdoctoral Fellow, 1965-1966. 
1 M. Gell-Mann, Physics 1, 63 (1964). 

* S. L. Adler, Phys. Rev. 143, B1144 (1966). 

8 J. D. Bjorken, Phys. Rev. Letters 16, 408 (1966). 
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Let us review the results 2 for the high-energy neutrino 
reaction 

v-\~N —>/-|-jS. (2) 

We neglect the lepton mass mg 9 and define the following 
kinematic quantities (noncovariant quantities always 
refer to the laboratory frame ): 

Mn ~nucleon mass, 

E— incident neutrino energy, 

E'= final lepton energy, 

k = neutrino 4-momentum, 

k'= final lepton 4-momentum, 

q 2 — (£—&') 2= l e ptonic invariant 4-momentum (3) 

transfer, 

qo=E— -E'=leptonic energy transfer, 

W =invariant mass of system /? 

= (2 Msqt+MJ-f}'*, 
q 0 =(m-M^+f)/(2M N ). 

Let G be the Fermi coupling constant and 6 C the Cabibbo 
angle. Then the cross section for strangeness-zero 
(5 = 0) final states may be written 

*{(>^ 0"^^( 5= o)]X dqidq ° 

G 2 cos 2 <9 c 

=-[ 9 2 a<±>+ (2&-lEq 0 -WW ±] 

=F(2£-g 0 )gV±>]. (4) 

The form factors a (±) , and y (±) are functions of 
g 2 and g 0 . The local commutation relations of Eqs. 
(la) and (lc) imply that 

2 = [ (5) 

Jo- 

Combined with Eq. (4) and with the expression for 
1598 
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Fig. 1. The function 
Fi{E) defined in Eq. (12). 
Above ?=5 BeV, we have 
assumed <r <+ >ccjr" a . 

See Ref. 7 for details of 
the numerical evaluation. 



dtx/ dq 2 , 


-£tl-« s /(4£T*)] 

df Jo 


dq { 


d?<r 

i 

d(fdq( 


( 6 ) 


Eq. (5) implies that 


lim 

E —»oo 


do-rUv-hp —> 0(S— 0)] do*r[j'+/> —> 0(5=0)] 


dg 2 


dg 2 

G 2 cos 2 0. 


- (7) 


T 


In order to study the manner in which the limit in 
Eq. (7) is approached, let us s^narate 0 C±) in the sum 
rule of Eq. (5) into vector and axial-vector parts, 

pi±)-fi Y t±)+p A t±) 9 ( 8 ) 


which separately obey the sum rules 2 


1 = 


dqotPA'-'-pA™! 


J o 

=[g* («!*)?+ 


,00 


Afy-j-Afr 


w 

m n 


00 


1 = / dqolPr'-'-Pr™! 

J 0~ 

= [Fi y (9 2 )?+9 2 [^(9 2 )] 2 + 


00 


Afy+Af* 


W 

m n 


XdW D3v^(g 2 ,^)-0v c+) (g 2 ,^)]. 




At g 2 =0 and for W>M N +M xt /V±>(0,PF) = O, so 
that Eq. (9b) becomes the trivial equality 1 = 1. Since 


Fig. 2. Input values (Ref. 7) of 
<r<’“>'-<r< + > for values of v from 
threshold to 10 BeV, including the 
asymptotic tail above 5 BeV for 
the case a=0.5. 



T> 



.3 A £ A .7 A 3 L0 

V (BeV) 
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Fig. 3. The function F 2 (-E) defined in Eq. (16). Above $0 = 1.1 
BeV, we have assumed 

20 r rE'y(^“ 1)+^ —► 1=4]—^[ 7 (^= 1 )-\~P ~ y §] cc< i r ir a . 

See Ref. 10 for details of the numerical evaluation. 


(3a (±) (0,W) is related to <r c±) (0,1^), the total cross 
section for the scattering of a zero-mass on a proton, 
by 4 

4 M^g A 2 o-^UO,W) 

0a (±) (0 ,W)= -—-, (10) 

rgr(0y 

we see that Eq. (9a) becomes the usual sum rule for 
gA- 5 ' 6 Thus, Eqs. (4)-(7) imply that, at ^ = 0, 

dcr^v-hp(3(S= 0)] <for[i'+#-^► £(£=0)]] 


d(f 


dq 2 


9-0 


G 2 cos 2 0. 


with 

Fi(E) = gA 2 


. E 


1- 


2t 

dqo 


: [l+Fi(E)], (11) 


M T +M T 2 tt2M N ) qo 


(-!) 


v=(W 2 —M n 2 —Mf)/{2M N ) is the pion laboratory 
energy. If PCAC is valid, the integrands of Eqs. (12) 
and (13) are expected to differ appreciably only for 
small center-of-mass energy W, where kinemadeal 
threshold effects may be important; this difference 
should not greatly affect the large-E behavior of F 1 (E ). 

The numerical evaluation 7 of Fi(E) is shown in Fig. 
1 [in Fig. 2 we plot the input data —<r (+) 3. It is 
seen that for energies of a few BeV, F\(E) becomes 
monotonic and greater than 90% of its asymptotic 
value of unity. Thus, neutrino energies of order 5 BeV 
are certainly adequate for testing the local current alge¬ 
bra at g 2 =0. 

The curve of Fig. 1 has no direct bearing on Bjorken’s 
inequalities for electron scattering, since these inequali¬ 
ties come from the vector sum rule of Eq. (9b) and do 
not involve the axial-vector sum rule of Eq. (9a). As 
we remarked above, Eq. (9b) becomes a trivial identity 
at g 2 =0. However, the first derivative of Eq. (9b) with 
respect to g 2 gives the interesting sum rule 

d r 00 W d 

0= 2—| a ! =o+[F/(0)] 2 + / —dW— 

d(f J m n +m v Mn dq 1 

X (?,W)-0v w ( 9 W] 1,*-o, (14) 

which has been derived by Cabibbo and Radicati and 
others. 2 - 8 To exploit this fact, let us keep only the vector 
part of Eqs. (4)-(7) and expand in a power series in q 2 
{we use the fact that a^ c±) | fl a _o=[?o 2 d07 C±) /dg 2 3| a ‘=o 
= Fi (±) (0,tF), in the notation of Ref. 2}: 

dcrvL /? (£ = 0)3 do- rv[ v-\-p —► ( S= 0) 3 

dtf dq 2 


2 2 

X-—[tr<+>(0,IV)—a-<->(0,l^)3: . (12) 

Tgr(0) 2 J 

In Eq. (12), g 0 — [W 2 —M N 2 )/(2M N ), as obtained from 
Eq. (3) for g 0 with ^=0. Rather than using Eq. (12) 
for our numerical analysis, we use the expression 


with 


G 2 cos 2 0 c 


2 T 


1+—, 



F,(E) 

Mw 2 


d 

2—Fi y (g 2 ) | « s -o+ [f 2 y (0)] 2 
dtf 


1 1 
2M N E 4 E 2 


[ r E dv / v 

Fi(E)~gA 2 \ 1 - / ~{v 2 -M v 2 )M 1- 

1 ijf f ^ \ E 

2 M n 2 I 

X-[a-W(tF)-a-t-)(TF)] , (13) 

TTgr 2 

which involves only the pion on-mass-shell cross sections 
<r c±) (tF)=(r (±) (— M x 2 t W). The statement that the 
right-hand side of Eq. (13) approaches 1 as £—>co is 
the polology form of the gA sum rule. 6 The variable 

4 In writing Eq. (9), we are defining the pion interpolating field 

to be the divergence of the axial-vector current, suitably normal¬ 
ized. The partially conserved axial-vector current (PCAC) 
hypothesis is used when we replace Eq. (12) by Eq. (13). 

6 W. I. Weisberger, Phys. Rev. Letters 14, 1047 (1965). 

6 S. L. Adler, Phys. Rev. Letters 14, 1051 (1965). 


f E dq 0 r qo qo 2 

_!_ j l }- 

7jvf.+Af» a /c2AfN) go 2 L E 2E? 

X [ (0,TF)- Fi< + > (0,TF)], (16) 

and ?0 =(IP-Af^)/(2Af w ) as in Eq. (12). The inte¬ 
grand in Eq. (16) may be related to the total cross 

7 We have used the cross-section tabulation of G. Hbhler, C. Ebel, 

and J. Giesecke [Z. Physik 180, 430 (1964)]. Above ?=5 BeV, we 
have assumed * = — sBev^/S BeV) “. For 

each value of a, we have normalized Fi, so that Fi(w) = l. 

8 S. L. Adler (unpublished); J. D. Bjorken (unpublished); 
Phys. Rev. 148, 1467 (1966); N. Cabibbo andL. Radicati, Phys. 
Letters 19, 697 (1966); R. F. Dashen and M. Gell-Mann, in 
Proceedings of the Third Coral Gables Conference on Symmetry 
Principles at High Energy (W. H. Freeman and Company, San 
Francisco, 1966). 
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1601 


Fig. 4. Input values (Ref. 10) of 

— cttC'K/— l)-f m p —*»/ = §] 

for values of qa from threshold up 
to 10 BeV, Including the asymp¬ 
totic tail above go*1.1 BeV for 
the cases a=0.3 and a —0.7. 



sections for isovector photons incident on nucleons to 
produce I=% and /=§ final states: 

(0 ,W) - 7i<+> (0 ,W )=— 

27r 3 a 

X{2«rrDy(/-l)+*->/=« 

-ot[t(/=1)+#->/=*]}. (17) 

These isovector photon cross sections have been esti¬ 
mated by Gilman and Schnitzer, 9 who find that Eq. 
(14) appears to be satisfied. Using numerical estimates 
similar 10 to those of Gilman and Schnitzer, we have 
computed Fi(E). The result, shown in Fig. 3, indicates 
that |F,(£)| is less than 0.5 for energies in the 5 to 10 
BeV range. {In Fig. 4 and Table I we give the input data 
2<rj{y{I=l)+p —!► /—<rr[y (/=!)+£ ► /= a]-} 

To conclude, for small g 2 , energies of order 5 BeV 
suffice to test local commutation relations. We must 
caution that as g 2 increases, the needed energy 
will be expected to increase rapidly. This is clear from 
the experimental fact that the single-nucleon and (3,3) 
resonance contributions, i.e., the small W contributions, 
to the sum rules of Eqs. (9a) and (9b) decrease rapidly 
with increasing g 2 . 11 Thus, to maintain a constant sum 

9 F. J. Gilman and H. J. Schnitzer, Phys. Rev. 150, 1362 (1966). 

10 Up to go“ LI BeV, we have included the contributions of the 

s-wave, V*(1238), iV*(1520), and iV*(1688). We assumed the 
2V* (1520) and iV* (1688) peaks measured in photoproduction come 
only from isovector photon transitions, and that for these two 
resonances, r e iaetic/rtotai*=0.6. Above 1.1 BeV, we have assumed 
2<rz£7(I~ 1)+^ —* I ~Y]~ = 1 )+£ —* / s =§] s =iV’ 5 o'““ f and 

for each oe we have chosen the normalization N of the tail to make 
F 2 (cq) =0. For none of the values of oe considered did this require 
an unreasonably large tail. 

11 Nucleon form factors: E. B. Hughes et al. t Phys. Rev. 139, 
B458 (1965); electroproduction: A. A. Cone et al., Phys. Rev. 
Letters 14, 326 (1965); weak production: CERN Report No. 
NPA/Int. 65-11, 1965 (unpublished). 


Table I. Values of 

2otCtU* 1)+/* I t= h1~~ !)+/> —* /*=§] 


up to go«l.l BeV. 


(MeV) 

2ffrr 7 (/«l)+^-»/-H 

— crj.[«y(/=l)-|-^ —► /*=§ ] 

fcb) 

150 

+8 

175 

+60 

200 

+94 

225 

+50 

250 

-11 

275 

-108 

300 

-238 

325 

-328 

350 

-287 

375 

-200 

400 

-138 

425 

-86 

450 

-49 

475 

-24 

500 

+3 

525 

+7 

550 

+30 

575 

+46 

600 

+64 

625 

+100 

650 

+114 

675 

+ 148 

700 

+182 

725 

+217 

750 

+227 

775 

+176 

800 

+125 

825 

+80 

850 

+54 

875 

+53 

900 

+54 

925 

+54 

950 

+66 

975 

+76 

1000 

+75 

1025 

+72 

1050 

+46 

1075 

+22 

1100 

+10 
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at large g 2 , the high W states, which require a large important question of how rapidly E(<f f 5) increases 
E to be excited, must make a much more important with g 2 , but only serve to indicate at what energies E 
contribution to the sum rules than they do at g 2 =0. it may pay to begin the experimental study of the local 
The calculations of this paper shed no light on the current algebra. 
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Low-Energy Theorem for the Weak Axial-Vector Vertex* 

S. L. Adler! and Y. Dothan! 

California Institute of Technology , Pasadena , California 
(Received 13 May 1966) 

A low-energy theorem is derived for the weak axial-vector vertex. The theorem enables one to calculate 
from strong or electromagnetic processes the two leading terms in the expansion of the axial-vector vertex in 
powers of the Ieptonic four-momentum transfer. Applications to weak pion production, decay, and 
radiative u capture are discussed. In particular, we express the radiative ^-capture matrix element, up to and 
including contributions linear in the Ieptonic four-momentum transfer and the photon four-momentum, in 
terms of the elastic weak form factors and pion photoproduction amplitudes. 


INTRODUCTION 

I T is well known 1 that the infrared divergent order hr 1 
term in the matrix element for the radiation of a 
photon of four-momentum k in any process (the matrix 
element of the electric current) can be expressed solely 
in terms of the matrix element for the same process with 
no current present. Low 2 has shown that current con¬ 
servation enables one to calculate the electric-current 
matrix element not only to order k~ l but also to order 
k° in terms of the process without the current. In the 
present work, we derive analogous results for the matrix 
elements of the axial-vector current. We express each 
such matrix element in terms of the matrix element for 
the process with no axial-vector current and the matrix 
element of the divergence of the axial-vector current. 
The relation is exact to orders k~ l and k°. Under the 
assumption of a partially conserved axial-vector current 
(PCAC ), 3 we can relate the matrix element of the di¬ 
vergence to the corresponding matrix element of the pion 
source, which is physically measurable, apart from the 
usual small off-mass-shell extrapolation . 4 Thus we 
obtain an expression for the axial-vector matrix element 
solely in terms of physically measurable quantities. 
Clearly, this shows that the essential point in Low’s 
derivation is not current conservation, but the fact that 
the divergence of the current is independently measura¬ 
ble. Results analogous to ours will hold for any current 
whose divergence is known. 

In Sec. I we state two simple lemmas and rederive 
Low’s results from them. In Sec. II we derive the 
analogous results for the strangeness-conserving weak 
axial-vector current. We also show how these results 
are modified when two currents are present, instead of 

* Work supported in part by the U. S. Atomic Energy Com¬ 
mission. Prepared under Contract AT(ll-l)-68 for the San 
Francisco Operations Office, U. S. Atomic Energy Commission. 

t Junior Fellow, Society of Fellows. Present address: The 
Institute for Advanced Study, Princeton, New Jersey. 

t On leave of absence from Israel Atomic Energy Commission, 
Soreq Research Establishment, Yavne, Israel. 

1 J. M. Jauch and F. Rohrlich, The Theory of Photons and Elec¬ 
trons (Addison-Wesley, Reading Massachusetts, 1955), p. 391. 

8 F. E. Low, Phys. Rev. 110, 974 (1958). 

8 M. Gell-Mann and M. L£vy, Nuovo Cimento 16, 705 (I960); 
Y. Nambu, Phys. Rev. Letters 4, 380 (i960); K. C. Chou, Zh. 
Eksperim. i Teor. Fiz. [English transl.: Soviet Phys.—JETP 12, 
492 (1961)]. 

4 S. L. Adler, Phys. Rev. 140, B736 (1965), Sec. IIIC. 
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only one. As an application, we treat in Sec. Ill the 
following processes: Weak pion production, K ei decay, 
and radiative ju capture. In particular, we find in the 
case of radiative ju capture that when terms of order qk 
and higher are neglected (q= lepton four momentum 
transfer, £ = photon four-momentum), the matrix ele¬ 
ment can be expressed solely in terms of the elastic 
weak form factors and pion photoproduction ampli¬ 
tudes. This means that structure effects linear in q or 
linear in k are determined, giving the leading corrections 
to the radiative ju capture matrix element previously 
calculated by Manacher and Wolfenstein 6 and by 
Opat . 6 

I. LOW’S RESULTS FOR THE ELECTRO¬ 
MAGNETIC CURRENT 

We consider the process a —> 6 + 7 , where a and b are 
arbitrary hadron states. The matrix element for the 
process is given by 7 

out(^T I a)ia= fe(27r) 4 5 C4) (p a — pb~ k) 

1 

X- NaNb€a*Ma, (1) 

(2tt) 3/2 (2£o) 1/2 

where p a , pb, N ai and N b are, respectively, the total 
four-momenta and the normalization factors of the par¬ 
ticles in states a and b , is the polarization of the 
photon, and k is its four-momentum. The quantity M a 
is related to the matrix element of the electromagnetic 
current 7« EM by 

NaNbM a = ou t (b 1/« EM | a)in • (2) 

Conservation of the electromagnetic current implies 
that 

k a M a = 0. (3) 

We state two simple mathematical lemmas from 
which Low’s results are easily derived. [In the follow- 

6 G. K. Manacher and L. Wolfenstein, Phys. Rev. 116, 782 
(1959). 

8 G. I. Opat, Phys. Rev. 134, B428 (1964). 

7 Four-vectors have an imaginary fourth component: p = (p,/>4) 
« (pt*A>) and />-g s =p*q+£4tf4=p*q'-/Wo. The quantity p* is de¬ 
fined by p*=p* p? = —p&y where * denotes the ordinary com¬ 
plex conjugate. The y matrices ( 71 * 72 , yz t y*» yt — yiyiyi'Y*) are 
all Hermitian, and satisfy 7070+7 &y a ~ 25 fl p. 
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Fig. 1. The nonradiative process. 


ing, 0(& n ) denotes terms of the nth or higher degree in 

k.~\ 

Lemma 1; Let M a JI be an arbitrary four-vector 
function of arbitrary independent variables, which is 
independent of the four-vector k a . Then k a M a Il7 =0(k 2 ) 
implies that Af« n = 0. Proof: Obvious. 

Lemma 2: If k a M a —0 and M a = Ma+Ma J +0(k) y 
where Ma 1 is independent of k and where 0, 

then M a =Ma+0(k). Proof: k a M a ^=k a Ma~0 implies 
k a MJ I =0(k r ) t so by Lemma 1, M a n =0. 

Note that “independent of k ” is not the same as 
“zeroth order in k .” For example, k a /p • k is zeroth order 
in k but is not independent of k . 

We now apply the lemmas to the two cases con¬ 
sidered by Low. First we discuss scattering of a charged 
scalar particle from a neutral scalar particle (Fig. 1). 
We denote the initial and final neutral-particle four- 
momenta by r\ and and the corresponding charged- 
particle four-momenta by pi and pi. Let T(s=pi 
•rvhpyft, (fi —fify Ai— Ai~p^-\-M 2 2 ) 
be the transition amplitude for the nonradiative process 
in Fig. 1. We have explicitly indicated the dependence 
of T on the amount by which the external charged par¬ 
ticles are off the mass shell, since the amplitude for the 
process in which the photon is emitted from one of the 
external charged particle lines involves the off-mass- 
shell nonradiative amplitude. The physical nonradiative 
amplitude is r(j,/,0,0). 

The radiative amplitude gets contributions from two 
types of terms: terms in which the photon is radiated 
from an external charged particle line [Figs. 2(a) and 
2(b); we call these terms lf« eit ] and terms in which the 
photon is radiated from an internal line [[Fig. 2(c); we 
call these terms The infrared divergent terms 

come only from H« eit , while M a Xat is finite at k = 0. We 
write 

Ma int (^=M a lnt (0)+O(k). (4) 



We can express lf« ext in terms of T, 

(2p2~hk) a 

Ma ^= - T[s+ryk t t, 0 , (Pi+k) 2 +M 2 2 l 

We expand T with respect to k } giving 

(2 pi~hk)a (2/>i ^)« 

Kr __ 0,0]- Tlsfiflfly 


(2pi m hk) • k 


(2pi—k)-k 


/ p2a pla \ d 

+ (- r t ■ k - rr k )—r[>/,0,0] 

\pyk pvk /ds 


+ 2pia - Ai] 

d&i A 2=0 


+2^i«—-2TV,Ai,0] 
OAi 


+0(k). (6) 


Ai-0 


We are now able to rewrite M a in the form required by 
Lemma 2, 

, (7) 

(2pi+k) a (2p\—k) a 

Ma J =~ - —Tls& 0,0]- 


(2pi~hk) 'k 

/ p2a Pla 

+ (- fyk-\ - ryk 

w 


(2pi — k ) 'k 


pyk pyk 


r 2a 


X—r[j,/,0,0], (7a) 
ds 


Ma n = (fia+ru)— 2T>,*,0,0] 

ds 


-\-2p 2a -2XMj0,A J 

dAi 


A 2—0 


dAi 


+M« lnt (0). (7b) 


A 1=0 


From this we conclude that M a — M t y+Q(k). In other 
words, the terms in the radiative amplitude of order k° 
as well as those of order k~ l have been determined. 

The procedure required by the lemmas may be re¬ 
duced to a simple recipe: (1) Write down M a ex \ the sum 
of the terms in which the photon is radiated from an 
external charged particle line. (2) Drop all terms from 
M a 6x11 which are explicitly independent of k y giving a 
truncated amplitude Af« ext '. (3) Add to M« ext ' a A M a 
independent of k so as tomake^Jf^'+AAfa^ 0(£ 2 ). 
Then M a ext '+AAf« is the Ma required by the lemma. 

Let us apply this recipe to the problem considered 
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above. We have computed M a 6 ** in Eq. (5). In the first 
term let us expand T with respect to the off-mass-shell 
variable but not with respect to the energy variable: 

T[s-\-ryk, t, 0, (^ 2 +£) 2 +M2 2 J 

=T\j+r r h, t, 0, ti]+lfa+ky+M<F\ 


X 

d 

- T[s+r 2 'ky ty 0, Aj 

« 

+0(k) 


dA 2 

As— 0 


The off-mass-shell derivative term in this expansion, 
when substituted into Eq. (5), leads only to terms 
which are either explicitly independent of k or are of 
first order in k. These terms are dropped in forming the 
truncated matrix element. We repeat this procedure for 
the second term in Eq. (5). Thus the truncated matrix 
element M« ext ' is 


(2/>2+£)« 

M a ext = - T[s-j-rt'k, t t 0, 03 

(2^2+&) 'k 

(2pi-k) a 

- T[s-ri'k y iy 0,0 ] " ■ - ■■ -+0(k ). 

( 2pi—k)'k 


The divergence of M a 6Ii ' is 

T[s+rrky t y 0 , 0 ]] 

-Tls-n-ky ty 0, 0]+0(£ 2 ) 



d 


= (rrk+rvk)-TZsyty0y0l+O(k>) . 

ds 



Hence, AM a is determined to be 

d 

A— (r 2 +n)«-“T[j,/,0,0]. (11) 

ds 


Clearly, M a Qxt '-j-AM a is identical with the Ma 1 of 
Eq. (7a) to order k. 

As a second illustration of the procedure, we consider 
the case when the charged particles have spin §. This is 
the simplest photon analog of the axial-vector case, 
since the axial-vector vertex cannot couple to a spin- 
zero particle line. As we shall see, the only difference 
from the preceding case is' due to slight complications 
caused by spin. 

We start by writing down M^y 




M 1 

(iy a+ i - Vafikfi) -'- 

2Mi iy(pz+k)+Mi 


XT£s+r 2 -ky ty 0, (pt+kY+M£~\ 

+ TlS-rvky ty (Pl-kY+M^y 0] 

X-^i<y a +t-tr«^^|tt(jl) . 

iy (Pi~ V 2Mi I\ 


(12) 


Let us discuss the first term of Eq. (12). Because the 
final fermion is off its mass shell, T^s+r^k, /, 0 , (^ 2+£) 2 
+M<f\ contains terms which give a vanishing contri¬ 
bution as k —» 0 when multiplied on the left by a spinor 
u(p 2 ). These terms are not physically measurable in the 
nonradiative process. It is therefore convenient to 
write T in the form 


T{_s-\-t2'ky ty 0, (^j+^) 2 +W] 


iy - (pr\-k)+W 
2 W 


T*ls+rvky ty 0 , (p 2 +k)*+M 2 *l 


-iy(pi+k)+W 

+- T^s+rr ky ty 0 , 

2 W 

X(pt+k)*+Mfiy (13) 

where W denotes (Pr\-kY^ 12 . The term T p [j,£, 0,03 
is the amplitude measured in the nonradiative process. 
We rearrange Eq. (13) in the form 

T{s-\-r 2 'ky ty 0, (p 2 -\-k) 2 -\-M^\ 

- T p \j+r t -k, ty 0,0 3+py- (*«+*)+Jfj 
xf[— h' (Pi+fy+Mi ]— 

l dA 2 


XT p {_s-\-r 2 'ky ty 0, Aj -\-0(k) 

\a 2 -o 

1 p iy(pi+k)~ Mt~\ 

+- 1 +- 

2 WL W+M 2 J 

X{T*\j+rrk, ty 0, (Pt+ky+Mfl 

- T p [_s+r t -k, t, 0, (/> 2 +^) 2 +M j 2 ]}! . (14) 


When substituted back into Eq. (12), the term in bold¬ 
face brackets in Eq. (14) leads to terms either independ¬ 
ent of k or of first order in k . Strictly speaking, we 
should have included in Eq. (12) the negative-frequency 
terms in the photon-spin- 5 -off-mass-shell spin-J vertex. 
By the same argument, these terms do not contribute 
to the truncated matrix element. Hence, the truncated 
matrix element is 


= u(p 2 ) (iy a +i - Vapkp) 

{\ 2 M 2 / 


1 


X 


iy (Pi+k)-\-M 2 


T p ls+r r kyty 0,0] 


1 


+ T p ]j—rvky ty 0 , 03 


iy (pi~k)-\-Mi 


x(iy a ~hi - <Tapkp ]|w(^0+O(^), (15) 

\ 2M i /) 
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which involves only the physically measurable matrix 
element. Using the identities 


1 

u(p 2 )iyk -— - =u(pt), 

ty(p2+k)+M2 

1 

:—:- —~—hr -ku(pi)=-u(pO, 

ty (pi—k)+Mi 



we can calculate k a M « ext ', 

^aAf« ext ' = «(^2){r p [j+r 2 *^ J t, 0, 0] 

-T*ls-r v k, t, 0 , 0 ])u(p,)+0(k 2 ) . (17) 

The expression between the spinors is identical to 
Eq. (10) in the spin-zero case. Therefore, AM a is 


d 

AM a = — (r 2 +rd a u(p£—-T p [s t tflfi]u(pi) , (18) 

ds 


and M a 1 is Jf« ext '+AAf a . This is Low’s result. 


n. AXIAL-VECTOR CURRENT 

We now consider the matrix element of the strange¬ 
ness-conserving weak axial-vector current /^ between 
hadron states a and 6 , 

N a NbM a 3 '= out (b | Ja A *\ (19) 


axial-vector current is coupled to external particle lines. 

(2) Drop all terms from MJ ext which are explicitly in¬ 
dependent of k t giving a truncated amplitude MJ ext '- 

(3) Add to MJ ***' a A MJ independent of k so as to 
make ^«(Af a J ext '+AM a J ) = D J ‘+0(fe 2 ). Then Mj exV 
+A Ma 3 is the Ma 1 required by the lemma. We actually 
will not omit all terms of order k, but will consistently 
retain terms of order k which explicitly contain a pion 
propagator. 

As an illustration of the recipe, we will consider the 
problem analogous to the second example in Sec. I, 
scattering of a spin-zero particle from a spin-J particle 
(which we will take to be a nucleon) with an additional 
coupling of the spin-J particle to the axial-vector 
current. The answer will involve the corresponding 
matrix element, in which the axial-vector current is re¬ 
placed by the pion source. We write the pion-emission 
matrix element in the form 


M r 3 =out(b\JJ\a) in (N a N b y i 


=u{p 2 ) 


igr(k 2 )r 3 y j 


1 


iy(p2+k)+M N 
X T F [_s+rr k, t-, 0, 0]+7 ,/ ‘[j— r^k, t , 0, 0]] 


1 

X----- igr(k 2 )T>’ yi +ifA0) 

.iy(Pi-k)+M N 


The superscript j is an isotopic spin index (7 = 1,2,3). 
We no longer have the equation kjdj= 0 , since the 
axial-vector current is not conserved. Let D 3 be the 
matrix element of the divergence of the axial-vector 
current, 

N a NbD 3 =N a Nbk a M a 3 '= out(&| —idaJa AS \&)\n* (20) 

Here, as in Section I, k=p a —p b . The PCAC hypothesis 
relates matrix elements of the divergence of the axial- 
vector current to matrix elements of the pion source, 

MpfgA mj 

out Q> I da.Jet A3 \ fl)in = " out(^| J x J | ^)in j (21) 

gr(0) k 2 +mj 

where Mn and m r are the nucleon and pion masses, 
JJ is the pion source, g A =gA( 0)«1.18 is the weak 
axial-vector coupling constant, and g r ( 0 ) is the off- 
mass-shell pion-nucleon coupling constant. The [physi¬ 
cal coupling constant is g r mg r (— mj ); g r 2 /4ir«14.6.] 
We wish to emphasize that the PCAC hypothesis 
allows one to measure D 3 in purely strong interaction 
experiments. 

Since the axial-vector current is not conserved, we 
will need a slightly modified version of Lemma 2: 

Lemma 2': If k<Mj=D 3 and MJ= MJ'+MJ 11 
+0(k), where MJ 11 is independent of k and where 
kcMJ l =D 3 + 0(k 2 ), then MJ= MJ I +0(k) . This lemma 
leads to a modification of the recipe stated in Sec. I: 
(1) Write down MJ ext , the sum of terms in which the 


d _ 

+ih—TJ(k) 

dk\ 


+ 0 ( 1 ?) 

fc =0 


u(pi). 



We have explicitiy exhibited the Bom terms in the form 
given by dispersion theory, where residues are evaluated 
at the Born pole and so no nucleon-off-mass-shell terms 
are present. The way we write the Born terms serves as 
the definition of the non-Born part TJ(k). 

We are now ready to write down MJ ext , 


MJ ext =u(pi) 


igA 0?)yayi — 


1 


2 iy (Pi+k)+MN 
XT]]s+r 2 ’k t t, 0 , (Pi+ky+MJ 1 ] 

+ T[s—n-k 1 1, (px—kp+MifiO] 

1 r 


X ----- igA (k 2 )y a y 5 — 

iy(pi-k)+M„ 2 


u(pO 


MftgA ik a 

f- MJ. 

*,( 0 ) k 2 +m 2 



The term in brackets in Eq. (23) is the direct coupling 
of the axial-vector current to the external nucleon lines. 
The term proportional to MJ comes from the diagrams 
shown in Fig. 3; although this term is formally of first 
order in k , it can be important because of the small 
mass of the pion. 
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As we have seen in Sec. I, the truncated matrix 
element is obtained by dropping the negative frequency 
part of T and by neglecting off-mass-shell terms. This 
gives 

T 3 1 


s 




igA (k 2 )y«yt — 


2 iy- {pt+k)+Mtt 
XT^s+r^k, t, 0, 0]+r p [>-ri - t, 0, 0] 


1 

X— - igA (W)y a 7s— 

iy(pi-k)+M„ 2 


u{pi) 


MugA ika , , 

4-—- MJ+0{k). (24) 


gr( 0) k 2 +mj 


Using the identities 


u(pt)iyky s 


1 


iy-(Pt+k)+M N 

u(pl —y h +2M N y] 


1 


1 


iy' (Pt+fy+M* J 


1 


(25) 


iy (pi—k)+M N 


iykysu(pi) 




1 


= “75+ 


iy(pi—k)+M N 


■2Afyy 




we can calculate k a Mj ext ', 


k a M a 3 hgAT j yj,T p [s+ri'k, t , 0, 0] 

— r p [j— n-k, t , 0, OJigAT^yi 


+ 


Mxg A m % 


1 


t ; 75" 


& 2 +m r 2 I iy-(pi+k)+M N 
XT^+ri’A, t , 0, 0]+T p [j— ri-k, t , 0, 0] 
1 


X 


iy. 

MtfgA k 2 


T J ys 


gr( 0) P+Mt 2 

XTJ(k) 


MP 2) 


+0(P) \u(pO 


TJ(0)+h 


dk\ 


+0(k 2 ) 


k=Q 


u(pd . (26) 


In deriving Eq. (26), we have combined the Born terms 
in MJ with the divergence of the first term in Eq. (24), 
and have expanded the form factors g A (P) and g r (P) 
in powers of k 2 . 

We determine A MJ by the requirement that 


Fig. 3. Pion pole 
contributions to the 
axial-vector current 
matrix element. The 
axial-vector coupling 
is denoted by X. 





k a (M a 3 ***'+ AM a j ) = D j , with 


D j = —i 


MtfgA Mi 


gr( 0) k 2 +m T 2 


u(p1) 


X 


1 


^r(0)r y 7r 


iy - (pr\-k)+Mx 
XT P ls+r^k y tfi , 0 ]+T*[s-rvk, t, 0, 0] 
1 


X- 


iy-(Pi—k)+M N 


igr(0)T*y h +iTj(0) 


-\-ik\ — T r 3 (k) 
dk\ 


+0(P) 


fc- 0 


u(pO . (27) 


Comparing Eqs. (26) and (27), we see that k a AM a j 
must satisfy 


k a &Mj=u(p2) 


2&4T y 7 B r p [H-*v£, t, 0,0] 

M N g A 


+ T p [s-n-k , t y 0, 0]^r i 7B+ 


gr(0) 


X[ fr 3 (0)+h—T r j (k) I ]+0(P 
L dk\ U=<h 


“*(#*) |k^ y 76F p [>,;,o,o] 

MxgA _ 

+ 2 v CMA0!li&4 Tj YH-—7V(0) 


u(pi) 


(28a) 


-\-k a u(p2) 


gr( 0) 


r^gAT^v- r p [j,/,0,0] 

ds 


u(pl) 


- T p \j,t,0fi~$gAT i ysr-i a 

ds 


M N gA d _ 

+-TVW 


£r(0) dk a 


i=0 


u(pd+0(k 2 ). (28b) 


As the reader has undoubtedly noted, the nucleon prop¬ 
agator terms have exactly cancelled between Eq. (26) 
and Eq. (27), and so do not appear in Eq. (28a). In the 
term involving TV, the pion propagator has dropped 




205 


R12 


1272 


S. L. ADLER AND Y. DOTHAN 


151 


out altogether, since 


& 2 +w r 2 


-= 1 . 

k 2 +mj 



In going from Eq. (28a) to Eq. (28b), we have simply- 
expanded in powers of k and collected together the 
terms of zeroth, first, and second order in k. 

Since k a A is of first order in k , the zeroth-order 
terms on the right-hand side of Eq. (28b) must vanish 
identically. This gives 


u(p-dTJ(0)u(pd = —u(pd 


g r ( 0) 


+ ^[^0,03 


*r(0) 


2M 




N 


u(pi ). (30) 


This formula, which has been obtained previously, 8 
expresses the matrix element for the emission of a zero 
four-momentum pion in terms of the matrix element of 
the process without the pion. Equation (30) can be used 
to eliminate 7V(0) from the term proportional to M J 
in Eq. (24). Comparing the terms of first order in k, we 
find 

d 

fiahgATiys —r p [>,;,0,0] 

ds 


AMJ~ u{pi) 


- T P \j , /,0, Oj^gA B^la 

ds 

MxgA d _ 

f—- TJ(k) 


gr(0) dk a 


*= 0 


u(P0- (31) 


Adding this expression to the M a j ext ' of Eq. (24) gives 
the analog of Low’s result for the axial-vector case. 

A similar method can be applied to the case in which 
more than one current is acting. As an example, we 
consider the matrix element 9 



d 4 y e i9 - v o*t(b\T{J a Ai (x)J 9 {yy\\a)\ n - (32a) 


The only difference from the case treated above is that 
the divergence, in addition to having the term with a 
pion vertex substituted for the axial-vector vertex, also 
contains an equal-time commutator term. Following 
the procedure of this section, we can determine M a <r J \ 
apart from terms of order k and higher. If the divergence 
of J a is also known, we can apply the technique a second 
time, determining terms of order k which are independ¬ 
ent of q. This leaves an error which only involves terms 
of order qk and higher. 10 We will consider such a case in 
the next section, when we discuss radiative n capture. 

HI. APPLICATIONS 

In this section we apply the results of the previous 
section to several concrete examples. We consider first 
single-pion production from a nucleon by the axial- 
vector current. As an illustration of the use of our 
method in the strangeness-changing case, we discuss 
K e4 decay. We finally discuss the process of radiative 
M capture on a proton, an example in which two currents 
are present. 


1. Weak Pion Production 

We consider the process 

*(k,)+N(pd^l(kd+NW+r*(q ), (33) 

where the four-momentum of each particle is indicated 
in parentheses. Let MJ n be the axial-vector matrix 
element for this process, as defined in Eq. (19), with 

kkHW), 

out(61 =out(N(pz)ir n (q) | , (34) 

k = ki—k P =pi— {pr\-q ). 

In this case, T p [j,/,0,0] is the pion-nucleon vertex 
*£rY 5 T", which has no s dependence. Hence the d/ds 
terms in Eq. (31) vanish. Clearly the matrix 

element with the pion source substituted for the axial- 
vector current, is just the amplitude for pion-nucleon 
scattering. We find 


Calculating k a M a<r^i we get 


I a v s = ( d 4 ye i9 ' v /fcl—t— T[J a A *(x)J v {y)^aS 

d out ' I dX a I ' in 


= / d 4 ye iq ' v out(b\— i Ai (x)J „(yy]\a)- ln 


+J d 4 ye i <*'» ou t(b\—iT[d a J tx A t(x)J v (y)'l\a)i n . 

_ (32b) 

8 Y. Nambu and D. Luri($, Phys. Rev. 125, 1429 (1962); 
S. L. Adler, ibid. 139, B1638 (1965). 

6 In Eq. (32) we have neglected “seagull” terms, which will be 
included in the calculations of Sec. HI. 




T* 1 

igA (£ J ) Y«Y5--- 

2 iy -{Pi+V+Mh 


1 

X igr7hT n +ig r y h T n - 

iy-(pi—k)+M N 


XigAik*) y«Y6— 
2 


u(p{) 


MxgA ik a 
f- MJ', 

gr(0) k 2 +mj 


(35a) 


10 This method has been applied to the case when only vector 
currents are present by G. K. Manacher, thesis, Carnegie Institute 
of Technology Report NY0 9284, 1961 (unpublished'). 
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M J n = u{p£) 1 


1 


igr(k*)rfy 6 


+igr7lT n 


ij' (pt-hfy-hMx 
1 


tgr7lT n 


iy(Pi—k) +M n 

~ d - 
+if^(0)+ikx—T^(k) 

dk\ 

From Eq. (30), we find that 

u(pJiT^(OMpd 

U 0) 


4' r (^ 2 )r J 76 


+0(^) 


b=> 0 


u{pd. 

(35b) 


= —iu(pi) 


2M 


-T 3 ytig r yiT n 


N 


, . U 0) . 

+tgr75T n _ 


^u(pi} 


\grgr(0) , 
-gnj 


2M 


If 


u(pi) 


M n 

From Eq. (31), we have 

M N gA 


u(pl) . 


(36) 


A MJ n = 


gr(0) 


■u(p t ) 


f d 

• 

—TJ n (k) 


dk a 

k- 0 


u(Pi). (37) 


From the usual expression for the pion-nucleon scatter¬ 
ing amplitude, 11 we find (remembering that — k is the 
incoming pion four-momentum), 


u(pi) 


—TJ'{k) \u(p{) 

Ska 


d fr 

= i — u(pi)\ -A"*™ 

dk a IL \ 


/ k'(pl+pt) 


2 M n 


q-k 


vb= 


2M 


8 n3 ' 

N / 




Xi[r n ,r J ] tt(^0 life-0 


tu(pt) 


([ 


3A tN c+) 


-[ 


dvB 

\ 

dv pssp£ssfc*ssO 2A£ If 


(#!+#*)- 


”1 

- gnj 

23/ jy- 




=0 






u(p {). (38) 


11 G. F. Chew, M. L. Goldberger, F. E. Low, and Y. Nambu, 
Phys. Rev. 106, 1337 (1957). Note that, according to Eq. (34), 
—k is the ingoing pion four-momentum. 


Other derivative terms vanish at v=0 because of the 
well-known 11 ) crossing properties of A vN and B* N , 


A**n±)(- V , ...)=d=i4' w <±)(ir J 

B*n(±)(- Vj ...) = oFB*m±)(p 9 . ..). 



Since —k 2 is the (mass) 2 of the initial pion, Eq. (38) 
involves the pion-nucleon scattering amplitude ex¬ 
trapolated slightly off mass shell. Note that Eq. (36) 
is just the consistency condition on 7riV scattering, 12 


J[tW(+) 


grgr(0) 


y=zyj}=zk 


M N 



Equations (35), (37), and (38) give the two leading 
terms in an expansion of MJ n in powers of k , 

M a * n =Ma in ^+AMJ n +0(k ). (41) 


Alternatively, we can use the analog of Eq. (30) to 
find the leading term in an expansion in powers of q 
(the soft pion limit). In this case, one would take T p in 
Eq. (30) to be the axial-vector vertex. There will be an 
additional term in Eq. (30) arising from the equal-time 
commutator of the two axial-vector currents involved. 
Assuming the commutation relations postulated by 
Gell-Mann, 13 we find 14 


M a jn =MJ n ext '+AAf <* jV +O (q ), 

with 


Sr 0) 

A M a **'=i - 

2 Mn 


p, v (pi+pi) 

gA 2 Mn 


a 



+iySgA - 

L gA 



n v =3,70. 



Clearly, at the point q=k =0 we must have AM a * n 
= A MJ n \ At this point pi—pz and thus iy a and 
(P\+pi)a/ (23/y) are equal between spinors. Hence, 
consistency between Eq. (42) and Eq. (41) demands 


1 

1-= 


2M* 


N 


rdA* N ^ 


gA 2 gr(0) 2 L dv 


f 


] 


y=yB—fe 2 =g 2 —0 



which is the sum rule for the axial-vector coupling 
constant. 16 

12 S. L. Adler, Phys. Rev. 137, B1022 (1965). 

13 M. Gell-Mann, Physics 1, 63 (1964). 

14 Y. Nambu and E. Shrauner, Phys. Rev. 128, 862 (1962) j 
S. L. Adler (to bepublished); G. Furlan, R. Jengo, and E. Remiddi, 
Nuovo Cimento 44, 427 (1966). The dilig ent reader will actually 
find that in Eq. (42), and also in Eq. (45), we have dropped certain 
terms proportional to k a which are not singular at b?——m v 2 . These 
terms are, of course, determined by our procedure, but they are 
numerically Insignificant in weak pion production because k a , 
contracted with the lepton current, becomes proportional to the 
lepton mass. We have also in Eq. (45) neglected a very small 
extra term, proportional to $"*, whidi appears in Eq. (41) when the 
pion four-momentum q is taken off mass shell ["see W. I. Weis- 
berger, Phys. Rev. 143, 1302 (1966), Eq. (Il.lla)]. 

16 W. I. Weisberger, Phys. Rev. litters 14, 1047 (1965); 
S. L. Adler, Phys. Rev. Letters 14, 1051 (1965). 
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Comparing Eqs. (43) and (38), we may determine 
the terms linear in either q or k. Our final result is then 


with 

MngA rdA r *< + > 
A Mj n "=i —- 


gr(0) 


dVB 


?« 




8 nj 


we find that 


1 


Ma jn =MJ n ™ t, +AM a *"+0(qk y q 2 ) k i ), (45) — F i 


tn R 

1 

—1 

nt R 


- Ck — T rK [x t y,(k ) 2 ] 

x=y^=s(k ) s =s0 dX 

= cJ—T ,«[*,>, (A - ) 2 ] 

(fc') l =ai0 dy 


* =I/ .=(O 2 =0 


(52) 


I[D=y =(fc") 2 =0 


+ 


" gr(0) S / 1 \(/»l+/*2)a 

gAV 2 Mx 


Vapqp - 
-i—— 
2Mn 


gr( 0) 2 




4 

2M*n 2Mn 




u(pi) 



Hence, the K ei decay amplitudes at a point on the 
boundary of the Dalitz plot are related to the irK ampli¬ 
tude, with one K meson off mass shell. In terms of the 
conventional Mandelstam variables, the point 
x=y= (£“) 2 =0 is 

s={p++p~Y= — tn K 2 , 
t= (k~-\-p + ) 2 = — , (53) 

u= (k~—p~Y= — m T 2 . 


Unfortunately, it is doubtful if Eq. (46) will be of 
practical use, since there is a strong final-state inter¬ 
action leading to the (3,3) resonance, which is located 
only one pion mass away from threshold in energy. 
This makes it unlikely that k and q will be good expan¬ 
sion parameters. However, we will use the same method 
of comparing expansions in q and k in dealing with 
radiative n capture, where the final-state interaction is 
negligible and so the expansion may be physically 
interesting. 


2. K 0 a Decay 

Here we consider the process 

K+(k+) -> r*-(p+)+Tr-(p-)+e(k e )+v(k v ) . (47) 

Again the four-momentum of each particle is indicated 
in parentheses. Let the four-momentum carried away 
by the lepton pair be k~ y 


3. Radiative y Capture 

In this subsection we discuss the process of radiative 
M capture by a proton. This is an example of the situa¬ 
tion, discussed briefly at the end of Sec. II, in which 
more than one current is acting. Consider then 

^(.k^+pipi) —> v{k,)+y{k)+n{pt) , (54) 

and let 

q=k ll -k r (55) 

be the lepton four-momentum transfer. The matrix 
element for this process is given by 



(n\Ja W \p)U y y a (l+yi) 

1 1 

X- ieyxe^Up - 

iy* {k v .-k)-\-pip (2ko) 112 


k e +k y =k~. (48) 

The most general form of the axial-vector contribution 
to the decay matrix element is 

M a = (2k Q + 2p{i + 2pfr) llt out(ir+ir-\Ja A ' 

1 (49) 

= — LF 1 (p++p-) a +F 2 (p+-p-) a +F 3 k a -l. 
m K 

The form factors F are functions of the arguments 
x— {p + +p~) * kr, y~ {p + —p~)‘k~i and (£“) 2 . We define 
the matrix element for t + t“ —> K + K~ by writing 

(2k 0 +2p 0 +2p 0 ~)W ont(ir + ir-\j K \K+) in 

= iT rK£x t y, {k~ ) 2 ]. (50) 

Then if we assume PCAC in the strangeness-changing 
case, 16 

_ d a Ja A * A8 ~~’ 1 =CgiinK 2 4>K , (51) 

16 R. P. Feynman, in Symmetries in Elementary Particle Physics 
(Academic Press Inc., New York, 1965), p. 158. The constant Ck 


+ (ny | J a w \p )fl„Ya(l+Y5)M„ 



with ex the polarization vector of the photon and G the 
Fermi constant. The two contributions to T correspond, 
respectively, to radiation by the muon (which is nega¬ 
tively charged) and to radiation by the hadrons. The 
matrix element (n \ J a w \ p) is given by 



1/2 

(n\J« w \P) 


=iu{p i )LF 1 y^ q -my a -F i y^q-kyy a ^q-k) B 

+gA((.q-kyyY a yi-ih A ((q-k) 1 )y i (q—k) a ]u(pi). 

(57) 


is given by Ck= (M n+M &)gA AN / g KN a (0), with gA AN the A beta- 
decay coupling constant and g K the KNA coupling. For appli¬ 
cations of partial conservation of the strangeness-conserving axial- 
vector current to AT e4 decays, see C. G. CaUan and S. B. Treiman, 
Phys. Rev. Letters 16, 153 (1966) and M. Suzuki, ibid . 16, 212 
(1966). 
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Here, Fi v (t) and F^(f) are the isovector Dirac and 
Pauli electromagnetic form factors [Fi r (0)~l, 
Fi v (ty=H v /(2 Mn) 2, gift) is the axial-vector form 
factor, and h A (t) is the induced pseudoscalar form 
factor. Applying PCAC to the one-nucleon vertex of 
the axial-vector current, we find that k A (t) may be 
written in the form 


h A (t) = 


2M N g A £gr/gr(Q )] 


MO, 


(58a) 


If MT*gr(t)+tg r ~] 

r(t)=4 2M N g A (t)—2M N g A ———-- , 

tL gr(0)(t+m v 2 ) J 


(58 b) 


r(0)«2M^'(0), 


From Eqs. (60) and (62), we can deduce the gauge 
condition satisfied by 

/ piop2o\ li * 

(2k 0 - (ny\J r +\p)=etx*T w +x. (63) 

\ M 2 n' 


Replacing ex* by k\ in Eq. (62), and multiplying 
Eq. (60) by q aj we get 




q 2 +tn v 2 /#io# 2 o\ 1/2 

- ( - ) 

(y-^+wA M 2 n / 

q 2 +m* 

----\5fl(#s)i7tfr((?-A)*)«(#i) 

{q-k¥+m* 



which explicitly exhibits the one-pion pole part and the 
remainder r(t). 

We write (ny\J a w \p) in the following form: 

(2kopiopWM 2 N) ll *(ny\J a w \p)=ee\*M\ tx . (59) 

We wish to use our knowledge of the divergences of the 
vector and axial-vector currents to calculate A/\«, up to 
and including terms linear in q and in k. In order to do 
this, we have to know the quantities k\M\ a and q<M\a- 
The first of these may be determined by conservation of 
the electromagnetic current. When ex* is replaced by 
k\ in Eq. (56), the resulting expression must vanish. 
This tells us that 

hMxa= - (pufo/M*x) lt% (n\J* w \p). (60) 

In order to calculate ?aMx«, we made use of our knowl¬ 
edge of the divergences of the vector and the axial- 
vector parts of the weak current, 17 


Wheng 3 —— m T 2 , Eq. (64) becomes ^xTh-x= 0, the usual 
gauge condition for on-mass-shell pion photoproduction. 

Before stating the results for radiative n capture, we 
will discuss the significance of Eqs. (60) and (62). A 
more conventional way to proceed in calculating 
k\M\ a and q<M\* would be to contract the photon in 
Eq. (59), giving 

/pl0p20\ 1!i f m 
eMx«=(-) i I d A xe~ ik ‘ *(—□*) 

\M 2 n J J 


X(n\ 7ZAx(x)J m "(P)l\p) 


W {/— 

Xe(»1 (x)J a w (0)] |#)+Sx«], 


with 



Ju w =Jj+J* A , (61a) 

dcj a V = teAaJ J , 

d a Ja A = ieA aJa A + N W *g A /g r (0) ) , 

where A a is the electromagnetic vector potential and 
<p T + is the field which annihilates a positive pion. Equa¬ 
tions (61b) follow from the assumption of minimal 
electromagnetic coupling and from the divergence equa¬ 
tions in the absence of electromagnetism. (The factor 
y/2 in the axial-vector equation comes from the defini¬ 
tions of J a A and 0H-: J a A =Ja A1 —iJa AS and tfn- 
= (^t 1 — w^ t 2 )/\^.) Using Eqs. (61b) to evaluate 
(ny | d a J a w | p), we find 

/piopto\ l!i 

ex*?«Mxa=-(- *x*b\Jx w \p) 

\ M 2 n / 

'fZMjtgA Mr* / £ io £ 20\ 1/2 

+t-( 2&o-J 

gr(0) f+m w A M 


S\ a =J d < xe~ ih ‘ x b(x o) 

X(n\ldAx(x)/dxo,J a w (0)l | p), (66) 

where we have assumed that ^4x and J a w commute at 
equal times. The equal time commutator term S\ a in 
Eq. (65), sometimes called a “seagull” or “cata¬ 
strophic” term, describes the coupling of the weak and 
electromagnetic currents at the same point (see Fig. 4). 
It is a reflection of the extent to which A\ appears in 
J a w . Calculating k x M\ aj we now get 


/^ 10 ^ 20\ 1/2 f 

k\M\a=[ -J I dxQ5 tkox *8(xo) 

\m* n / J 


X 



d*xe-*"Jo EM (x),J a w 



+ 



1/2 

e~ l ik\S \ a . 



_ Xe~ l (ny\J r +\p). (62) 

17 S. L. Adler, Phys. Rev. 139, B1638 (1965). 



Fig. 4. A “seagull" diagram. 
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The commutator of the currents is 


s(*o)[/o EM (*),/ a no);i 

= — 5 C4) (x)J a w (0)+[possible gradient terms 

proportional to d«5 C3) (x)]. ( 68 ) 

The first term in Eq. ( 68 ) is the one conjectured by 
Gell-Mann 13 ; the possible presence of the gradient terms 
was pointed out by Schwinger . 18 We see that Eq. (60) 
implies that the Schwinger terms exactly cancel the di¬ 
vergence of the “seagull” terms. This cancellation has 
been proved by Feynman in a Yang-Mills theory and 
has been conjectured by him to be a general result . 18 In 
other words, when calculating the divergence of quanti¬ 
ties like M\a, if one neglects both the “seagull” terms 
and the Schwinger terms, one gets the right result. Note 
that the “seagull” terms cannot be dropped when cal¬ 
culating the matrix element Mx« itself. 

In order to state our answer for radiative n capture, 
we have to define the amplitudes for pion photoproduc¬ 
tion with the pion off-mass-shell. This process is re¬ 
lated by crossing symmetry to the matrix element 
{ny\ Jr+\p) in Eq. (62). We write the photoproduction 
amplitude in the following form , 20 


/ piopzo\ li * 

[2k 0 -) 

\ M* n / 


(N\J*\Ny) 


= epfX 2 *ii(Pz)< 




1 


t 

iy • (P2+q)+M^ 


1 

y xU+r 3 )-(M s +M y r 3 ) 

2M n J 


<T\gkt q 

+H 7x(l+r 3 )-—^( M s +M y r 3 )J 


L 


1M 


N 


1 


X 


iy(pi-q)+M N 

+^r(-m T 2 )[r J ',r 3 ]7 B - 




q-k)x 


(q—kf+Mi r 2 

+£ 0 8 x[F/+)i6^+F/-)l[rV 3 ]+F/°Hr^ 


4 

£ 


+ ( i+ ~h 


g r (- m/) - g r ( 0 ) 


tn, 


] 


u(p)X iex, (69) 


16 J. Schwinger, Phys. Rev. Letters 3, 296 (1959) and Phys. 
Rev. 130, 406 (1963). 

18 R. P. Feynman (private communication). 

20 G. F. Chew, M. L. Goldberger, F. E. Low, and Y. Nambu, 
Phys. Rev. 106, 1345 (1957). The amplitudes (Yi.Y^Yj.Y^, 
as defined in Eq. (69). are respectively double the corresponding 
amplitudes (<4,£,C,Z)) ^ ±0) of CGLN. [The isospin matrix elements 
in Eq. (69) are one-half those of CGLN.] 


where fa is the isospin wave function of the pion, X x and 
Xi are the nucleon isospinors, k is the ingoing photon 
four-momentum, and q is the outgoing pion four- 
momentum. The isoscalar nucleon anomalous magnetic 
moment has been denoted by 2MnFi s {$) = ii s 
= — 0.12]. The four-vectors 0 a x, which satisfy £x0*x—0, 
are given by 

0ix=^75(7x7’£— 7*^7x) j t?i=l 

Oix—iyi^ipi+p^xq^k— (Pi~hpz) m kq\2> 1 
03 \=y 5 (y\q-k—ykqx) , m=— 1 (70) 

04x=7bC7x(£i+/>2)*£— y m k(pi-hpt)\i 

— iM N 7 b (7x7 ■ k— 7 ■ ky\), 174 = 1 

The amplitudes V 6 are functions of the invariants 
<7*, k 2 , v y and vb, with 

v= — k* (^i+^ 2 )/ 2 Af^, VB=q'k/2M N . (71) 

The bar on top of the F, is a reminder that the Bom 
term has been separated off. The numbers rj 8 specify the 
crossing properties 20 of the amplitudes F„ 

F s tt.°)(_,, ...)~l.(=bl, 1)F S <±‘<»(.,*•■)■ (72) 


The terms explicitly proportional to (1 +<f/m ir 2 ) in 
Eq. (69) are necessary to satisfy Eq. (64), the gauge- 
invariance requirement when the pion is off-mass-shell. 
Since 


g r {—mf)-gr( 0) IB, 1 
*/(0)+—-;-0) , 


tn* 



the gauge-invariance term is numerically very small. 
The matrix element ( 7 «|/h-|/>), which is the one 
needed in Eq. (62), is obtained from Eq. (69) by the 
replacements 



Since the final nucleon is a neutron and the initial one 
is a proton, we have 



We can now state the result for radiative m capture: 

ft qk\ 

+Mx«*+0(-,-). (76) 

\tn R 2 mft 2 / 

The mass mg, which characterizes the terms neglected 
in our calculation, will typically be several pion masses 
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We discuss numerically the restrictions imposed by the partially conserved axial-vector current (PCAC) 
on the pion photoproduction amplitude Fi (+) (0) and on the pion electroproduction amplitude Fa (_) (0). 

We find that the magnetic-dipole dominance and the narrow-resonance approximations are unreliable. 

The nonresonant s waves make an important contribution to Fi (+) (0), and we find that the PCAC pre¬ 
diction for tins amplitude is reasonably well satisfied. The electric and longitudinal multipoles appear to 
make a much bigger contribution to Fj h (P) than does the magnetic dipole M i+, which is strongly sup¬ 
pressed by the kinematics. 


I. INTRODUCTION AND CONCLUSIONS 

A S has been much emphasized recently, 1 the partially 
-*** conserved axial-vector current (PCAC) hypoth¬ 
esis, supplemented by current commutation relations, 
relates any weak or electromagnetic process in which a 
zero four-momentum pion is emitted to the same process 
in the absence of the pion. In particular, when applied 
to pion electroproduction, PCAC implies the relations 2 

(fr(0 )/Jf »)*/(**) 


= VyM(v=VB=(M r f )*= 0,k ! ), (la) 

fe r (0)/Jf w )IV(**) 

= K I «°)(r-ra=(Jf.O* = 0,A*), (lb) 


gr(0)pA(£ J ) 

Mn L #a(0) 




-1 


= = k 2 ). 



Here Fi v Q?) is the isovector nucleon Dirac form factor; 
F^ik*) and Ft 8 (k*) are, respectively, the isovector and 
isoscalar nucleon Pauli form factors; gA^k 1 ) is the 
nucleon axial-vector form factor [£a( 0)= 1.18]; and 
g r (0) is'the pion-off-mass-shell pion-nucleon coupling 
constant [&■—£»■(—Af* 2 ), g r 2 /4T« 14.63- The pion 
photoproduction amplitudes Fi (+ * 0) and the pion 
electroproduction amplitude V 6 C “ J will be specified 
more precisely below. When £ 2 =0, Eqs. (la) and (lb) 

* Junior Fellow, Society of Fellows. 

t National Science Foundation Postdoctoral Fellow, 1965-66. 

1 Y. Nambu and D. Lurid, Phys. Rev. 125, 1429 (1962); Y. 
Nambu and E. Shrauner, ibid. 128, 862 (1962); S. L. Adler, ibid. 
139, B1638 (1965); M. Suzuki, Phys. Rev. Letters 15, 986 (1965); 
C. G. Callan and S. B. Treiman, ibid. 16, 153 (1966). 

2 These relations are contained implicitly in the weak pion 
production results of Nambu and Shrauner (Ref. 1). The covariant 
forms have been derived by a number of authors: S. L. Adler, in 
Proceedings of thelntemational Conference on Weak Interactions, 
Argonne National Laboratory, 1965, p. 291 (unpublished); 
Riazuddin and B. W. Lee, Phys. Rev. 146, B1202 (1966); 
G. Furlan, R. Jengo, and E. Remiddi, Nuovo Cimento 44, 427 
(1966). 
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become the photoproduction relations of Fubini, 
Furlan, and Rossetti 3 ; and Eq. (lc) becomes a relation 
between the axial-vector and charge radii of the nucleon. 

The main purpose of this paper is to give a careful 
numerical analysis of Eqs. (la) and (lc) at& 2 =0. In 
the dispersion integrals for Fi (+) and PV""* we keep 
only the multipoles which resonate around the #*(1238) 
and the #**(1520), and the nonresonant s waves. As a 
preliminary, in Sec. II we state the needed kinematics 
and briefly derive Eqs. (1). In Sec. Ill we give the 
numerical discussion, using the photoproduction analy¬ 
ses of Schmidt and Hohler 4 and of Walker 6 in the region 
of the first two pion-nucleon resonances. 

We reach the following conclusions: 

1. The magnetic-dipole (Af 1+ ) contribution to 
Fi c+) (0) from the neighborhood of the #*(1238) equals 
only about 0.75 times the left-hand side of Eq. (la). 
Estimates based on the narrow-resonance approximation 
indicate a larger Mi + contribution, but we find that the 
narrow-resonance approximation for the #*(1238) 
overestimates integrals over the resonance by about 
60%. When the resonant Ei+ , M 2 _, and £ 2 - multipoles 
are included, the value of PV +) (0) is reduced to about 
0.6 times the left-hand side of Eq. (la). However, the 
nonresonant s waves make a large contribution to the 
integral, 6 making the total integral for PV +) (0) equal 
to about 0.85 of the value predicted by PCAC. 

2. The dispersion integral for PV'^O) is not mag¬ 
netic-dipole-dominated, because the Mi+ contribution 
is kinematically suppressed. For instance, the multipole 
Ei+ (electric quadrupole) in the #*(1238) region makes 
a contribution three times as big as the multipole Af 1+ 
to PV _) (0), even though the £i+ multipole is much 

»S. Fubini, G. Furlan, and C. Rossetti, Nuovo Cimento 40, 
1171 (1965). 

* \V. Schmidt and G. Hohler, Ann. Phys. (N. Y.) 28, 34 (1964); 
W. Schmidt, Z. Physik 182, 76 (1964). 

8 R. L. Walker (private communication). 

8 The nonresonant s wave also makes an important contribution 
to the sum rule relating the isovector nucleon magnetic moment 
and charge radius to photoproduction cross sections—see F. J. 
Gilman and H. J. Schnitzer, Phys. Rev- 150, 1362 (1966). 
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smaller than the M i*. The value of depends 

sensitively on the hard-to-measure longitudinal multi¬ 
poles. Under the dubious assumption that the known 
proportionality of longitudinal and electric multipoles 
for zero photon momentum holds unchanged for large 
photon momenta as well, Eq. (lc) predicts an axial- 
vector form factor which falls off somewhat more 
slowly with k 2 than does Fi v (k 2 ). 

The results of this paper should not be regarded as 
final, since the input multipole data may change as 
better analyses of photoproduction become available. 
What is definitely indicated, however, is that a compar¬ 
ison of Eqs. (1) with experiment must avoid unreliable 
narrow-resonance and M ^.-dominance approximations. 


H. KINEMATICS AND DERIVATION OF 
PCAC RELATIONS 

A. Kinematics 

Let us consider the reaction 

7 (k)+N(p 1 )^ir(q)+N(p i ) J (2) 

where the initial gamma may be real or virtual. The 
external particle masses are, respectively, 

-k 2 , -p x 2 =M N 2 , -f=(M/?, -pt=M N 2 . (3) 

We define invariant-energy and momentum-transfer 
variables v and vb by 


y= — (Pi+p2)‘k/(2M #), v£=q‘k/(2Mff); (4) 

these are related to W, the invariant mass of the final 
pion-nucleon system, by 

w-wb=<W 1 -MJ)/(2M n ). (5) 


All nomnvariant quantities used in this paper refer 
to the reaction center-of-mass frame, in which k+px 
= q+p2=0. We denote by y the cosine of the angle 
between the photon and pion directions: 

( 6 ) 

and by |k| = (k<?+k 2 ) 112 and | q| — (?o 2 “ (M/) 2 ) 1/2 the 
photon and pion momenta. The photon and pion 
energies are given by 



W 2 -M N 2 -k 2 
2 W 


Qo 


2 W 



The matrix element for the electroproduction reaction 
of Eq. (2) takes the form 

* r e x „t<x (q)N(pt) | Jx n +J\ Y \N(pi )), (8) 

with e r the electric charge, ex the virtual photon 
polarization vector (which satisfies £*e=0), and with 
J\ n and J\ Y , respectively, the third component of the 
isospin current and the hypercharge current. The 


x 

p. 

Fig. 1. Bom approx¬ 
imation diagrams. 




q 


P 2 


k 

wvwi- 

I 

I 


q 



isospin structure of the matrix element is given by 

out(irN\JJ\N)=aWVx«», 

with 7 


<z<±) = X/*pc*l (r c r 3 zb T t T e )Xj , 
a<o) = X /^ c *|r c X/. 



In Eq. (10), p Cj X/ 7 , and X* are, respectively, the 
isospinors of the final pion, the final nucleon, and the 
initial nucleon. The space-spin structure of the matrix 
element is given by 


exF x (± - 0) =E 

/*=i 

xa(pt)o(vMPi) > (W 

Defining {a,6} = a* <£•£ — a-kb-e, we may take the 0(F/) 
as 

0(Vi) = \iy*{%y} , 

0(Vj)=*75{#i+^i> ?} > 

0(V 3 ) — 7s{7>?} , 

0(Vi) = y*{y, px+pi}— iM N y b {yjy) , 

0(7 5 ) = i75{^), 

0(V^ = yz{k,y}, 

The numbers ijj y specify the crossing properties of the 
invariant amplitudes: 

= (±,+)v; v V^'H-v, *b, (MS) 2 , k 2 ). (13) 

To make the normalization precise, we state the 
contribution of the Born approximation diagrams of 
Fig. 1 to the invariant amplitudes. [In the following 
equations we take the external pion to be physical 



7 Our notation follows that of a redew article on pion electro- 
and weak production in preparation by one of the authors (S.L.A.). 
Our amplitudes are related to those of CGLN fG. F. Chew, 
F. E. Low, M. L. Goldberger, and Y. Nambu, Phys. Rev. 106, 
1345 (1957)] as follows: 

covariant amplitudes—[F i,F*»Fi»F 4] (±,0) thia paper 

= 2[ri,B,C,D](±-oW N , 

center-of-mass amplitudes—this paper 

— (87rW/iWjv)[]£F, v ] (±,0 ^oaLN, 

multipoles— |~Aft+, etc.] (±,0) tfai a paper 

*= (&irW/MN)[.Ml+ t etC.] (±,0) oGLN- 
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( M/=M X ); F r (k 2 ) is the pion charge form factor.] 
g r F Sik*) 

Fl (±)B = _!-1 


F*<±>*= 


F»<±>*= 


iM/pVB \VB 

gfl V (k 2 ) 


2 M N \ VS~~V J'tf+J'/ 

grFSQt*)/ 1 1 \ 

\vB~y j’s+j’/ 


7 4 (±)b= 


2M n 

grFJ{k*)f 1 


\VB V VB^vJ 


2 Ms 


) ■ 

\VB~~V J'B+J'/ 


(14) 


7 6 (+)b= 0 , 7 6 <-)b=__^ 


-2 g r /FS(k*) 2 Frik*) 


k 2 \2 Mnvb 4 M^vb 


n \ 
-k v ’ 


f 6 < ±)B =o. 


While the consequences of PCAC are most simply 
expressed in terms of the invariant amplitudes Fy, pion 
photoproduction and electroproduction experiments are 
most easily analyzed in terms of the center-of-mass 
frame amplitudes $j v , defined by 7 

e 

cx Fx c±,0) =E , (15) 

y—i 

Here X/ and X* are the nucleon Pauli spinors, and the 
S's are chosen as follows: 

2i r — i(a- £—o t'kk‘ z) , 2i v =iv§(§- z-Q-kk- e), 

2 2 y =<r £or* (&Xe), X 5 V = — ik* 0 ‘kk‘ z/ko, (16) 

2a v =io‘k(fi‘ z—^‘hh‘ e), 2$ v =—ik 2 cr-§k- z/ko. 


The amplitudes 3 : j v have simple multipole expansions 7 : 
£ (lM^+E^)P l+ /(y) 

i-0 



W+VM^+E^P^/iy) 


} 


$ 2 V = E [(I+l)I 1 ++(I I -]Pi'(y), 


Z =1 


3^= E (-M^+E^P^'iy) 


Z =1 


+ E (M^+E l J)P l ^ ff (y), 


t -3 


(17) 


sf 4 7 = E 

Z =2 

E (z+^P^+i'Cy)- E u^-Pw'(y), 

z =0 Z -2 

E Ea^-cz+i^lP^). 

Z—1 

The index /d= of the multipole specifies the orbital 
angular momentum (l) and the total angular momentum 
(7—/zb|) of the final pion-nucleon system. It is 
straightforward, but tedious, to calculate the linear 
transformations connecting the amplitudes Vj and 5j v . 3 

B. Derivation 

The PCAC relations of Eq. (1) come from the identity 


iId*x e- i <‘-^*(-n l +M^)(N(p 2 ) | T[_d,J^(x), (Jx I3 (0)+Jx r ml\N(p 1 ))e k 

= -ild i xe- i <‘^*(-O I +M^S(x 0 )(N(p 2 )\ZJ B ^(x),Jx n (0)+Jx T (0)l\N(p{))^ 

-q,jd*x e-^^*(-n x +M, 2 )(N(P,) I TlJ, Ac (x), (Jx n (0)+Jx r (0m\N(pi))ex, (18) 


which is obtained by integration by parts. Using the partially conserved axial-vector current hypothesis, 9 

M N M x 2 g A 

doJ ff ^ c (^)— SPx c (^) j 

gr( 0) 

we see that the left-hand side of Eq. (18) is just 

MpjMSgA 6 _ _ _ 

-E ^(/>2)O(F;)w(/»i)Qa (+) F/ (+) +a ( ” ) 7/” ) +a (0) 'P/ 0) ]+Bornterms, 

gr{ 0) f-* 

where V$ denotes the non-Born part of the amplitude V/. 


(19) 


( 20 ) 


•See, for example, R. Blankenbecler, S. Gartenhaus, R. Huff, and Y. Nambu, Nuovo Cimento 17, 775 (1960); P* Deanery, 
Phys. Rev. 124, 2000 (1961). 

• M. Gell-Mann and M. L6vy, Nuovo Cimento 16, 705 (1960); Y. Nambu, Phys. Rev. Letters 4, 380 (1960). 
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Let us evaluate the two terms on the right-hand side of Eq. (18) in the limit as q —» 0. The equal-time com¬ 
mutator term approaches 

-WMQHrt 11" f d 3 xJ a Ac (x), A n (0)+A y (0)l| I NiPr^ey. 

lJ JI r 0 =o 


( 21 ) 


Because of the integration over all space, possible gradient terms in the commutator do not contribute, and we 
find for this term 

(M T i g A (k i )/k^u(p i )0(V,)u(p 1 )^ (22) 

(To simplify the algebra we have dropped terms proportional to £*€=0.) The term proportional to q aj in the 
limit as q c —» 0, can be evaluated by keeping only the one-nucleon-pole terms. 10 This gives 




r e ypi^iM^ 

igAyqys -^Crx(^ r a 2 )r3+F^(^))-(rxA(i ? 2 r (^)r 3+W 2 ))] 

2 — 2p2' q 


ypx+iM N t c 

+ ¥lyx(Fi r (^)r 3 +^ s (^))-tr x ^(E^(^)r 3 +E/(^))]- ig A yqy 5 ~ 

2p v q 2 


u(pi)t\ 


= MSg A 


Fx v (&) 

- u{p2)0(V^u{p x )+{a^F2 V (k^+a^F2 s (k^ 


k 2 


Fx v (k 2 ) 


[" a( +) { -1--^l«(^O0(^i)w(£i)+the other Born terms 

L \VB — v VB^V/ \VB — v J'B+J'/J 


(23) 


Comparing Eq. (20) with Eqs. (22) and (23), we get the relations 

(g r (0)/^)E^(^) = Fx^(^,B-(M.0 2 -0, k 2 ), 

(g r (Q)/M N )F2S(k*) = VxV^v=vB= (M/)*= 0, k 2 ), 

g r (0)rgA(^ 2 ) 

.. (&y~ i =V't i ‘~ ) (v=vB= (if/) 2 —o, k*). 


(24) 




M N Lg A (0) 




If we take v=vb= 0 to mean “first set *^=0, then set ?=()” the bars in Eq. (24) may be dropped, since the Born 
approximation to Fi vanishes at ^b= 0 (for all v¥^ 0). This completes the deviation of Eqs. (1). 

in. NUMERICAL ANALYSIS 

We now proceed to a numerical analysis of Eqs. (la) and (lc) at £ 2 —0. Introducing the abbreviations Fj (+) (o) 
ES Fi <+) (i'=i'B= (MJ) 2 = & 2 =0), Fe (_) (0)= Fe <_) (i'=i'B= (M*0 2== ^ 2== 0), we write the equations in the form 


*» 


M 


-FS( 0) 


N 


Sr K. <+>«)), — r^-^-l ? 1 y '(0)l=-^-F 6 <-)(0). 

J £r(0) 


gr( 0) 


-MtfLgA(O) 


(25) 


In order to calculate Fi (+) and Fe ( ' ) from experimental photoproduction data, we assume that Fi (+) and F6 C “ ) 
both satisfy unsubtracted fixed-momentum-transfer dispersion relations in the energy variable v 11 : 


Vx^^vbXMJ)^) 


-gr{-(MmFx v (k*) 


2M 


N 


(—+—) 
\VB—V VB^TVf 


l r°° 

+~ / 

7T J pb -Mf +Af 


3/ImKj«->(r>a,(Jf,0 *,**>(—■+~~) » (26) 

»V( 2 Afjv) \V f —V /+?/ 


10 See S. L. Adler, Ref. 1, where the rules for calculating the “pole insertions” are discussed. In Sec. IIB we have ignored questions 

of gauge invariance. It is easily shown [S. Adler and Y. Dothan, Phys. Rev. 151, 1267 (1966), and M. Nauenberg, Phys. Letters 22, 201 
(1966) j that when the final pion is off mass shell, the photoproduction or electroproduction amplitude is not divergenceless, but has a di¬ 
vergence proportional to In order to maintain the correct divergence, additional terms must be 

added to the Bom approximation calculated from the diagrams of Fig. 1. However, these additional terms vanish when p** &•€ = (), 
and thus do not affect the results of this paper. See also S. Fubinij Y. Nambu, and A. Wataghin, Phys. Rev. Ill, 329 (1958). 

11 Validity of the unsubtracted dispersion relation for Fi (+) (0) is indicated by the Regge-pole analysis of photoproduction given by 
G. Zweig, Nuovo Cimento 32, 689 (1964). 




R13 


215 


1464 


S. L. ADLER AND F. J. GILMAN 


1 

V^(vyV B y(MJ)\k 2 )=- 
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>oo 


vB+M v +M r */(2MN) 


di/ ImPV ^v'yVBy 
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which imply that 


gr 

[^<+>(0)] 

1 1 

2 

r W gr 

/ -Im 

Fi'+V, 0,0,0) 

gr( 0) 

[pv-’Co)! 

T . 

}M*+MJ/2Mir V g r (0) 

[f 6 <->(/,0,0,0)1 



In order to calculate the integrand of Eq. (27), we make a multipole expansion, keeping only those multipoles which 
can at present be determined from experiment. These are: (i) the nonresonant s-wave multipoles and L^. 
The E(y+ makes a large contribution to charged pion photoproduction; (ii) the multipoles Afi+< 3/2) , £i + < 3/2) , and 
L 1+ ( zn) , which are important around the /=fiV*(1238); (iii) the multipoles and which 

are important around the /=§iV**(1520). 

Doing the necessary arithmetic, we find 

gr 2 M n g r 

—-— PV +) (p, 0,0,0)—-[i£a f <^>+f£a f <v«+|if^(W)+ 2 £ 1 + (V«_jf w (i/« + ^(V«]|^^ (28a) 

gr( 0) W 2 -M N 2 g r (0) 


gr 2 M n 1 gr rE^'M-Eo.M 1+ ™v 

-Fa <_) (*',0,0,0)=--+- 

gr( 0) W*-M N 2 3gr(0) L W-M n W 2 —M# 2 W+M n 

3(3W+M n )E x +^ 2) 8PFL 1+ < 3 ' 2 > ZMiJW (M N -5W)E^ 1 ^ 8 

W*—M n 2 W^-Mn 2 W+Mn + W 3 -Mx 2 W 2 -M n 2 1 


. (28b) 


The multipoles appearing in Eq. (28) are not actually the physical multipoles, since they refer to zero final 
pion mass (M/= 0). We relate them to the physical multipoles by the prescription 


' M ' 

1 1 ) 


’ M ' 



gr(0) 


• E 

. 

■ « 

E 



gr 


. L . 

*± 

1 

© 

n 

55 

. L . 


U) 


i± 




where the subscripts on |q| indicate that | q| is to be 
computed from W with MJ —0 or M respectively. 
The prescription of Eq. (29) gives the unphysical 
multipoles the correct threshold behavior and, approx¬ 
imately, the correct nearby left-hand singularities . 12 
Using Eq. (29) eliminates the obnoxious factor g r /gr( 0 ) 
in Eq. (28) and leaves us with simple integrals over the 
physically measurable multipoles. 

From pion-photoproduction experiments, the electric 
and magnetic multipoles can be measured. However, 
the longitudinal multipoles can only be measured in 
pion electroproduction experiments; so far little data 
is available. Consequently, we will have to make a 
guess as to the magnitude of the longitudinal multipoles. 
When the photon momentum |k| approaches zero, the 
longitudinal and electric multipoles become propor¬ 
tional with known coefficients , 13 

Li+/Ei+-^1, l> 0 , 

_ l> 2 . 1 J 

12 For a more detailed discussion see S. L. Adler, Phys. Rev. 
140, B736 (1965). 

13 J. D. Bjorken and J. D. Walecka, Ann. Phys. (N. Y.) 38, 35 
(1966); Y. Dothan and R. P. Feynman (private communciation). 


For want of a better estimate, we will assume that these 
proportionalities hold for nonzero |k| as well. In other 
words, we take 

Lfy+ZzEfyjr , Z/ 2 —^— TjEz— (31) 

in the numerical work described below. 

A. Narrow-Resonance Approximation 

We begin by discussing the narrow-resonance approx¬ 
imation for the magnetic dipole (Afi+< 3/2) ) contribution. 
It is convenient here to use the CGLN model 14 for 
Afi + C3/2) , which, as Schmidt and Hohler 4 and Schmidt 4 
have shown, is in good agreement with photoproduction 
experiments. According to this model 

,, 4.70g r W |q||k| exp(iS 3 iJ )sin 8 s ,3 

Mi+W 2) = -, (32) 

3 M n M n 2 if\q\*/M 2 

with / 2 =0.08, 5 3 ,a the pion-nucleon scattering phase 
shift in the (3,3) partial wave, and | q| evaluated with 


14 G. F. Chew, F. E. Low, M. L. Goldberger, and Y. Nambu, 
Phys. Rev. 106, 1345 (1957). 
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Table I. Parameters for multipoles. 


Resonance 

Wr 

(units of 
M t ) 

. klfi 
(units of 

M x ) 

r R 

(units of 
Mr) 

Multipole 

A 

(units of 
Mr~ l ) 

AT* (1238) 

8.85 

1.65 

0.860 

M 1+ <W 

Et + W*> 

+0.112 

—0.0080 

N**( 1520) 

10.80 

3.20 

0.860 

E* W*) 

+0.0155 

+0.0628 


MJ=M V . Substituting Eq. (32) into Eq. (27), we find 
for the magnetic-dipole contribution to Fi (+) ( 0 ) 


gr 8 4.70 

V (0) I magnetic dlpole = 7 , 

M n 9 2 M n 

1 f°° sin 2 5 3 , 3 

7=- / dW - 

ir J m n +m x [| q|J£/V M T 2 



According to the narrow-resonance approximation , 16 
7=1, giving 

El (+) (0) | magnetic dipole (narrow resonance) ~0.62 / /A7 x 2 , (84) 

to be compared with the value predicted by PCAC 
[the left-hand side of Eq. (25)3, 

(gr/M N )F, v (0)^0.55/Mr 2 . (35) 

Actually, the narrow-resonance approximation is very 
misleading. Direct numerical evaluation of 7, using the 
experimental (3,3) phase shift , 16 gives 7=0.63, so that 
actually 

El (+) (0) | magnetic dipole ~0. 39/I, 2 . (36) 

In other words, the narrow-resonance approximation 
overestimates the integral 7 by 60%. [The narrow- 
resonance approximation is also misleading when used 
to evaluate the gA sum rule. If only the (3,3) contribu¬ 
tion to this sum rule is kept, one gets gA^lA when the 
integral is evaluated using the experimental irN cross 
section , 17 and gA =3 when one uses the narrow-resonance 
approximation.3 To sum up, the narrow-resonance 
approximation for the TV"* (1238) is useful for making 
order-of-magnitude estimates, but should be avoided in 
quantitative tests of sum rules. 


B. Resonant Contributions 

We turn next to the evaluation of the resonant 
contributions to Vi (+) ( 0 ) and PV -) (0), using Walker’s 
photoproduction analysis. Walker 6 has parametrized 
each resonant multipole SfTC around the TV"*(1238) and 


15 G. F. Chew, F. E. Low, M. L. Goldberger, and Y. Nambu, 
Phys. Rev. 106, 1337 (1957). 

19 We obtained the same numerical result using the (3,3) 
phase-shift parametrizations of Schmidt (Ref. 4) and of L. D. 
Roper, University of California Report No. UCRL-7846 (un¬ 
published). For an independent evaluation of this integral, see 
D. Lyth, Phys. Letters 21, 338 (1966). 

17 See W. I. Weisberger, Phys. Rev. Letters 14, 1047 (1965); 
S. L. Adler, ibid. 14, 10?1 (1965). 


Table II. Multipole contributions. 


Multipole 

Contribution to 

[gr/gr(0)]7l< + >(0) 

(units of M T ~ 2 ) 

Contribution to 

tgr/g r (0)lFe^>(0) 

(units or M x ~ z ) 

E 0+ (,/2) 

+0.055 

+0.0329 

Eo + W» 

+0.081 

-0.0212 

W 1/2) 


-0.0365 

Lo + W 


+0.0238 


+0.413 

+0.0133 

E 1+ <W) 

-0.088 

+0.0471 

7i+ (8/2) 


-0.0333 


-0.031 

+0.0018 

EuS'M 

+0.042 

—0.0305 



+0.0281 

Total 

+0.472 

+0.0255 

PCAC prediction 

+0.550 

gr[gA( 0) “I 

- -^'(O) \M 

3IjvLgA(0) J 

|"g/(0) 0.045 

»2 MA - + - 

Lg„(0) M, 2 


the ^**(1520) in the form 18 

SirW -<4(| q| a/1 q| ) 2 r/2 

3E=-, 

M n Wn-W-iV/2 


/ I q| \ 3 l+0.77351 q| r 2 /M t 2 

r=r fl (-)- 

\|q| J l+0.7735|q|YJf T a 



The parameters A , lb*, Wr , and | q| ^ are listed in 
Table I. Using Eqs. (37) and (31) we have calculated 
the integrals for Fi< +) ( 0 ) and F 6 (-) ( 0 ), obtaining the 
results listed in Table II. 

We note, first of all, that according to Table II the 
J7i+ (3/2) contribution to PV +) (0) is 

V i^"*" ) (0) | magnetic dipole ™0A1/M T \ (38) 

in good agreement with the value of 0.39/Af,r 2 obtained 
above from the CGLN-Schmidt-Hohler work. The 
multipole £i+ (3/2) makes a significant contribution to 
the sum rule because it appears in Eq. (28a) with a 
coefficient three times as large as the coefficient of 
Afi+ (3/2) . Walker’s Im£i+ (3/2) has a constant negative 
sign across the ^*(1238). If the suggestion of the CGLN 
model 14 [that Im£ 1+ < 3/2) changes sign from negative to 
positive around the (3,3) resonance peak3 should prove 
to be correct, then the value for the Ei + (8/2) contribution 
given in Table II may be an overestimate. 

Looking at the contributions to PV -) (0), it may at 
first seem surprising that the small Ei+ (3/2) multipole 
makes a much bigger contribution than the large 
Afi+< 3/2) multipole. But a glance at Eq. (28b) shows 
the reason why—the ratio of the coefficients of M i+< 3/2) 

18 Equation (37) does not give the multipoles M i_, the 
correct threshold behavior, but the iV**(1520) is far enough from 
threshold so that this is not too important. To make the off- 
mass-shell correction we have multiplied each 3TC by [| q| juv-o/ 
Iqlw-J/i], so that the off-mass-shell multipoles all have the 
correct threshold behavior. 





R13 


217 


1466 


S. L. ADLER AND F. J. GILMAN 


152 



BMfl. 9M*. lOMfj- 

CENTER-OF-MASS ENERGY W 

Fig. 2. Ratio of electric to longitudinal multipoles, for ft 2 — 0. 


and £i+ (3/2) in the integral for F6 (-) (0) is 


C. Nonresonant S Wave 


It is well known that there is an important 5-wave 
contribution to charged-pion photoproduction. Since 
the 5-wave pion-nucleon phase shifts are of order 
15°-20° in the low-energy region, the imaginary parts 
of the j-wave amplitudes will make an important 
contribution to the integrals for Fi (+) (0) and PV -) (0). 
We estimate this contribution as follows. The Born 
approximations for the 5-wave multipoles E(h- (±, 0) are 14 


W-M n 4.70 


£,*<+>*«- 

t 

E^ B & 


Mu Mr 


1 

- 1— F 2 J 

'1+PVl 

£ <-)*«- 

1+-In 

—) > 

m t 

2F > 

d-7/J 


W-M n 0.88 


M n M r ' 
(43) 


r=|q|/?o- 


1 r-3(3PF+jkf w )-i-‘ (W-M k ) 

- - =-, (39) 

W+MtX- W*-MJ J 2,(2>W+Mh) 

which is numerically 0.02 at the peak of the 
W*(1238). In other words, the Afi+ (3/2) contribution is 
very strongly kinematically suppressed. The longi¬ 
tudinal multipoles contribute with strength comparable 
to the electric multipoles. To emphasize the dubious 
nature of the approximation of Eq. (31) for the longi¬ 
tudinal multipoles, we have computed the Born 
approximations Ei+ (8/2) and L i+ (3/2) from the diagrams 
of Fig. 1, splitting them into parts proportional to the 
electric charge e and the difference of the nucleon total 
magnetic moments ju p —ju n : 


Pion-photoproduction experiments, as analyzed by 
Schmidt, 4 indicate that (i) in charged-pion photo¬ 
production, the multipole is equal to the Born 
approximation; (ii) in neutral-pion photoproduction, 
(Mn/W)E(h. is independent of energy, and at threshold 
is roughly one-half of the Born approximation. The 
charged and neutral pion amplitudes are related to 
E(h- (± ' 0) by 

E^=(im(E^+E^), 

E^=KE^+>+E^l. 

If we assume that the isoscalar amplitude (which is 
small anyway) is not much different from its Bom 
approximation, 19 then the experimental results imply 


L 1+ ^ 2 >B= e L 1+(e) ^)B+( JUp _ Mn )L 1+(/i) ( 3 ^^. ( ' 


W 4.70 

ReE(H- (+) »-0.4-, 

M n +M, M n 



(Numerically, the e terms, which come largely from the 
pion-exchange graph, are much larger than the ju terms, 
which come from the crossed nucleon graph.) One can 
verify, by direct calculation, that at |k|=0 (for 
all ft 2 7^0), 


ReEo* <->«£(* <“ )B . 

We get the imaginary parts of the multipoles E(h- ( 1/2,3/2) 
by using the Fermi-Watson theorem, which tells us 
that 


E 


l+<«) 


(3/2)3 


El +00 


(3/2)3 


L 1+(a) <WB L 1+0l) < 3 ' 2)B 


1 . 


(41) 


But at the physical threshold | q| =0 we find for real 
photons that 


2 )3=L88, 

Ei +( ,,^>V^i + (,) ( 8 / 2 )b = 0 . K 

In Fig. 2 we have plotted the ratio of the numerically 
dominant e terms as a function of energy. Clearly, in 
determining the longitudinal multipole contributions 
to PV^O), assumptions such as Eq. (31) are unreliable 
and there will be no substitute for measurement 
of the longitudinal multipoles in electroproduction 
experiments. 


ImE(H- (1/2) ~sin5i R eE^ w 

-sin5i[ReE^(+>+2 ReE*.^], 

IiiLEo+ (3/2)!==f sin$3 ReE(H- (3/2) 

=sin5 3 [ReE (H .( +) —ReE(4- (_) ], 



with 5i, 5 3 the J=|, i 5-wave pion-nucleon phase shifts. 

The numbers given in Table II have been obtained 
by using Eqs. (45) and (46), integrating from threshold 
to a center-of-mass energy W= 10 M T , and taking the 
the pion-nucleon phase shifts from Roper’s Z m =4 
analysis. 20 Adding the 5-wave result to the other 


19 This Is suggested by the CGLN model, in which the jsoscalar 
amplitude is given by the Bom approximation, but the.isovector 
amplitude differs appreciably from the Bom approximation due to 
the presence of the dispersion integral over the iV*(1238). 

20 L. D. Roper, Ref. 16. 
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contributions to Fj (+) (0) raises the total to about 
0.85 of th e value predicted by PCAC. 21 If we assume 

21 If Re£<) 4 - (+) is taken to be zero, the and 

contributions to W + >(0) become 0.062 /M T 2 and 0.066/JwV*, 
respectively. Thus, as expected, the 5-wave contribution comes 
mainly from the charged-pion photoproduction amplitude 
The only multipole significant in the Low-energy region 
which we have omitted from our analysis is While ReJ/i_ is 
known, the Pn pion-nucleon phase shift becomes large only when 
the inelasticity in this channel is also Large. This means that 
Im.Afi_ cannot then be reliably determined by the Fermi-Watson 
theorem. 


1467 

Eq. (31) for Z^, the result for PV'HO) obtained from 
the resonant multipoles is changed veiy little. 
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Possible Measurement of the Nucleon Axial-Vector Form Factor in 

Two-Pion Electroproduction Experiments 
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Current-algebra techniques and the hypothesis of partially conserved axial-vector current are used to 
derive a low-energy theorem for the reaction e+N —> e-HA'+ir+ir(soft). Particular attention is paid to 
satisfying the requirements of gauge covariance. Except for recoil corrections, the resulting matrix element 
is proportional to the nucleon axial-vector form factors, and we suggest that this electromagnetic process 
may be used to measure gA(k 2 ). 


L INTRODUCTION 

EASUREMENT of the momentum-transfer de¬ 
pendence of the nucleon axial-vector form factor 
gA(&) would clearly be of great interest, since it would 
give information about the spectrum of axial-vector 
mesons, just as our experimental knowledge of the nu¬ 
cleon electromagnetic form factors has provided much 
useful information about the vector mesons. Unfortun¬ 
ately, the elastic and inelastic weak-interaction experi¬ 
ments 1 to measure gu(& 2 ) are much more difficult than 
their electromagnetic counterparts, and as a result very 
little about gA^) is known at present. Clearly, it would 
be useful to have alternative, even if very indirect, 
methods of measuring &i(& 2 )- We discuss in this paper 
the possibility of measuring £*(#) in the electroproduc¬ 
tion reaction 

e-j-N —► e+Af+7r+7r(soft), (1) 

assuming the validity of the current algebra and of the 
partially conserved axial-vector current (PCAC) hy¬ 
potheses. This possibility is suggested by the recent 
work of a number of authors , 2 showing that when cur- 
rent-algebra-PCAC methods are applied to the photo¬ 
production reaction 7 +TV—► TV+ 7 r+ 7 r(soft), which is 
the & 2 = 0 case of Eq. (1), the results of the old Cutkosky- 
Zachariasen static model 3 are obtained, with the domi¬ 
nant term coming from the matrix element of the axial- 
vector current {Nn | J\ A | N). 

* A. P. Sloan Foundation Fellow. 

f Supported in part by the U. S. Air Force Office of Research, 
Air Research and Development Command, under Contract No. 
AF 49 (638)-1545. 

1 For a discussion of the determination of &a(£ 2 ) in neutrino ex¬ 
periments, see E. C. M. Young, CERNReport67-12 (unpublished). 

2 T. Ebata, Phys. Rev. 154, 1341 (1967); P. Carruthers and H. 
W. Huang, Phys. Letters 24B, 464 (1967): P. Narayanaswamy 
and B. Renner, Nuovo Cimento 53A, 107 (1968); S. M. Berman 
(unpublished) (Berman has also considered the extension to electro¬ 
production); W. I. Weisberger (unpublished). 

8 R. E. Cutkosky and F. Zachariasen, Phys. Rev. 103, 1108 
(1956). [See also P. Carruthers and H. Wong, ibid. 128, 2382 
(1962).] The soft-pion result generalizes their model to a relati¬ 
vistic framework in the same way that Chew, Goldberger, Low, 
and Nambu extended the Chew-Low static model for A 3 . 3 * 
photoproduction. 
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In Sec. II we apply soft-pion methods to the reaction 
of Eq. (1), and get a relation between the matrixelement 
for this process and the matrix elements for single-pion 
weak production and electroproduction, (TYttI/x^I A”) 
and (A - 7r|/x EM | A). By carefully keeping all pion pole 
diagrams, we eliminate some discrepancies noted in the 
previous work on two-pion photoproduction. The ma¬ 
trix element {Ntc | J\ A | N) can be related, in turn, to the 
axial-vector form factor gAQf), using models analogous 
to the very successful CGLN 4 treatment of pion 
photoproduction. 

In Sec. Ill we retain only the /=/=§ partial wave, 
treated in the CGLN approximation, and discuss the 
possibility of measuring gA(&) in the reaction e+N —» 
tf 3 . 3 *(1238)+7r(soft). 

H. DERIVATION 

We will consider the electroproduction reaction 

—► e(fe)+A'(/)2)+7r(g)+7r*(^), (2) 

with the superscript s an isospin index. Letting k=ki 
— fe be the four-momentum transfer between the elec¬ 
trons, the hadronic matrix element for Eq. (2) is 

M\= jd A xd 4 y e ik ' x e~ iQ9 ' v (— DyM-Af* 2 ) 

X(N(pMq) I mAy) A EM (*)) I (*)>. (3) 

We wish to find the limit of Eq. (3) when tt* is soft, that 
is, as q 8 —► 0. This can be done by the standard soft- 
pion methods 6 ; the only delicate point is to insure that 
our soft-pion approximation for M\ satisfies gauge 
invariance. 

Let us begin then by studying the gauge properties 
oflfx. Multiplying Eq. (3) by — ik\, integrating by parts 

4 G. F. Chew, M. L. Goldberger, F. E. Low, and Y. Nambu, 
Phys. Rev. 106, 1345 (1957). Hereafter referred to as CGLN. 

6 See, for example, S. L. Adler and F. J. Gilman, Phys. Rev. 
152, 1460 (1966). 
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with respect to x 9 and using d\J\ BU =0 gives 
— ik\M\= jd*xd A y e ih ' x e ~ iQt ' v (—□ 


In all simple canonical field theories involving pions 
one finds® 


5(^o—yo)[/o EM W,^T-(y)]=^3 ac 5 4 (^— y)<t>Ay ); (5) 

substituting this into Eq. (4) and finally integrating by 
parts with respect to y gives 

hM x = -eUtf+MS) J d*y 

\<l>Ay) \N(pi)) 


C3 «c(?* 2 +M t 2 ) 

(k-q.)*+M.* 



Xe i ( h ~ q * ) ' v {N(pii)ir( ( l)\JAy)\N(pi))» ( 6 ) 


As expected, when t* is on the mass shell, k\M\= 0, but 
in the off-shell case the divergence of M\ is nonzero. 
Our soft-pion approximation for M\ will not actually 
satisfy Eq. (6) exactly, but will obey the approximate 
version 

tuc(q, 2 +M r 2 ) f 
- I d 4 y 

(k-qP+M v * J 

Xe ik AN{h)Aq)\JAy)\N(pi )), (7) 


obtained by neglecting q a in the matrix element of J T 
but keeping q a in the rapidly varying factor (qf+M* 2 )/ 
{.(k—q^+M^ Clearly, Eqs. (6) and (7) are identical 
both in the soft-pion limit (?«=()) and on the mass shell 
(q 2 = — M t 2 ). 

In applying PCAC to Eq. (3), it is helpful to introduce 
the “proper part” J\ AP of the axial-vector current, de¬ 
fined as follows: Let a and b be arbitrary hadron states, 
and let q=p a —pb • Then we define J\ AP by 


Clearly, the proper current J\ AP has no pion pole; Eq. 
(9) is thus a convenient decomposition of the axial- 
vector current into pion-pole and non-pion-pole pieces. 
[As an illustration, let us take a and b to be nucleons. 
Then (N | J\ A | N) « u(g A yxy 5 +iq}JiAy&)u. In the approxi¬ 
mation in which the induced pseudoscalar form factor 
h A is given by h A =2MNg A /(q 1 +M v r ) J the proper part 
of (N | J\ A \N) is just the piece ug A y\y^ur\ Let us now 
introduce the PCAC hypothesis in the form 

MnM^gA 

dffJ<r aA = -( 11 ) 

*«» 

Then using Eq. (10) we can write Eq. (11) as 



which says that the divergence of the proper part of the 
axial-vector current is a smooth interpolating operator 
for the pion source. Thus, we can rewrite the gauge con¬ 
dition [Eq. (7)3 in the alternative form 

^3.c(?. 2 +-M‘ir 2 )kr gr(0) f 

-/ d*y 

(k-qA+MS MifgA J 

X ** ■ I J* eA *(y) I Nip ,)). (13) 

To get a soft-pion approximation for we substi¬ 
tute Eq. (11) into Eq. (3) and integrate by parts with 
respect to y . This gives 

M x 2 

-Mx=Mx 13 ™+Jkfx SURF , (14) 

q, 2 +M** 

with 

Mx ET0 = - I d*x 

M N g A J 

x e i{h - Q,) ’ z (N(p 2 )ir(q) \Jx‘ A (x)\N(j>{)) (15) 


(a | |6>=<a| Jx A I b)+—(a | q,J, A | b ), (8) 

Mr 2 

which implies that 

{a | J X A |6>=<a| Jx AP \b) - —{a | qJ A * | b ), (9) 

f+MS 

q 2 -\-M x 2 

(a | qxJx AP \b) =- (a\ qxA A \b). (10) 

_ Mtc 2 

6 When integrated over space with respect to x, Eq. (5) becomes 
Uz+hYy which is just the statement that the pion 

is a particle with the quantum numbers 7= 1, 7=0. The local 
form, Eq. (5), follows from the integrated version in canonical 
field theories, since in such theories the charge density /o EM is 
a bilinear form in the canonical fields and momenta, and thus 
[7o EM (*)^>T*(y)11 *o-y 0 contains no gradient of 6-function terms 
which vanish when integrated spatially. 


the equal-time commutator of Jo aA with /x EM , 7 and with 

£r(0) f 

lfx SURF =iff f(r - / d*xd*y e ih ‘ x e~ iq *' v 

M N g A J 

X(NiPzMq) I r(A^WA EM W) I N(p{)) (16) 

the remainder. Separating Eq. (15) for into a 


7 We have, of course, evaluated the equal-time commutator us¬ 
ing the Gell-Mann algebra of currents [M. Gell-Mann, Physics 1, 
63 (1964)]. The possible presence of Schwinger terms in the time- 
space commutators is irrelevant because of the cancellation of 
the Schwinger term and ‘‘seagu 11-diagram ’ y contributions in soft- 
pion calculations. See, for example, S. L. Adler and R. F. Dashen, 
Current Algebras (W. A. Benjamin, Inc., New York, 1968), Chap. 
3. 
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NfP|) (q) N(p 2 ) 



N(P| * (b) N(p2 * 




isA 



i 

A 


N(p,) 

(c) 


‘vL 




x 

-w(q) 

Wftf 

(d) 

NtP2) 


Fig. 1. Contributions to J|/ x suhf [[Eq. 
(16)J. (a) Axial-vector current couples to 
a virtual pion. (b) Axial-vector current 
attaches to the initial external nucleon 
line in single-pion electroproduction. 
There is a similar diagram (not shown) in 
which the axial-vector current attaches 
to the final external nucleon line, (c) 
Axial-vector current and vector current 
attach to a virtual pion at the same space- 
time point [a “seagull” diagram], (d) 
Axial-vector current couples to internal 
fines in the matrix element (N(to)v(q) I 


Mx BmF ^ = e 9 z c q 9a - 


1 


Jd*x 


(k-q.y+MS 
gr(0) q.xkv 


— IZiZc 


proper part and a remainder gives 

g'( 0) 


M^=-h,Zc 


[/ d*x e'X* - ®*) ’ x (N(p 2 )ir(q) | 

/■ 


( k~q.)\(k~q .), 


XJ\ cAP (x) | N(pi ))— : -—-— / d 4 x 

(k-q.y+MS J 

x«“*-«■> \j,•**(*) I^W)] (17) 

[/*, e' k ' x {N(p 2 )ir(q)\J\ cAP (x) 


gr( 0) 




M N gA 


X\N( Pl ))- 


(Jt j 

(k—q a ) 2 +M r 2 


Jd*x e ik -‘ 


X(N(p 2 Mq) \J,° ap (x) |ff(fc)>] • (18) 

In going from Eq. (17) to Eq. (18) we have neglected 
q a in matrix elements of the proper part J\ cAP and its 
divergence d,/, flAP , but have retained q a in the rapidly 
varying factor (k— q^)\/[_(k— g f ) 2 +Jf x 2 ]. The surface 
term Mx SURF contains four types of terms, shown in 
Figs. l(a)-l(d). In Fig. 1(a), the axial-vector current 
couples to a virtual pion; it is easy to see that 


Jf x SURF(a) = . 


-?. 2 


q, 2 +M r 2 


-M x . 


(19) 


In Fig. 1(b), the axial current attaches to an external nu¬ 
cleon line in single-pion electroproduction; an expression 
for jf x smtF(b) can b e obtained from the usual axial- 
current insertion rules and is given below. In Fig. 1(c), 
the axial current and vector current attach to a virtual 
pion at the same space-time point; this is a "seagull" 
diagram contributing to virtual radiative pion decay and 


Jd*x 


MxgA (k-q.y+MS 

X e ik - x {N(p 2 )ir(q) I Jf AP (x) | N(pi)). (21) 

Finally, in Fig. 1(d) the axial current couples to internal 
lines in the matrix element 

(N(pMq) I n^(y)A EM (*))|A^i)>; 

consequently, lf x sunF((i) [ s Q f or der q a and may be 
neglected. 

Comparing Eq. (21) with Eq. (18), we see that the 
effect of including the radiative pion decay diagram is to 
change the coefficient of the pion-pole term in Afx 3310 
from (k— q 8 )\ to (k—2q t )\. This eliminates the factor of 
two discrepancy noted by Carruthers and Huang, 2 who 
neglected Afx SUBF(c) , and leads to the satisfaction of 
the approximate gauge condition (13). 

Combining all the terms, we may write our answer as 
follows: 

/ MJ \ 1/2 

M\= (2tt) A S\pz+ q—pi—k)( -j 

\2pitip2oqo/ 

Xu(p2)N\u(pi)+0(q a ), (22) 

r -^r(0)\ 


f (2q t -k)^ 

N\= e,3<| - 


L (k-q,) 2 +M* 2 


■Hxi ^- ]O n 


M NgA / 


gr(0) P 2+%Mn 

■f - T*q, 75-0x EM 


2 M n 


2p2-q» 


pi+iMw g r ( 0) 

+Ox EM - T a q a ys, 

— 2pvq a 2 Mn 

where O n cAP and Ox EM are defined by 

/ Mn 2 \ lf2 

(XfaMq) | Jf AP | N(p J)- ( -- 

\2piQp2oqo/ 

Xu(p 2 )O n cAP u(pi ), (23a) 

/ Mn 2 v 1 ' 2 

(N(p 2 )ir(q) | /x EM | N{p J) = -- 

\2/>iop20ffO' 

Xil(p2)0\ EU u(pi ). (23b) 

The terms proportional to Ox EM are the single-pion 
electroproduction contribution H X SURF(6) mentioned 
above. 8 Since the single-pion electroproduction matrix 

8 In writing the matrix element 0x EM we neglect the additional 
momentum q a carried by the intermediate nucleon. It is clear that 
the error is 0(q a ), consistent with our approximation. 
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element is gauge-invariant, we have hSjfr+iMx) 
X 0 x EM «(/>i)= n) = 0 , and thus the 
divergence of N\ is 


k\N\— €*3c" 


q, 2 +Mi 


(k-q.) 2 +M 2 \M NgA 


f-igr( 0) 

4~ 




cAP 


(24) 


Combining Eqs. (22)-(24), it is clear that the approxi¬ 
mate gauge condition of Eq. (13) is satisfied. In particu¬ 
lar, when q g 2 = — M w 2 , k\N\= 0, so on-mass-shell Eq. 
( 22 ) gives a gauge-invariant approximation to the ma¬ 
trix element for two-pion electroproduction. 


m. DISCUSSION 

Let us now briefly consider the possibility of indirectly 
measuring gA(k 2 ) in the reaction e+V —> e+V+T 
+ 7 r(soft), by use of Eqs. (22)-(23). For simplicity, we 
will restrict ourselves to the case in which the soft pion 
is at rest (threshold) in the center-of-mass frame of the 
final baryons , 9 and in which the hard pion and nucleon 
emerge in the (3,3) resonance. At the soft-pion thresh¬ 
old, the kinematic structure of two-pion electroproduc¬ 
tion becomes identical to the kinematic structure of the 
more familiar case of single-pion electroproduction; this 
makes it easy to compute the two-pion cross section 
from the matrix element in Eqs. (22)-(23). When the 
hard t and N form an # 3 , 3 *, the matrix elements in Eqs. 
(23a) and (23b) describe weak production of the (3,3) 
resonance from a nucleon target and have been exten¬ 
sively studied . 10 The vector matrix element [Eq- (23b)] 
is found to be dominated by the magnetic dipole 11 am¬ 
plitude M i+ C3/2) , while the axial-vector matrix element 
[Eq. (23a)] is dominated by the electric, longitudinal, 
and scalar amplitudes £i+ C3/2) , £i+ C3/2) , and 3Ci+( gjl ) <3/2) . 
[[The subscript (g A ) indicates that the part of 3Ci+ (3/2) 
proportional to the induced pseudoscalar form factor 
h A is to be dropped and only the part proportional to 
the axial-vector form factor g A retained; this restriction 
arises because only the proper part of the axial-vector 
current appears in Eq. (23).] For momentum transfers 
k 2 less than 50 F” 2 , a model which should give a good 
approximation to M i+ <3/2) , * 4 4 is 


Table I. Isospin coefficients. 



ai 

02 

«3 

e-\~p — > e-|-ir + (soft)+# 3 , 3 *° 

N,.,* 2 — P+T- 



0 

\n+r » 

-KV2)- 1 


JV2 

e-h/» —> e+ir'“(soft)-hJVs.s*^ 

JW++-. p+t + 

i 

0 

-i 


A straightforward calculation shows that, in terms of 
the weak (3,3) production multipoles, the cross section 
for e+N —> e+Nz, 3 *+ir (thresheld) is given by 


ai(k 2 ,W)=- 


1 d*<rle+N-*e+N it z*+ir(soft)] 


dq 8 tflk 2 dW 


QaQ=~Mx 


a 2 g r (0)‘ 


(w+M*y 


M N 2 W 2 +(W+MtY-Mt 2 (/Sio 1 ) 2 


r 1 / 2 ^ 10 ^ 20 — hk 2 \ 

x b(‘ +- l^) CMI ’ +3|2 ’ l ’ +|c|,:i 


+ 


4&1C&20— k 2 


|k| 


£| 2 ], 


(26) 


1 |k| 

A = —ai&i+<- 3 ' 2 >+a2- MhS'M 

gA plO 

|k| 

£10 


C=a$—Mi + W 2 \ 
pi 0 

1 

D— —ai 
gA 


(27) 


X' 


(^ o-2M T )£ 1+ < 3 ^ 2 )+2M y (|k|/^o)3C 1+(gA) t3/2) 

k 2 -h2M x ko 


M 1+ ^=M 1+ ^ l2)B f 1+ ^ l2 Vfi+ (z,2)B f 

« 1+ (*ft)=« 1+ o/a)Byi + <8/a)/y 1+ (8/a)U ( 

JEi + (, «)= £ 1+ < 3/2 > Vi+ (3/2 V/i + <3/2)JJ , (25) 

^l+(M) <3/2) = 3Cl+(»A) (3/2)a /l+ (3/2) //l+ (3/2>£ ) 

where /i+ (3/2) is the pion-nucleon scattering amplitude 
in the (3,3) channel and where the superscript B de¬ 
notes “Bom approximation.” Expressions for / i+ (3/2)5 , 
lfi+ C3/2)B , £i+ C3/2)B , • 4 4 are given in the Appendix. 10 

9 That is, the frame defined by ps-bq-bq.^O. In the case q t = 0 
which we consider, the center-of-mass frame of the final baryons is 
identical with the center-of-mass frame of the hard pion and nu¬ 
cleon (the N 8,8* rest frame). 

10 S. L. Adler (to be published). 

11 Our multipoles are a factor (fhW/Mm) times those of Ref. 4. 


where kio L is the laboratory-frame initial electron en¬ 
ergy, where q s 0 and all other noninvariant quantities 
refer to the center-of-mass frame of the final baryons, 
and where W is the invariant mass of the resonating 
pion and nucleon. Values of the isospin coefficients a lt2 ,3 
are given in Table I. For comparison, the cross section 
for the ordinary (3,3) electroproduction reaction 
e+p—> e+Nz,**+ is 

d 2 <r(e+p—> e+A 7 8,3* + ) a 2 Iql 1 

<r 2 (k 2 } W)= -=- 

dk 2 dW 3 t (kio L ) 2 2k 2 

f 2£io&2o— i£ 2 \ 

X V 1+ - \k[* -(28) 
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Looking at Table I, we see that the most promising 
reaction for the measurement of gA(k 2 ) is 


statement about the relative rates of decrease of <ri and 
o "2 is that 


e+p—> e+T“(soft)+A'3,3*' H ‘ 

\ 

P+1T + (29) 


<rm/<n(0) ^ \jA(kVgAl 2 |k| V„o 
<r 2 (k 2 )/* 2 ( 0)~ fE^ 2 )] 2 |k|V 


for the following three reasons: (1) The coefficient a\ of 
the axial-vector multipoles is the largest in this case. 
(2) The coefficient a 2 vanishes and, consequently, the 
vector multipole Mi + ^ }2) enters only through the very 
small recoil-correction term | C | 2 . (3) In this case there 
is no soft-pion background coming from single-pion 
electroproduction, which can only lead to a soft ir + or t°. 

Because the Bom approximations £ i+ < 3/2)B and 
£ i+ <3/2)B are known functions of W and k 2 , and are 
proportional to £a(£ 2 ), Eq. (26) C a P art fr° m the small 
term |C| 2 [] is proportional to gA(k 2 ) 2 , and thus a mea¬ 
surement of <ri as a function of k 2 will determine the 
momentum transfer dependence of gA> 12 

There is, however, a possible problem, which may be 
illustrated by comparing Eq. (26) with Eq. (28) for 
ordinary (3,3) resonance electroproduction. Just as <n 
is proportional to gA(k 2 ) 2 , <r 2 is proportional to F v (k 2 ) 2 , 
where F v (k 2 ) is an isovector electromagnetic form fac¬ 
tor. There seems to be some evidence that the axial- 
vector form factor g A (£ 2 ) falls off considerably more 
slowly with k 2 than does F v (k 2 ). This in turn suggests 
that the soft pion+AVs* production cross section <n 
falls off much more slowly with k 2 than does the Af 3i3 * 
cross section 0 - 2 . Unfortunately, however, this conclusion 
is not correct. The reason is that the multipoles M i+ C3/2) 
and £i+ (3/2) have different small-1 k| threshold behavior, 


JjAjkVgAj (W—Mn) 2 
CF v (k 2 )J ( W-M N ) 2 +k 2 ‘ 

Even if gA(k 2 ) falls off appreciably more slowly than 
F v (k 2 ), the effect of the factor (W—Mn) 2 /[_(W—Mn) 2 
-\-k 2 ~\ is to cause <n to decrease more rapidly than <r 2 . 

The importance of the threshold behavior in Eq. (31) 
illustrates a problem which might invalidate Eq. (22), 
our soft-pion approximation for the two-pion produc¬ 
tion matrix element, and thus destroy the possibility of 
measuring gA(k 2 ) in the reaction Eq. (29). In deriving 
Eq. (22), we have neglected terms of first order or 
higher in the soft-pion four-momentum q s . At k 2 =0, 
we feel fairly justified in this approximation, since it 
leads to the Cutkosky-Zachariasen formulas, which 
seem to work. However, it is always possible that some 
of the terms of order q s , which are negligible at £ 2 =0, 
increase rapidly relative to the terms of zeroth order in 
q t as k 2 increases, because of a different threshold be¬ 
havior in | k . If this happened, the soft-pion approxi¬ 
mation could become bad precisely in the large-£ 2 re¬ 
gion, where we must look to measure gA(k 2 ). Hopefully, 
this does not happen, but in using Eq. (22) to interpret 
two-pion electroproduction experiments, this danger 
must be kept in mind. A more detailed investigation of 
this problem is being undertaken. 



M 1+ (3/2) ~|k| 
£i + C3/2J ~l 





and this behavior, in the model of Eq. (25), persists 
into the physical region as well. As a result, the correct 


APPENDIX 

We give here expressions for the Bom approxi¬ 
mations / i+ t3 ' 2)B -Af i+ t3 ' 2 > B , & + < 3 ' 2 > B £ i+ t3/2)5 , and 
3Ci+(^) (3/2)B : 


g 2 

fl+ tw)B = -1- [W-(p 2 ,+M N )A ( a)+W+(Pn-M N )C(a )], 

8 ttW I q | 2 


Mi+ < - w)B = 


^ 2 |q| IH/-*r\ ... r _jM N W-( Pl o+M N ) 

W^ 2 / 


0 2 + 


\[F\ V (k 2 )+2M xF 2 v (k 2 )~] 


[ 


W 2 


A (a) 
q 1 2 1 k | 2 


W+ B(a ) MnW+ C(a) ~\ 

- 1 -+nucleon and pion charge terms, 

_ H^ 2 |q| |k| W 2 (p 2 o+M N ) q |k|J 

12 A similar calculation would lead to a determination of gA(k 2 ) in electroproduction of a single soft pion. The relevant matrix elements 
are given in Ref. 5, which gives further references. Experimental data on single- and double-pion photoproduction reactions indicate 
that double-pion electroproduction may yield more reliable results for gA(k 2 ) than single-pion electroproduction. The reason is that 
the soft-pion matrix element seems to give an accurate description of the experimental results for two-pion photoproduction up to 
about 100 MeV above the Nz, 3 *+t threshold, while the single-pion photoproauction is dominated byJV 8i3 * production (which cannot 
be described by soft-pion methods) as soon as one goes away from threshold. In fact, it is interesting to note that the recent DESY 
results on y-\~P —* Ns, 3 * ++ +*'“ show a cross section rising less rapidly above threshold than indicated by earlier experiments and 
agree within experimental error with the prediction of the Cutkosky-Zachariasen model. The relevant experimental results and refer¬ 
ences are given in Fig, 9 of M. G. Hauser, Phys. Rev. 160, 1215 (1967), If both methods of measuring g A (k 2 ) are feasible, one will be 
happy to have two independent determinations. 
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£ 1+ C3/2 )B= W 2Q 1+ q 


'-grgA{k 2 )\rW-(p™-Mx) A(a) 2 B(a) 


f -grgAKk* ) \n 
\ 2 M n /- 


W 2 


+ 


q 1 2 1 k | 2 W 2 | q 11 k | 

W+ C(a) 3 


E(a) 


1 


1 

£ 1+ (3/2)b =- W 2 Oi+ 1 q 

k 0 W 


WKpn+M,,) |q| |k| W 2 (pia+M N )(p 20 +M N )J 
~ grgA(k 2 ) \rM N (pio~ M n ) W++ (iW- ~ Pn) k 2 


/ -grg4* J \ 

\ 2 Mm ) - 


W 


3Ci +(M >»«> a =Oi + q 


grgA(k 2 )r .. . A (a) 


with 


2M n 


r 




A (a) Mtf(pio-)-MN)W-—(%W+—p 2 o)k 2 C(a) “1 

X |q[ 2 [k! 2+ (pio+M N )(p2o+M N )W |q[|k|J ’ 

(^+-go) C(a) 


q| 2 |k| (Pio-)-Mn)(P 2 o-)-Mn) |q| 


I* 


(Al) 


W ± =W±Mm , Oi+=[(pio+M*) {pi,+M N )J 12 , 0^=1{P 1 ,+Mm)/(P2,+Mm)J' 2 , 

a=(2^„fe„+fe J )/(2|q| |k|), d=(2p m ,-M 2 )/(2\q\ 2 ). 


(A2) 


The functions A through E are defined by 

/a+1 


/a+l\ r /a+l\" 

^4(a) = l—|aln^- -J, £(a) = | a+£(l—a 2 )iln^--J 


C(a)=-* 


3a+J(l—3a 2 ) ln^ 


/a+l\l 




E(d) 


= f£|— a 2 +\a(a 2 — 1) ln^- 


(A3) 


and Fi v (k 2 ) and F 2 v (k?) are, respectively, the isovector nucleon charge and magnetic form factors, normalized 
so that Fi v (0)+2 MnF 2 f ( 0)=4.7. For reasons explained in Ref. 10, only the part of Mi+ al2)B proportional to 
the total nucleon isovector magnetic moment (given explicitly in the equation above) is used in Eq. (25); the 
part proportional to the nucleon and pion charges should be dropped. 
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We give a unified account of single-pion photo-, electro-, and weak production. 
The emphases of the paper are fourfold: (1) We give a detailed kinematic discussion 
of single-pion electro- and weak-production; (2) we develop a dynamical model for 
electroproduction and weak production in the (3,3) resonance region, based on the 
CGLN model for photoproduction; (3) we systematically discuss the partially-conserved 
axail-vector current (PCAC) and current-algebra constraints which relate the single-pion 
electro- and weak-production matrix elements to the matrix elements for other processes; 
(4) we compare our model with experiment. 


1. INTRODUCTION 

Production of a single pion is the simplest inelastic process that can be studied 
in electron-nucleon and neutrino-nucleon scattering experiments. Already, 
several pion electroproduction experiments have been performed and some crude 
data on weak pion production is available. Since, in the future, there will be a 
substantial accumulation of data on these processes, the theoretical interpretation 
of pion production experiments becomes an important problem. 

We give in this paper (/) a detailed, unified treatment of single pion photo-, 
electro- and weak production. The parallel discussion of the three processes is 
natural, since they are closely related. Photo-production is, of course, just a special 
case of electroproduction, in which a real photon, rather than a virtual photon, is 
involved. According to the conserved vector current (CYC) hypothesis (2), the 
isovector electroproduction amplitudes are related by an isospin rotation to the 
vector-current weak-production amplitudes. The weak production process also 
involves axial-vector amplitudes which, while not directly related to electroproduc¬ 
tion amplitudes, are most naturally treated in analogy with the treatment of the 
vector amplitudes when making dynamical models. Comparison of photoproduc¬ 
tion and electroproduction models with experiment gives an idea of how good 
weak-production models may be expected to be. 

The main emphases of this paper are fourfold: First of all, we give a detailed 
kinematic discussion of single-pion electroproduction and weak production, 
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including a derivation of differential and total cross-section formulas for the weak- 
production case, and a discussion of the kinematic singularities which appear in 
the electroproduction matrix element when gauge invariance is imposed. The 
kinematic results are general, and are not limited to single-pion production in the 
(3, 3) resonance region. Secondly, we construct a dynamical model for the single- 
pion electro- and weak-production matrix elements in the (3, 3) resonance region. 
We limit our dynamical discussion to this region because, as shown in the pioneer¬ 
ing work of Chew, Goldberger, Low, and Nambu (3) (CGLN), in the (3, 3) 
resonance region a very successful model for pion photoproduction can be made. 
Our work is essentially an extension to the electro- and weak-production cases of 
the version of the CGLN model discussed recently by Hohler and Schmidt (4). 
Thirdly, we discuss various PCAC and current algebra constraints which relate the 
single-pion electro- and weak-production matrix elements to the matrix elements 
for pion-nucleon scattering, for two-pion electroproduction and to the nucleon 
vector and axial-vector form factors. Wherever possible, we test whether our 
(3, 3)-dominated model satisfies the PCAC conditions. Finally, using predictions 
of our theoretical model in the (3, 3) resonance region, we give a comparison with 
experimental data obtained in recent photon, electron, and neutrino experiments. 
The CGLN dispersion-theoretic dynamical model is not the only approach to 
getting predictions which we could have used. Other recent methods involve (i) 
use of a postulated higher symmetry, such as SU G , to relate the N-N£ 3 (vector or 
axial-vector current) vertex to the nucleon vector and axial-vector form factors (5), 
or (ii) direct introduction of iV-jV 3 * 3 (vector or axial-vector current) form factors, 
which are parametrized in a convenient fashion and are used to generate a family 
of cross section curves (d). We prefer the CGLN approach because it has given 
more detailed and more accurate results than the higher symmetry method in the 
case of photoproduction, and because approach (ii) is little better than phenom¬ 
enology unless specific dynamical assumptions, such as use of a higher symmetry 
or of the CGLN model, are made in order to give definite values for the N-N 3i3 
(vector or axial-vector current) form factors. Of course, there exists already a 
considerable literature on the dispersive approach to single-pion electro- and weak 
production (7), (3). The most extensive recent treatments are the electroproduction 
calculation of Zagury (7), and a weak-production calculation by Salin (8) based 
on the work of Dennery (7), (5), In an Appendix we give a detailed comparison of 
our model with the work of Zagury and of Dennery, As noted above, we only 
discuss pion production in the (3, 3) resonance region, We do not treat higher 
isobar production, a topic which has been discussed recently by several authors (9), 
Having explained the aims and scope of this paper, we turn next to a brief 
elaboration of its contents. Section 2 is devoted to a discussion of the kinematics 
of pion weak production and electroproduction; most of the subsection headings 
are self-explanatory. In order to keep this section readable, we have put most 
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detailed kinematic formulas in appendices, In Subsection 2D(3) we discuss only 
the most elementary consequences of the partially-conserved axial-vector current 
(PCAC) hypothesis, obtained by equating the divergence of the axial-vector weak 
pion production amplitude to the (off-shell) pion-nucleon scattering amplitude, 
and expressing the resulting equality in terms of multipoles. Other consequences 
of PCAC, including soft-pion theorems, are given in Section 5, Our principal new 
kinematic results are the formulas for the weak-pion-production differential cross 
section in terms of helicity amplitudes, and for the weak-production total cross 
section (integrated over the pion emission angles) expressed as a sum over multipole 
amplitudes, given in Subsection 2F and Appendix 4, In Subsection 2G and 
Appendix 5 we use the Rarita-Schwinger formalism to define direct 
(vector or axial-vector current) couplings, and we relate these couplings to the 
multipoles leading to the N to N£ s transition. This will enable the comparison of 
our paper, and other dispersive approach papers which calculate the multipole 
amplitudes leading to excitation of the (3, 3) resonance, with papers using the 
phenomenological, direct-coupling approach. 

In Section 3 we write down the fixed momentum transfer dispersion relations 
which the weak production amplitudes obey and discuss the questions of kinematic 
singularities and convergence. We show that the use of gauge invariance to reduce 
the number of independent vector amplitudes from eight to six necessarily 
introduces a kinematic singularity in some of the invariant amplitudes, but that 
the effect of this singularity on the physical matrix element can be eliminated by an 
appropriate subtraction in the dispersion relations. 

In Section 4 we develop a dynamical model for pion weak production and electro¬ 
production. When specialized to pion photoproduction, our model differs only 
slightly from the Hohler^Schmidt version of the CGLN model. The general method 
is to write fixed momentum transfer dispersion relations for the invariant ampli¬ 
tudes, Under the dispersion integrals we approximate the imaginary parts of the 
amplitudes by keeping only multipoles which excite the (3, 3) resonance and which 
are dominant in Born approximation. We then project out integral equations for 
these multipoles; an examination of the nearest left-hand singularity structure of 
the multipoles shows enough of a resemblance to the familiar case of pion-nucleon 
scattering to allow a simple approximate solution to the integral equations. We 
guess this approximate solution by the heuristic procedure of first studying the 
static-nuleon limit of the integral equations, (At the same time we try to clarify 
the relation between several approaches found in the literature for solving the 
Omnes equations involved,) We then check numerically that the guess is a reason¬ 
ably self-consistent solution to the integral equations when no static approximation 
is made, so that our final answer is not a static limit result. We show that the same 
arguments leading to our model for the dominant multipoles also can be used to 
justify an approximation which we have used elsewhere (10) for the pion off-shell 
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extrapolation of partial-wave and multipole amplitudes. Our model is summarized 
in Subsection 4E; the static limit of the model is calculated here only as a check, 
and is not used in the subsequent numerical work, A comparison of our model 
with other weak production and electroproduction calculations is given in Ap¬ 
pendix 7, 

In Section 5 we derive a large number of PCAC and current-algebra conditions 
on the pion electroproduction and weak production amplitudes, and compare 
them with values for the amplitudes calculated in our model. We interpret one 
particularly bad discrepancy as indicating that we have neglected a vector meson 
exchange contribution to weak pion production by the axial-vector current. We 
briefly discuss the low-energy theorem which relates two pion electroproduction, 
with one pion soft, to single-pion weak production and electroproduction ampli¬ 
tudes. 

Finally, in Section 6 we compare our model with experiment. Agreement with 
photoproduction data and with electroproduction data for electron momentum 
transfers less than 0.6 (BeV/c) 2 is good, but our theory appears to break down for 
momentum transfers much greater than 0.6 (BeV/c) 2 . In weak production, a fit of 
our model to CERN data for neutrino production of the (3, 3) resonance suggests 
an axial-vector form factor which falls off more slowly with increasing momentum 
transfer than do the vector form factors. We also discuss some features of weak 
pion production which may be of interest in future experiments. 


2. KINEMATICS 


In this section we discuss the kinematics of the pion weak, electro-, and photo¬ 
production reactions. Many of the formulas give here have been published else¬ 
where; our aim has been to collect all of the kinematic equations needed in this 
work, using a standardized notation. 


2A. Energy and Angle Variables 

Let us consider the weak, electro- and photo- pion production reactions 

£ j (*!) + N(pd - £ J (k 2 ) + N(p 2 ) + 7 T(q), 


e(ki) + N (Pi) -*• e '(k 2 ) + N{p t ) + t r(q), 
y(k) + N( Pl ) -* N(p 2 ) + ir(q). 


(2A.1) 
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Substituting the static limit equations into Eq. (2F.8) for the differential cross 
section gives, after some algebraic rearrangement, 


d 2 


a 


dQdW 
with 

and with 


kijc 


L 7-L 
20 


d 2 a Gp 2 cos 2 9 C ( k 10 — w) 2 / 

’ 2M n 

) a 3 .i(W) a, 

(4E.6) 

d(k 2 ) dW ~ 4t r> (w* - M 2 yt 2 \ 

gr 

= 167r(| a (+) - * a ( ->) 2 |/< 3 + ' 2) | 2 


(4E.7) 


« = Sa 2 [ 1 + sin 2 (0/2)] + (F/ + 2 M N Fjy sin 2 (0/2) 
x [K^o + * 20 ) + * 10*20 sin 2 (0/2)]/Afy 2 

+ 2&4FS + 2 M n F/) sin 2 (0/2)(£i O + k 20 )/M N - mfg A h A u>l(2M N k 2 ,) 

+ [h A /(2M N )] 2 m 2 \uP + 4 k l0 k 20 sin 2 (0/2)][sin 2 (0/2) + (4E.8) 

As the notation suggests, o Zt 2 (W) has been defined so that in (v/v)-induced weak 
reactions it is the cross section for (3, 3) excitation in (7r + /7r~)-nucleon scattering. 
Eqs, (4E.6-8) are identical with the static model results of Bell and Berman. 
(Bell and Berman omit the lepton mass terms.) 


5. WEAK. PION PRODUCTION AND THE PARTIALLY-CONSERVED 

AXIAL-VECTOR CURRENT HYPOTHESIS 

In this section we discuss in a systematic way the implications of the PCAC 
hypothesis for weak single pion production. We first derive the PCAC predictions, 
and then, wherever possible, compare them with the model for the pion production 
amplitude derived in the previous section. We interpret a glaring discrepancy 
between one of the PCAC predictions and our model as indicating that we omitted 
an important vector meson exchange contribution to weak pion production by the 
axial-vector current. Finally, we briefly discuss the connection between the reaction 
e-\-N—*e' 7r(soft) + tt and single pion electro- and weak production. 

5 A. Derivation of the PCAC Relations 

(1) Small-k 2 Conditions (Axial-Vector Part ) 

We saw in Subsection 2D that for small k 2 the PCAC hypothesis relates the 
axial-vector multipoles and 3P e ± to the corresponding pion-nucleon scattering 
partial-wave amplitudes f e ± [see Eqs. (2D. 12), (2D. 15) and (2D. 19)]. It will be 
useful to rewrite the identities contained in Eq. (2D, 12) in terms of the invariant 
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and the center of mass amplitudes which were introduced in Subsection 2Q This 
is most easily done by going back to the statement of PCAC, 

out <ttN \ d.iJt 1 + Uf ) | N} = V2 -J ^^ 8A - out <nN | <P„\N> (5A.1) 

and expressing the right- and left-hand sides of this equation in terms of either 
invariant or center-of-mass frame amplitudes. 


Invariant amplitudes: 

Writing 33 

out<>(tf) N(p 2 ) I 9| N(pJ) 

= p + * M j c m [A” nM (v, v B , k\ q 2 = -Mj) 

- iy ■ kB™ U \v,...)] + V2 a^A^Cv,...) - iy ■ kB^iv,...)]} u N (pJ 

(5A.2) 

and expressing the left-hand side of Eq. (5A.1) in terms of the invariant amplitudes 
Vj and Aj , we find 


— 2M N v{A l + A 2 ) {±) + 2M n v b A\ } + k 2 A\ 


(-t) 


.2 ^(i) 


2 M N g A M„‘‘ 


17 T 


N(±) 


g r (0) k 2 + M 2 


(v, v B , k 2 , 


(5 A. 3) 


2M n A^ - M n AT‘ + 2 M n vA™ - 2 M n v b A^’ - k'A' a 


(±) 


(•t) 


(±) 


.2 ^(i) 


2M N g A 


g r (0) k 2 + M 2 


B 


71 


at(±) 


(v 3 v B >Jc 2 , — M 2 ), 


The physical content of Eq. (5A.3) is, of course, just the same as that of Eq. (2D. 12). 
Since we will want, in particular, to study Eq. (5A.3) at the point v = v B = 0, we 
must separate off the Born terms, which become singular at that point. The pion- 
nucleon scattering Born approximation is 

B™ i±)B (v, V B , k\ -AC) = - 8r f:i k - 2) -F - 4 —) (5A.4) 

2 M n \v b — v v B + vi 


33 Equation (5A.2) defines the off-shell pion-nucleon scattering amplitudes A nNl±) (v, v B , k 2 , q 2 = 
— MS) and v B , k 2 , q 2 = — MS\ in which the initial pion has (mass) 2 = —k 2 . They are 

related to the physical pion-nucleon scattering amplitudes v B ) and v B ) by 

A^ N{± \v t v B , k 2 = -MS>q 2 = -MS) - A” N '±'(y, 

B" N '±'(v f * B ,k 2 = -MS> q 2 = -MS) - B" Nt ±' (v, y B ). 

Our notation follows Adler {10), 
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and the weak production Born approximation is given in Eq. (2E.6). Using a bar 
to denote the non-Born part of the amplitude, e.g., A= A j ±) + Al ±)B , we find 
by substituting Eqs. (5A.4) and (2E.6) into Eq. (5A.3) that 34 


—2M n v(A 1 + A 2 ) (±) + 2M n v b A^ + k 2 A^> + ( 0 ) 2 g r g A (k 2 ) 


7 (±) 


.2 l(±) 


2M N g A M 2 

gr(0) k 2 + M. 


nrrs A' K{ *(y, *b , k 2 , -M 2 ), 


2M n A ( ^ - M n AT’ + 2 M n vAT’ ~ 2M N v B Ar - k 2 A' s 


1 (±) 


t(±) 


t(±) 


.2 a (±) 


(5A,5) 


2 M N g A M 2 

g r (0) k 2 + M, 


B” Ni±) (v, v B , k\ -M 2 ). 


Eq. (5A.5) can be further simplified by separating A n and A 8 into their one-pion- 
exchange contributions, coming from the diagram of Fig, 3, and a remainder, 


A^ = _ 2M n8a _ 1 A‘ nN ^(v v k 2 _ M 2 1 4 - A^ r 

7 g r (0) k 2 + M 2 A ( v > v b , k , ) ~r A 7 ; 

7(±) _ 2 M N g A 1 v B , k\ -M 2 ) + 4 ±)s . 


/t 8 — 


gr(0 ) k 2 + M t 


(5A,6) 


Equation (5A.5) becomes 

— 2M N v(A 1 + 4) (±) + 2M N v B A { f + k 2 A ( 7 ±)R + (*) 2 g r g A (k 2 ) 

_ 2M N g A -nrjy(±), .. 1,2 Ttt 2\ 


gr(0) 


\v, v B , k 2 , -M 2 \ 


2M N A i f ) - M N A ( t' ) + 2M N vAf ) - 2M N v B Af ) - k 2 A[ ±)R 
= B" Ni±) ( V> v B , k 2 , -M 2 ). 

In Eq. (5A.7) the pion propagator ( k 2 + M n 2 ) L , which varies rapidly with k 2 > has 
dropped out; the physical content of PCAC is the assertion that the left- 
and right-hand sides of Eq. (5A.7) vary only slowly as k 2 is varied in the interval 
-MJ < k 2 ^ 0 (apart from possible threshold corrections of the type discussed 
in Subsection 2D), 

FromEq. (5A.7)and the crossing properties of Eq. (2C.3), which state that certain 
of the amplitudes Aj vanish at v = 0, we deduce the following relations (32): 


(5A.7) 


grgr(0) 




(5A.8) 


l„=„ 5 =Jt 8 -0 


31 We use the equation 2M N g A {k 2 ) — k 2 h A {k 2 ) = [2M N g A lg r (0)][M n 2 j{k 2 4- M„ 2 )} g r {k 2 ), 
obtained by sandwiching Eq. (2D. 10) between one nucleon states. 
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VP - i^]\ 




* 2 -0 8r(0) 


girNi-) 


v—v B —k—0 


- [4~> + A?] 


4' ] 


gA 


W2 ,-**=0 gr(0) dv 


v=v B *=k =0 


(5A.9) 


v=v s —k—0 


g A 5/4 77jv(+) 
g r (0) dv B 


v—vg'—k*'— 0 


Equation (5A,8) is the familiar consistency condition on ttN scattering implied by 
PCAC (33). To interpret Eq. (5A.9) we note that, for small four-vector k , we have 


Un(Pz) 


6 8 
'(±) 


Z o(v s ) V r + z o(a s ) a; 


(±) 


L 




Wat(a) 


= WatCa) 


I 0(V 3 ) vl ±)B + £ 0(A } ) A] 


8 


(±)B 


\ J-l 


J=1 


+ + ^ 2 (±> ]lo (Pi + Pi ) ' e + iA [ 0 q • e 


: 7 <±> 


- [A<* - 2A ( f>]\ a M N ye+ 0(k) 


u n(Pi)* 


(5A.10) 


The notation | 0 means that the invariant amplitudes are evaluated at k = 0, 
that is, at v = v B — k 2 = 0. Since crossing symmetry implies that A[ +) | 0 = 
A l 2 +) | 0 = A { 4 +) | 0 = Tg” 1 | 0 = 0, the Born approximation and Eqs. (5A.9) com¬ 
pletely specify the weak production amplitude, up to terms of first order in k . 


Center-of-mass amplitudes : 

Using the definition of the pion-nucleon scattering center of mass amplitudes 
A and f 2 , 

4ttW 


u N (Pt>[A" N — iy * kB" N ] u N ( Pl ) = 
we find that Eq. (5A.1) takes the form 


M 


Xf 


L/i + « • 4° ' &fi] Xi . (5A.11) 


N 


& 


A(±) 


0 


&a(±) I k | 2 

fl k 

K 0 


«a<±> *! 
7 k 0 

&A{±) & 

*8 u 
Kq 


8t TWg A _ 

g r (0) k 2 + MJ 


M 2 ,(±) 
/ 2 > 


M 2 


87 TWgA _ 

gr( 0) k 2 + MJ 


/l to . 


(5A.12) 
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If/x and/ 2 are expanded in partial waves according to 

/i = t fi+PhiU) - i fc-PU{y\ 

<r=o <02 

(5A.13) 

00 

/. = I (/*- - O ^C), 


comparison with the partial-wave expansion of ^ i8 Eq. (2D.4) leads imme¬ 
diately to Eq. (2D, 12). 

(2) Small-q Conditions (Axial-Vector and Vector Parts) 

As has been much discussed recently, the PCAC hypothesis, combined with the 
algebra of currents proposed by Gell-Mann (34), leads to a “pion low-energy 
theorem’’ for any weak or electromagnetic process in which a pion is emitted (35). 
The theorem relates the matrix element of the process, at zero-pion four- 
momentum, to the matrix element of the corresponding process in which no pion 
is present and to an equal time commutator of currents. As applied to weak or 
electroproduction of pions, the method gives restrictions on certain of the invariant 
amplitudes at the point q — 0. We give a detailed derivation of the restrictions 
on the axial-vector amplitudes A if and state (without giving a derivation) the 
similar results for the vector amplitudes Vj . 

The low-energy theorem is derived from the identity 

i/ d'xe-^X-n* + AC) COC I T[dX\x)(Jt\0) + itf*(0))] I N( Pl )} e, 

= -i(q 2 + AC) j 

X <N(pJ | [CO, O 0) + O 2 (0)]lx 0 =o I CA)> e x 
-q a j d*xe~ iQX (— □, + AC) C 

x (N(p 2 ) | r[CO(C(0) + ,7f(0))] | N(p J) e A , (5A.14) 

obtained by integration by parts. We evaluate each of the terms in the limit as 
q 0. Using the PCAC hypothesis in the form 
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the left-hand side of Eq. (5A. 14) is seen to be proportional to the matrix element 
for weak production of a pion of (mass) 2 = — q 2 , 

left-hand side = Mn ^§ a - £ [a M A^\v 7 v B , k 2 , q 2 ) + v Bj k 2 , q 2 )] 

X u N (p 2 ) 0(A j ) u N (p x y (5A.16) 

At q = 0 the invariants q 2 , v and v B are zero. Using the postulated commutation 
relation (34) 

[J d 3 xJt a (x), if(0)][ ^ = -e ai X c (0), (5A. 17) 

the first term on the right-hand side of Eq. (5A.14) becomes 

-MJ j d 3 xtf<N(p 2 ) I UtXx), Jf(0) + Ufm, x ^ | N( Pl )) e, 

= MjuxipJlF/ik 2 ) 0(A 2 ) - A/-. 1 

x (F/(k*) + 2M N F/(k 2 )) 0(A t )] u N ( Pl ) a<->. (5A.18) 

Finally, in the limit as q —► 0 only the one-nucleon pole terms contribute to the 
term proportional to q a on the right-hand side of Eq. (5A.14). In other words, this 
term is 

\ig A y ■ qy h 7 - 2 + l *?J L [iy x y^g A (k 2 ) + y h k x h A (k 2 )] Xi + «>a) 

< z z /? 2 q 

+ IOi + i^[iy x y. 0 g A (k 2 ) + y h k A h A (k 2 )] 

y ■ Pi + iM N , t c ) 

x 2 ~ igAy ■ <775 y-J Ma'(Pi) + 0(q). 

After some algebra, this can be rewritten in the form 

M^g^xipJlM^gX 2 ) a^OiAj - h A (k 2 ) a U) 0(A 7 )] u N ( Pl ) 

gr(0) g A (k 2 ) 


(5A.19) 


+ M -,i gA u N (p 2 ) 


gXO) 


X 


+ 


+ 


X 


[ al+> 


2M n 

_>_) + fl ,-, / — ! — + — ! — )] 

V B — V V B + V! \V B — V V B + V)\ 

%m *-(*■>. 0(Ai) r„ w /—!— + —i—) + ( 1 

L \Vg — V V B V' \ 


2 M n 

g r (0) h A (k 2 ) 


V B — V V B + V 


)] 


2 M 


0(A a ) 


N 




1 


V B — V V B + v> 


+ a { - } | 


1 + 1 


V B — V V B + V! J 


u N ( Pl ), (5A.20) 
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with the term in curly brackets just the Born approximation for weak pion 
production. Substituting Eqs. (5A.16), (5A.18), and (5A.20) into the identity of 
Eq. (5A.14), we see that the Born approximation terms, which are singular at 
q = 0, cancel out. This leaves us with the following conditions on the non-Born 
parts of the amplitudes (denoted again by a bar) (36): 

A[~\v =v B = 0, k 2 , q 2 = 0) = F^k 2 ), 

A[-\v = = 0, k 2 , q 2 = 0) = - [F/(k 2 ) - g A g A (k 2 ) + 2M N F 2 v (k 2 )], 

IYL N oA 

A { f\v = v B = 0, k 2 , q 2 = 0) = - h A (k 2 ). (5A.21) 

An entirely analogous derivation can be carried out for the vector weak produc¬ 
tion (and the electroproduction) amplitudes, leading to the identities (37) 

V^( v = = 0, k 2 , q 2 = 0) = ^ F/(k 2 ), 

F<V = va = 0, k 2 , q 2 = 0) = Ft s(k 2 ), (5A.22) 

V ( -\v = v* = 0, k 2 , q 2 = 0) = _ py (k%) ] (k2yi 


Equations (5A.21) and (5A.22), together with the Born approximation, completely 
specify the weak production amplitude up to terms of first order in q . 

The condition on A has a simple interpretation when k 2 is near ~M 2 , so 
that only the pion pole terms in A { 7 +) and h A need be retained. In the pion pole 
approximation, 


A\+\v = = 0, k 2 , q 2 = 0) 

- - FT*? i' 

h (h&\ ^ nEa \SrjE r(B)] 

h A( k } ~~ k*+ M 2 ’ 

and substituting these relations into Eq. (5A.21) gives 

= v B = 0,k 2 = -M 2 , q 2 = 0) = 


0 ), 

(5A.23) 


(5A.24) 


Eq. (5A.24) is identical with the pion-nucleon scattering consistency condition 
of Eq. (5A.8). (It makes no difference whether it is the initial or the final pion which 
is off mass shell.) 
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(3) Combined Relations 


A number of interesting relations can be obtained by combining the small-# 
equations of Eq. (5A.21) with the small-A: equations of Eq. (5A.9). [More properly, 
we use the analog of Eq. (5A.9) in which the final pion mass — # 2 has been extra¬ 
polated from M n 2 to 0. We neglect a small additional term, the so-called “<7 term,” 
which appears in the equation for A { 3 +) when # 2 — M^ 2 (35).] Taking the linear 

combination of A and A which eliminates F 2 v (k 2 ) gives 

2 4 -> lo + \ 0 = -g T ( 0)(1 - g/)KM/g A ), (5A.25) 

while eliminating A[ _) from Eq. (5A.9) gives 




= r g A i r 8A” N< -> 

0 Ur(0)J 1- Sv 


+ 





(5A.26) 


[The abbreviation | 0 indicates evaluation of the amplitudes at v = v B = k 2 = 
q 2 = 0.] Taken together, Eqs. (5A.25) and (5A.26) imply that 


1 


gA 2 


2M n 2 r 

gr 2 (0) L dv 


J Ip. p 5 -A; 2 -Q 2 aaO 


(5A.27) 


the usual g A sum rule (39). In other words, the g A sum rule emerges as the condition 
that the small-# and small-A: expressions for the axial-vector weak production 
matrix element be consistent at the point # = k = 0 (32). 

Since Eqs. (5A.9), (5A.21) and (5A.22) determine the terms in the weak produc¬ 
tion matrix element linear in either q or k , one can use them to write down an 
expression for the weak production matrix element, exact up to terms of order 
qk, q 2 and k 2 . Dropping lepton mass corrections, one finds (32) 


Un(Pz) 


I 0(V,) + X 0(A,) A) 


(±) 


u n(Pi) 


= u N (p 2 ) 


6 6 

(±)B ( V r\/ a \ a (±)£ 


I O(Vj) V} ±)B + £ 0(A } ) A, wb + J (±) + 0(qk, q 2 , k 2 ) 

j-=l 


«v(M 


j<+> = 


J<-> = 


igA 

ba* n <+> 

1 „ „ 

gr(0) 

dv B 

\ 0 q 2M n * yh ' a 

igA j 

i gr(0) 2 

( ] - “T) ^ + Pi ' 

gr(0) 1 

! 2MJ 


(5A.28) 


- ie^q.B^ | 0 + (l - - 


M 1 


gA 4 
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This result, while valid near q = k = 0 (and perhaps even good at the pion 
production threshold) is sure to fail in the (3, 3) resonance region, since it does not 
take final-state interactions into account. Thus, the small-#, -k expansions will 
not be of practical use in calculating weak pion production cross sections. 

Another useful relation obtained from Eqs. (5A.9) and (5A.21) is (40) 



Sa dA” N ^ _ g r (0) 
g r (0) dv 0 M N g A 


(5 A. 29) 


or equivalently, by use of Eq. (5A.27) 



o 


— DitN(-) 

“ gM 


gr(o) r l _ , 2M n FS(0) i 

2m/V7a~ 8a+ 7 a J - 


(5A.30) 


If the weak production amplitude A[~ ] | 0 were zero or negligibly small, Eq. (5A.29) 
or Eq. (5A.30) would give a relation between the isovector nucleon magnetic 
moment and pion-nucleon scattering. 35 However, the numerical analysis of the 
next subsection shows that our weak production model does not give any theoretical 
reason for neglecting A | 0 . 


5B. Comparison with Weak Production Model 

In Table V we compare the PCAC predictions for the various covariant ampli¬ 
tudes with the values calculated from the model given in the previous section, at 
the point v v B = k 2 = q 2 = 0. The amplitudes F/ 40 , A[~\... in Column (A) are 
calculated directly from Eq. (3A.2). The bar, we recall, means that only the non- 
Born part of the amplitude is retained. In our model, the non-Born part of the 
amplitude comes from the dispersion integrals over the dominant (3, 3) multipoles, 
which in turn are given by Eq. (4D.22). Since we actually need the dominant 
multipoles at the off-mass-shell point q 2 = 0, we use the off-mass-shell form of 
Eq. (4D.22), 


w(3/2) , r (3/2 )B 

M l + (a) « 2 -0 = A*l+(,i) 


IQ “0L/ 1+ 


L/i 


(3/2) /x(3/2)5-, r i 1 2\2 // \ -i 

//i+ ][1 + a(k ) /(ww 8i8 )] 


(5B.1) 


and similarly for | q2j=0 and \ q2 ^ 0 . The factor g r (0) -1 multiplying all the 

amplitudes in Column (A) of Table V cancels the factor g r (0) in | q2m=0 , 

^i+ 2)S |fl*-o an d l«*=o , an< ^ thus drops out of the numerical evaluation. 


3S A number of authors (41), because of incorrectly using amplitudes with kinematic sin 
gularities, have obtained Eqs. (5A.29) and (5A.30) with A [ 0 replaced by 0. 
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The PCAC predictions in Column (B) were obtained from the experimental 
values 


pv = 3.70, 

fji s = — 0 . 12 , 

gA = 1.18, 

F*'(0) = -0.045 /M v * 


(5B.2) 


and from pion-nucleon scattering phase-shift data. 36 

For some of the amplitudes, the agreement between the low-energy predictions 
of PCAC and our weak production model is poor, indicating that, at least in the 
region near v = v B = k 2 = q 2 0, significant omissions have been made from 
the weak production amplitude. Let us consider the entries in Table V individually: 

l,V[ +) | 0 ; The prediction of the model, 0.38, is 70% of the value 0.55 
predicted by PCAC. A detailed analysis of the PCAC prediction for K 1 {+) | 0 has 
been made by Adler and Gilman (37), who find that, in addition to the multipole 
M}^ /2) , the multipoles E^ 2 \ E$ 2) , and E^ 2} also make significant contributions 
to the dispersion integral for V[ +) | 0 . Using experimental values for the important 
multipoles in the regions of the (3, 3) resonance and the second pion-nucleon 
resonance (7V*(1520)], Adler and Gilman found £K 1 (+) | 0 = 0.47. 

ly i (0) lo : In our model the isoscalar amplitude is pure Bom approximation, 
so the barred, or non-Born, amplitude vanishes. 

lo : The analysis of Adler and Gilman (37) shows that the contribution 
of the M^l 2) multipole to the dispersion relation for V | 0 is kinematically sup¬ 
pressed, and consequently is smaller numerically than the contributions of the 
J E| 2/2) , Lfl 2) and other multipoles. This means that use of the magnetic dipole 


38 The quoted value of - > j 0 was obtained from the analysis of HOhler and Strauss (42), 

who make the approximation 

lo ~ £»*<->(0, —Mtt 2 12M N , — A/tt 2 , — A/tt 2 ) 

and calculate the physical amplitude on the right-hand side from phase shift data and a Regge 
model for the high-energy region. Similarly, to calculate £ 2 dA^^/dyg | 0 , we make the approxima¬ 
tion 


peA^'ldvs , 0 * a^^+»(o, v,, — Af w *, — Af^/a^ 1^— 

* (A^+KO, 0, - MJ, - A//) - Aw+K 0, , - MJ, - M.„*)} 


M. 


71 


and use the value 

^<+>(0,0, —Mtj 2 , -MJ) - T^'+ J (0, — Mtt 2 I2M n , —Af n 2 , —A/tt 2 ) w 2.66/M n 
calculated from phase shift data by Adler (43). 
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TABLE V 

Comparison of PC AC Predictions with the Model Developed in Section 4* 


(A) 

Value in Model 

[£ = grigriO)] 


(B) 

PCAC Prediction 



Equa¬ 

tion 

No. 

1 

0 = 0.38 

w F/9) = {Sr* = °- 55 

M n 2A/n 




5 A. 22 

W[ Q \ 


= -°' 018 

Mtf 2A/ N 




5 A. 22 






/0.012 



7 


gr r g A (0) 

FSXO)] = 0.090 - 

4 

\ for M a 

= 1 BeV; 

5 A. 22 

„ = 0.012 


j 0.071 


« 

0 

m n 1 ^(0) 


A/. 2 








Vfor M a 

= 2 BeV. ft 


U'-'\ 

0 “ °- 32 

gr L 

- _ T (1 ~ X* 

2M N *g A * 

+ ^)] 

= 0.28 e 


5A.30 


L = 0.12 

gr uX 

—— = 0.47 




5A.21 

2 

0 

2M N *g A 








gA dA ” Nl +’ 






U< + > 

L = 1.3 

* A {* 

= 3.1* 




5A.9 

3 

! o 

gr &V B 

0 





ar 

l 0 - -0.096 

ttT- [1 - Sa 2 + n v ] = -0.83 
M N 2 g A 




5A.21 

CA' 7 +> 

l 0 /A,(0)=-1.7 

gr - 2.0 
M n 





5A.21 


a Mi, 1 throughout. 

b We have parametrized g A (k 2 ) in the form g A (k 2 )jg A ( 0) — (1 -f k*/M A *)~*. 
c ‘ Obtained from the analysis of Hohler and Strauss (42). See Footnote 36. 

Obtained from the analysis of Adler (43). See Footnote 36. 

dominance approximation in the dispersion relation for is dubious, and that 
comparison of the magnetic dipole result £V£~ } | 0 = 0.012 with the PCAC 
prediction has little meaning. 

| 0 : The PCAC prediction here is in reasonable agreement with the 
value given by the model. Since the value of £A{ -) | 0 in the model (0.32) is of the 
same order of magnitude as the magnetic moment term in the PCAC prediction 
(—0.47), the model gives no theoretical reason for the neglect of the weak pion 
production terms in Eq. (5A.30) relative to the magnetic moment and the pion- 
nucleon scattering terms. 
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lo and ^ 4 ~ J lo : For each of these amplitudes individually, the model 
disagrees badly with the PCAC prediction. However, for the linear combination 
£[2A2~° lo + A{~ } | 0 ] which enters into the g A sum rule [see Eqs. (5A.25-27)], the 
prediction of the model is 0.14, in good agreement with the PCAC prediction of 
-grV - gA 2 )/(M N 2 g A ) = 0.10. 

£^ 3 +) lo : The prediction of the model here is in fair agreement with PCAC. 

£A 7 +) \ Jh A (0): Here the integral over the (3, 3) resonance 37 is in good 
agreement with PCAC. However, this is somewhat of an accident, since as we 
noted in Subsection 3C, A 7 +) (v, v B , k % ) does not satisfy an unsubtracted dispersion 
relation in v\ The significance of the good agreement is that the two terms in the 
subtraction constant of Eq. (3C.3) nearly cancel when v B ~ k 2 = 0, making the 
subtraction constant small. 

The comparison of our model with the PCAC predictions indicates that while 
agreement in the case of the photoproduction amplitudes K 1 t+) | 0 and | 0 is 
good, agreement for most of the weak production amplitudes is less than satis¬ 
factory. Thus, it may not be correct to justify our model for weak production by 
its success in photoproduction, since the comparison with the PCAC predictions 
indicates that in the weak production case, important pieces of the amplitude 
have been omitted. However, this problem may not be as serious as it appears from 
Table V. The worst discrepancies occur in the amplitudes A^ and A^) we will 
show below that the trouble with these amplitudes comes from neglecting certain 
vector meson exchange contributions to weak pion production. While the vector 
exchange terms make the major contribution to A 2 ~° | 0 and A J - * | 0 , we shall see 
in Subsection 6C that they do not greatly change the weak pion production cross 
sections in the (3, 3) region. 

5C. Vector Meson Exchange Amplitude 

In this Subsection we calculate the vector meson exchange contribution to weak 
pion production by the axial-vector current (44). We will not limit ourselves to p 
exchange alone, but rather will sum over all diagrams in which a particle with the 
quantum numbers J PG = is exchanged. As is discussed above in Subsection 
3C, such /-channel singularities are not in general correctly included when the 
5-channel dispersion integrals (the integrals over x) are extended only over the 

37 The equations relating Im A wj to Im analogous to Eqs. (4B.6-8), are 

Im v g , k> 1 = (_^) a, , 

a i = —2A/ N H / [H / _(p l0 + ^4n)(Pzo + M n ) -j- 3W+(2M n vb ■+■ Qo^o)\ 

x Im ! q ! [ k [ kj. 
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Working within the framework of perturbation theory, we show that the axial-vector vertex in spinor 
electrodynamics has anomalous properties which disagree with those found by the formal manipulation of 
field equations. Specifically, because of the presence of closed-loop “triangle diagrams,” the divergence of 
axial-vector current is not the usual expression calculated from the field equations, and the axial-vector 
current does not satisfy the usual Ward identity. One consequence is that, even after the external-line 
wave-function renormalizations are made, the axial-vector vertex is still divergent in fourth- (and higher-) 
order perturbation theory. A corollary is that the radiative corrections to vj elastic scattering in the local 
current-current theory diverge in fourth (and higher) order. A second consequence is that, in massless 
electrodynamics, despite the fact that the theory is invariant under y* transformations, the axial-vector 
current is not conserved. In an Appendix we demonstrate the uniqueness of the triangle diagrams, and 
discuss a possible connection between our results and the it 0 —* 2y and ij -+2y decays. In particular, we 
argue that as a result of triangle diagrams, the equations expressing partial conservation of axial-vector 
current (PCAC) for the neutral members of the axial-vector-current octet must be modified in a well- 
defined manner, which completely alters the PCAC predictions for the 7r° and the ij two-photon decays. 


INTRODUCTION 

T HE axial-vector vertex in spinor electrodynamics 
is of interest because of its connections (i) with 
radiative corrections to vi scattering and (ii) with the 75 
invariance of massless electrodynamics. We will show 
m this paper, within the framework of perturbation 
theory, that the axial-vector vertex has anomalous 
properties which disagree with those found by the formal 
manipulation of field equations. In particular, because 
of the presence of closed-loop “triangle diagrams,” the 
divergence of the axial-vector current is not the usual ex¬ 
pression calculated from the field equations, and the 
axial-vector current does not satisfy the usual Ward iden¬ 
tity. One consequence is that, even after extemal-lme 
wave-function renormalizations are made, the axial- 
vector vertex is still divergent in fourth- (and higher-) or¬ 
der perturbation theory. A corollary is that the radiative 
corrections to vi elastic scattering in the local current- 
current theory diverge in fourth (and higher) order. A 
second consequence is that, in massless electrodynamics, 
despite the fact that the theory is invariant under 
7 s transformations, the axial-vector current is not 
conserved. 

In Sec. I we derive the usual formulas for the axial- 
vector divergence and Ward identity, and then show 
how they are modified by the presence of triangle 
diagrams. In Sec. II we discuss various consequences of 
the additional term found in Sec. I. In the Appendix 
we show that it is not possible to redefine the triangle 
diagram in a physically acceptable way so as to elim¬ 
inate the anomalous behavior discussed in Secs. I and II. 
We also discuss in the Appendix a possible connection 
between our results and the 7r° — > 2y and rj — > 2y decays. 
In particular, we argue that as a result of triangle 
diagrams, the equations expressing partial conservation 
of axial-vector current (PCAC) for the neutral members 
of the axial-vector current octet must be modified in a 
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well-defined manner, which completely alters the PCAC 
predictions for the 7 r° and the rj two-photon decays. 

I. AXIAL CURRENT DIVERGENCE AND 
WARD IDENTITY 

We work in the usual spinor electrodynamics, 
described by the Lagrangian density 1 

£(x)=$ (a;) (iy-n— nto)p (x) — iF^ (x)F» v (x) 

- : e$ (x)y }1 p (x)A ^(x): , (1) 

dApix) dA,(x) d 

F^ix)^ -, 7-Ds=7*-• 

dx y dx* dx* 

We define the axial-vector current j^(x) and the 
pseudoscalar density p(x) by 

;'/0)= :4>(x)y fl y^(x): , 
j 5 (x)= :$(x)y 5 t(x):; 

the corresponding vertex parts T M 6 (p,p f ) and T 6 (p,p f ) 
are defined by 

d*xd*y Tty (x) jj (0)f (y))) 0 , 

(3) 

Sp'ip^ip^S/ip') 

d^ye^ x e-^-y(T{p(x)j^ 0 )£(y))) 0 . 

Using the equations of motion which follow from Eq. 
( 1 ), the divergence of the axial-vector current may 

1 We use the notation and metric conventions of J. D. Bjorken 
and S. D. Drell, Relativistic Quantum Fields (McGraw-Hill Book 
Co., New York, 1965), pp. 377-390. Note that coi 28 = ~ e° m = 1. 

2426 
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easily be calculated to be 


p-p' 


dxt 




(4) 


From Eqs. (3) and (4), we obtain the usual axial-vector 
Ward identity 


(*-* WOrfO = 2 moT 5 (p,p') 

+-S , /(/>)~ 1 76+7 6 6'f 


(5) 


Our task in this section is to see whether Eqs. (4) and 
(5), which we have formally derived from the field 
equations, actually hold in perturbation theory. 

To this end, let us rederive Eq. (5) in perturbation 
theory. It is convenient to write 


T /—y fftb ~h A/, 
r 6 =Y5+A 6 , 


( 6 ) 


where the vertex corrections A/ and A 6 and the proper 
self-energy part 2 (/>) are calculated using {p—m^r 1 as 
the free propagator. (Use of the bare mass mo=m—8m 
in the free propagator automatically includes the mass- 
renormalization counter terms.) In terms of A/, A 5 , 
and 2, Eq. (5) becomes 

(7) 

In order to derive Eq. (7), let us divide the diagrams 
contributing to A ^(p,p f ) into two types: (a) diagrams 
in which the axial-vector vertex 7^,75 is attached to the 
fermion line beginning with external four-momentum 
p* and ending with external four-momentum p\ (b) 
diagrams in which the axial-vector vertex 7^75 is 
attached to an internal closed loop [See Figs. 1 (a) and 
1(b), respectively]]. A typical contribution of type (a) 
has the form 


2n—1 fc-1 p 1 “I 

E n y u) - \t 

1 3=1 L P~hpj —woJ 


(*). 


1 


1 


-Y„Y5- 

P+Pk—tKo P'+pK—tHQ 


x —V-’o-o, 

7 -H- 1 L p'+pj—ntoJ 


( 8 ) 


where we have focused our attention on the line to which 
the y „75 vertex is attached and have denoted the 
remainder of the diagram by (* * *)• Multiplying Eq. ( 8 ) 
by (p-p'Y and making use of the identity 


1 


p+pk—tno 


(P—P f )ys 


1 


1 


1 


X 


P f +Pk— nto p+pk—nto 
1 1 


(2mo7 6 ) 


+ 


•75+71 


p'+pk-nto p+ph—tno p'+pk—tno 


(9) 


7«> y{k) y<k*») y{2«| 

p - - p+p * p 7 k , p/ > P ' 


-W- 




) 


(a) 



yl 2 n-l) 


'♦WP'-P 




P-P / 

(b) 


Fig. 1. Diagrams contributing to the axial-vector vertex, 
(a) The axial-vector vertex is attached to the fermion Kne begin¬ 
ning with external four-momentum p f and ending with external 
four-momentum p. (b) The axial-vector vertex is attached to an 
internal closed loop. 


gives, after a little algebraic rearrangement, 


2n—1 *-l p 1 “] 

e n - \ 

*-1 /=iL p+pj—triQ J 


.(*). 


1 


£+£*—wo 


•2wq7i 


X 


1 2n-ir 1 "I 

—- II 7 C/) —- 

p'+Pk — W 0 7=*H-1L £'+ Pi— Wo J 


2 n-lr 1 “I 

-(••on 7 C,) -fy 

7=*1 L p +£/— Wo J 


P r +£y— m 

(2 n) 


75 


2n-ir 1 7 

-75II y U) - U<2»)(. . • ). (10) 

/-iL P ^Pj —wqJ 


The first, second, and third terms in Eq. (10) are, 
respectively, the type-(a) piece of A 6 , and the pieces 
of — 2 (/>)Ys and —y$(p f ) corresponding to the type-(a) 
piece of A „ 6 in Eq. ( 8 ). Summing over all type-(a) 
contributions to A/, we get 

(p-pyA^(p,p’) 

=2m^KP,P')-^(P)yt-y^(P') ■ (ii) 

We turn next to contributions to A/ of type (b). A 
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typical term is 


appears to be quadratically divergent. Actually, since 


/ 


d 4 r Tr 


2n fc—1 r 1 “1 1 

s n -^ (t) - 

;-»iL f'irps —fWo-» r+pk—tn o 


-7//V5 


X 


---fi \y U) --- 1 } 

r+pk+p'—p—fn o ^"*+iL r+&+£'— p— m 0 JJ 

( 12 ) 

Multiplying by ( p—p*Y and using Eq. (9) gives 


/ 


dV Tr 


f 2» Mr 1 -| 1 

e n 1 t<»—— - 

i —WnJ 7-\~Pk—tn o 


fc-i y-iL r+^y—m 0 . 
1 2 nr 


-2woY5 


X 


n - 


</). 


r+Ab+£'-^-w 0 f-*+iL r+£ 

X(-. • )+ f d 4 r tr 75 II [y w) -1 

J l j"i L r“h^#“-wnJ 


— 1 

>j+p t —p—tn Q J 


-y>nW» ---ll(**->* ( 13 > 

7-1 L r+pj+pt—p— m 0 J J 


tr{7a7 (1) rY C2) r}=0, (15) 

the integral in the n= 1 case is superficially linearly 
divergent. Since it is well known that translation of a 
linearly divergent integral is not necessarily a valid 
operation, 2 we must check carefully to see whether 
Eq. (14) holds for the triangle graph. 

To do this we make use of an explicit expression for 
the triangle graph calculated by Rosenberg. 3 The sum 
of the diagram illustrated in Fig. 2 and the correspond¬ 
ing diagram with the two photons interchanged is 




(2*y 


-R 




r d 4 r 

I (2*y ( 


l)tr 


r+fo—Wo 


(— ie<fY*) 


t 1 

X -(- Wc 7 p ) 


r—w 0 


7 —— Wo 


7*75 


. ( 16 ) 


Evaluation of Eq. (16) by the usual regulator techniques 
leads to the following expression for R aptt \\Aj denotes 
Ajikifkz)"]- 


The first term in Eq. (13) is the type-(b) contribution 
to A 6 corresponding to Eq. (12), while making the 
change of variable r—tr+p f —p in the integration in 
the second term causes the second and third terms to 
cancel. This gives, when we sum over all type-(b) 
contributions, 

(14) 

The Ward identity of Eq. (7) is finally obtained by 
adding Eqs. (11) and (14). 

Clearly, the only step of the above derivation which 
is not simply an algebraic rearrangement is the change 
of integration variable in the second term of Eq. (13). 
This will be a valid operation provided that the integral 
is at worst superficially logarithmically divergent, a 
condition that is satisfied by loops with four or more 
photons, that is, loops with n> 2. However, when the 
loop is a triangle graph with only two photons emerging 
(See Fig. 2) we have » — 1, and the integral m Eq. (13) 



Fig, 2. The axial-vector triangle graph. There fe a second 
diagram, with the photon four-momenta and polarization indices 
interchanged, which makes a contribution equal to that of the 
diagram pictured. 


Rapjtihlfkz) A lki r € T *t>p+A,2h2 T €T<rpii 

+^4a£lp£l^2 T €f T <r#i+^4 4&2p£l*&2 T €fr(rp 
m \~Ahk\. a kl*kv. T *tTpp'\'A 6&2<r^ 1^2 r CfT -pn 7 

Ai = ki* kzA z~\~h^A 4 , 

^4 2 = hj^A * h%A q , 



A3(kiM)= — A 6 (k 2y k i)= — 16ir 2 In(k 1 ,^ 2 ), 

A 4(kl,kz)= —^4 5(£2,£l) = 1 2 ) — I lo(^l)^2)] , 


where 


I,t(k h k 2 ) 



dy x?y t [y(\ m “y)h'£ 


+#(1—#)^2 2 -|“ 2%yki'k% —wq 2 ] -1 . (18) 


8 J, M. Tauch and F, Rohrhch, The Theory of Photons and 
Electrons (Addison-Wesley Publishing Co., Inc., Cambridge, 
Mass., 1955), pp, 458-461. 

8 L. Rosenberg, Phys. Rev. 129, 2786 (1963). In Eq. (16) and 
Fig. 2, we have labeled the legs of the triangle in accordance with 
Rosenberg’s notation, which differs from the labeling convention 
used in Eqs. (12) and (13). Because the integral defining the 
triangle graph is linearly divergent, the value of the triangle 
graph is ambiguous and depends on the labeling convention and 
file method of evaluation of the integral. For example, if Eq, (16) 
is evaluated by symmetric integration about the origin in r space, 
the value of R 9Plt so obtained satisfies the usual axial-vector Ward 
identity (but is not gauge-invariant with respect to the vector 
indices). If, on the other hand, Eq. (16) is evaluated by symmetric 
integration around some other point in r space, say r=k\ [or, 
alternatively, if we integrate symmetrically around r=0 but label 
the triangle using the convention of Eqs, (12) and (13)], then the 
result has an anomalous axial-vector Ward identity. The value in 
Eq. (17) which we have assigned to is the unique value which 
is gauge-invariant with respect to the vector indices. Further 
discussion of the ambiguity in the definition of Eq. (16), and a 
justification of the specific choice in Eq. (17), are given in the 
Appendix. 
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We will also need an expression for the triangle graph 
with 7^75 replaced by 2mo7s. Defining 



dV 

(2^ 


(-l)tr 


t 

- (-ie o7«r) 

. r+ki — mo 


X- (-ie oyf) 


7—trio 


r—kz—tno 


-2wo7« 


, (19) 


we find that 

kl^2 T G$TaflBl ) 

B i= Sifonol oo(ki f k2) 



We are now ready to calculate the divergence of the 
axial-vector triangle diagram. If the Ward identity 
holds, we should find 

— (^ 1 +^ 2 )^^^= 2tnoR ap , (21) 



P Zn = P"P' 

Fig. 3. Diagram for calculation of the asymptotic behavior 

of the general axial-vector loop. 




Subgraph 4 Subgraph 2 

a(l)=-(2nH) CZ(2)>-(2nH) + 4 

Fig. 4. Subgraphs (doubled lines) which determine the 
asymptotic behavior of Fig. 3. 


while the momentum p—p f carried by the axial-vector 
current is held fixed. According to Weinberg's theorem , 4 
the asymptotic behavior of the loop graph in this 
limit is 

Hln £)*, (24) 

where is undetermined by Weinberg's analysis and 
where a is the maximum of the superficial divergences 6 
ot(g) of the subgraphs 6 g linking the 2 n photon lines 
(i.e., linking the momenta which are becoming infinite). 
For the diagram of Fig. 3 there are two such subgraphs, 
illustrated in Fig, 4, with superficial divergences a(l) 
= — 2«+l and a(2)=— 2»+3. Thus, the asymptotic 
coefficient a is a(2)=— 2»+3, and comes from the 
subgraph in which all propagators in the loop are 
involved. Now Weinberg’s theorem always tells us 
what the maximal asymptotic power of a graph is, but 
it does not guarantee that the coefficient of the maximal 
term is nonvanishing. In fact, in the case of the axial- 
vector loop diagram we will show that the coefficient 
of the £" 2 n+ 8 (ln£)^ term does vanish, so that the leading 
asymptotic behavior is £" 2 n+ 2 (ln£) 0 ', one power lower 
than is predicted by naive power counting. Let us denote 
by Lip—p'jMa'jpw'tpin-d the graph illustrated 
in Fig. 3, 


but from Eqs. (16)-(20) we find, instead, 

— (k\-\-k2) ti R V(tii =2moR- V(t -\-%Tpk\Sk2 T z$T9{f ( 22 ) 

We see that the axial-vector Ward identity fails in the 
case of the triangle graph . The failure is a result of the 
fact that the integration variable in a linearly divergent 
Feynman integral cannot be freely translated. 

The breakdown of the axial-vector Ward identity 
which we have just found is related to another anom¬ 
alous property of the triangle graph. To see this, let 
us consider the behavior of the general axial-vector 
loop diagram with In photon vertices (See Fig. 3), 
as the 2n— 1 independent photon momenta k\, •••, 
kzn-i approach infinity simultaneously in the manner 

h= , y-1, •••, 2 n- 1; ^ 

qj fixed , co , 


L(p—p ', Wo; p 1 , • * •, p2n-\) 


= / <JV Tr 


X7 (fc) - 


f 2» fc-i r 1 -1 

x: n Y t3 ’>- 

*-1 3=1 L r+pj-mo j 

1 1 


-Y/iYs- 

7 +pk—trio 7 +p k-bp 1r —p ~ mo 


X 


2 n r 

n h 

j—fe-fiL 


•(/). 


-— 11 . 


7+pi+p f —p—mo 


(25) 


4 S. Weinberg, Phys. Rev. 118, 838 (1960). For a simplified 
exposition of Weinberg’s results, see J. D. Bjorken and S. D. 
Dreil, Ref. 1, j>p. 317-330 and pp. 364-368. Weinberg’s theorem 
applies for arbitrary spacelike four-vectors q,. There can also be 
powers of In Inf, In In Inf, etc., in Eq. (24), which we do not 
indicate explicitly. 

6 The superficial divergence of the subgraph is obtained, as 
usual, by adding —1 for each internal fermion line, —2 for each 
internal boson line, and +4 for each internal integration. For the 
precise definition of subgraph in the general case, see Ref. 4- 
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Clearly we can write 

L(p — p f ,mo\ pi,' * * jp2n— l) 

(A) ~L(p—p ( , mo; pi ,-• *,£ 2n -i) 

— L(0,WoJ pi,* • *,^2n-l) (26) 

(B) +£(0,mo; pi,- - *,?2»i-0“-^(0,0; pi,- • • ,p& i-0 

(C) +£(0,0; p\,- -• yptn- 0- 


Because differencing the loop graph with respect to 
either the axial-vector current four-momentum p — p* or 
the fermion mass mo decreases the degree of divergence 
by one, terms (A) and (B) on the right-hand side of 
Eq. (26) have a(2)=—2»+2, and therefore behave 
asymptotically as £ -2 " +2 (ln£)' J/ . Term (C) on the 
right-hand side of Eq. (26) can be rewritten as 

L(0, 0; pi t - • *, p2n —i) 


Jd*r Tr 


2» fc—i r In 1 

E II y U) - 7 Ci) - 7*75 

*-i /-I L r+pjJ r+p k 


X 




j d*r Tr 


d 

7s— 
dr* 


2 n P 

n ■ 

3-1 L 




Integrating by parts with respect to r gives 


£(0,0;£i s *--,£ 2 n-i)=0, 

proving that the asymptotic behavior of the loop graph 
is one power better than given by Weinberg’s theorem. 

The only nonalgebraic step in this proof is the 
integration by parts with respect to r, an operation 
which is valid provided that the integration variable in 


J d 4 r Tr 



can be freely translated. This is the same condition as 
we found above for validity of the axial-vector Ward 
identity. Thus again, our proof is valid for n> 2, but 
we expect possible trouble in the case of the triangle 
graph (n= 1). From the explicit expression for the 
triangle graph in Eqs. (17) and (18), we see that if we 
write ki=£q, kz=—iq+p r —p, then as £ —► oo we find 

» — 8 t r 2 £g r e rw + 0 (ln£). ( 29 ) 

In other words, the asymptotic power is a= 1= — 2»+3, 
as given by Weinberg’s rules, rather than one power 
lower, as is the case for the loop graphs with n> 2. 
It is easy to check that when Eq. (29) is multiplied by 
— (ki+k^)*, the term with the anomalous asymptotic 
behavior agrees, for large £, with the term in Eq. (22) 
which violates the Ward identity. Thus, the breakdown 
of the axial-vector Ward identity in the triangle graph 



>>i 


Fig. 5. Contribution of the triangle diagram to the general 
axial-vector vertex. We have not drawn the second diagram in 
which the photon lines emerging from the triangle are crossed. 


and the anomalous asymptotic behavior of the triangle 
graph are basically the same phenomenon. 

It is clear that the breakdown of the Ward identity 
for the basic triangle graph will also cause failure of the 
Ward identity for any graph of the type illustrated in 
Fig. 5, in which the two photon lines coming out of the 
triangle graph join onto a “blob” from which If 
fermion and b boson lines emerge. From Eq. (22) for 
the divergence of the basic triangle graph, it is possible 
to show that the breakdown of the axial-vector Ward 
identity in the general case is simply described by 
replacing Eq. (4) for the axial-vector-current divergence 
(which we have shown to be incorrect) by 

d ao 

— jV(x) = 2imof(x)+—:Ff(x)F'*(x): w (30) 

dXft 4 v 

[Equation (30) is easily verified by using the Feynman 
rules for the vertices of jf,j 6 , and (oto/^):F^F T *: 
which are given in Fig. 6.] For example, if we define 

P(P,P') by 

S/(P)F(P,P')Sf'(P')= — Jd i xd i y e i P ,x er ip, ' v 

X (T(p(x ): F*'(0)F"(0): W(y))>o, (31) 

then the axial-vertex Ward identity of Eq. (5) is 
modified to read 

(P^pO^/(p,p')=2ii»iP(p f p')-i(a^4r)f(p f p / ) 

+S F f (p)- 1 y 5 +ysS F '(p')- 1 - (32) 


OPERATOR 


VERTEX FACTOR 


i°CXl 


p p 


p j 


r* 


&:F ftr mF T />ui:e f<rT/) kfkj e (a . Tp 

J. 6. Feynman rules for the vertices appearing in Eq. (30) 
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Equation (30), which is the principal result of this 
section, states the surprising fact that the axial-vector- 
current divergence , as calculated in perturbation theory , 
contains a well-defined extra term which is not obtained 
when the axial-vector divergence is calculated by formal 
use of the equations of motion . 6 

II. CONSEQUENCES OF THE EXTRA TERM 

In this section we investigate the consequences of 
the extra term which we have found in the axial-vector- 
current divergence [Eq. (30)] and in the axial-vector- 
current Ward identity [Eq. (32)]. We consider, in 
particular, the questions of (A) renormalization of the 
axial-vector vertex, (B) radiative corrections to v x l 
scattering, and (C) the connection between 75 invar¬ 
iance and a conserved axial-vector current in massless 
quantum electrodynamics. 

A. Renormalization of the Axial-Vector Vertex 

Recently, Preparata and Weisberger 7 have proved 
the following theorem: If a local current, constructed 
as a bilinear product of fermion fields, is conserved apart 
from mass terms, then the vertex parts of both the 
current and its divergence are made finite by multi¬ 
plication by the wave-function renormalization con¬ 
stants of the fields from which the current is constructed. 
If Eq. (4) correctly described the divergence of the 
axial-vector current in spinor electrodynamics, then the 
theorem of Preparata and Weisberger would apply in 
this case. However, we have seen that the divergence 
is actually given by Eq. (30), and involves an additional 
term which is not a mass term. The effect of this extra 
term, we shall see, is to cause the Preparata-Weisberger 
argument to break down. 

First let us review how the Preparata-Weisberger 
result could be derived if Eq. (4), and the corresponding 
Ward identify of Eq. (5), were true. Since both j M 6 and 
j 6 are local bilinear products of fermion fields, the vertex 
parts iy and T 6 are multiplicatively renormalizable. Thus 
we can write 


r/(*,*>z*- l iy(*,*') > 

TKP^Zjr'TKPt'), (33) 

s F '(p)=z 2 s F '(p), 

where the tilde quantities are finite (cutoff-independent) 
and where Z Aj Zd, and Z 2 are cutoff-dependent re¬ 
normalization constants. Substituting Eq. (32) into 
Eq. (5) we get 

(p-p r yf fl Hp,p')=(2m a Z A /Z D )^(p,p') 

_ +(ZA/Z 2 )l5 F '(p)-'y 5 +y l 5 F '(p l )~'l, (34) 

6 We show in the Appendix that this extra term cannot be 
eliminated by redefining the triangle graph. 

7 G. Preparata and W. I. Weisberger, Phys. Rev. 175, 1965 
(1968), Appendix C. 



Fig. 7. Diagram giving the Iowest-order contribution of the 
extra term in Eq. (32). The heavy dot denotes the vertex of 
(ao/4ir ):Fl*F*r: e*» rp . 


and varying the cutoff gives 
0= b(2moZ A /Z D )T^p t p f )+b(Z A /Z 2 ) 

X [5 f (p)~ x y ,+7 f (pO -1 ] * (35) 

Putting p,p f t or both on mass shell then implies that 
8(2moZ A /Zj)) = 8(Z A /Z z ) = 0 1 (36) 

which means that both 2mfZ A /Z 1> and Z A /Zi are 
cutoff-independent, and hence finite. Thus, if Eqs. (4) 
and (5) were correct, multiplication by the wave- 
function renormalization constant Z 2 would make T M 6 
and T 6 fiinte. 

Let us now consider the actual situation, in which 
the divergence of the axial-vector current is given by 
Eq. (30) and the axial-vector Ward identity by Eq. (32). 
The extra term in Eq. (32) first appears in order a 0 2 of 
perturbation theory. [See Fig. 7.] This lowest-order 
contribution is already logarithmically divergent; 
introducing a cutoff by replacing the photon propagator 
1 /(f+U) with [l/($ 2 +te)][—A 2 /(—A+g 2 +te)], we 
find that 

— i(a 0 /4ar)F(p t p f )= — l(a 0 /ir) 2 }n(A?/m 2 )(p—p)t i 

X 7 ^ 75 +a 0 2 Xfinite+O(a 0 3 ). (37) 

We will also need part of the expression for T 5 (p,p ( ) to 
order a 0 , 


r5 (A? / ) = 75[l+0(ao)]+(ao/27r)m 0 


Xl(p i p , )(p-p , )> t y M y 6 +0(ao 2 ), 


I(P>P')= f dx f dy£x(l—x)p 2 +y(l—y)p' 2 
Jo Jo 

— 2xyp -p f — («+y) Wo 2 ]” 1 . 



Comparing Eqs. (37) and (38), we see that it is imposs¬ 
ible to cancel away the divergence in Eq. (37) by adding to 
it a constant multiple of Eq. (38 ): A constant counter 
term of order a 0 2 multiplying the leading 75 term in 
Eq. (38) cannot cancel the divergence in Eq. (37), 
because the latter is proportional to (p—p , ) lt y M y& i 
while a constant counter term of order a 0 multiplying 
the (p — p^^y/Yt term in Eq. (38) cannot cancel the 
divergence in Eq. (37) because of the nontrivial func¬ 
tional dependence of I(p t p') on p and p*. In other 
words, the axial-vector divergence with the extra term 
included, 

2m 0 r 6 (p t p')-i(a 0 /4ir)F(p,p '), ( 39 ) 

is not multiplicatively renormalizable. 
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Fig. 8 . Lowest-order contribution of the triangle diagram to the 
axial-vector vertex. We have not drawn the diagram in which the 
photon lines are crossed. 

Since multiplicative renormalizability of the diver¬ 
gence was essential to the Preparata-Weisberger argu¬ 
ment outlined above, this argument no longer applies. 
We expect, then, that even after multiplication by Z 2 , 
there will still be logarithmically divergent terms in the 
axial-vector vertex. Such terms first appear in order ao 2 
of perturbation theory, as a result of the diagram shown 
in Fig. 8; the divergence of Fig. 8 is just a consequence 
of the anomalous asymptotic behavior of the triangle 
graph pointed out in Sec. I. Introducing a cutoff in 
the photon propagator as above, we find that 

Z 2 T M *(p i p f ) = y M yf£l-i(ao/ir) 2 ln(A 2 /m 2 )] 

+«oXfinite+«(?Xfinite+0(ao 3 ). (40) 

Equation (40) shows explicitly that the axial-vector 
vertex, while still multiplicatively renormalizable, is not 
simply made finite by multiplication by the wave- 
function renormalization constant Z 2 . Rather, we have 
[see Eq. (33)] 

Z*=Z 2 [l+f (ao/*)* In (A 8 /m 2 ) +0 (ao 8 )]» (41) 

B. Radiative Corrections to viZ Scattering 

As an application of Eq. (40), let us consider the 
radiative corrections to vj, scattering, where l is a u or 
an e . According to the usual local current-current theory, 
the leptonic weak interactions are described by the 
effective Lagrangian 

£eff=(G/^);WS (42) 

where 10~ 6 /M P roton 2 is the Fermi constant and 
where 3 

i x =iVX X ( 1—Ys^+ivrHl - Ys)^ (43) 

is the leptonic current. In addition to the usual terms 
describing muon decay, Eq. (42) contains the terms 

(G/\2](ju7x(1— 75 ) »>iVY x (l — Y s)m 

+^7x(l—75)^7 x (1—75>], (44) 

which describe elastic neutrino-lepton scattering. In 
order to study radiative corrections to the basic vH 
scattering process, it is convenient to use a Fierz trans¬ 
formation to rewrite Eq. (44) in the form (the so-called 


“charge retention ordering”) 

(G/V2](ju7x(1—75)m?m7 x (1—76)^m 

+eyx(l—75)eivy x (l —ys )vj. (45) 

The radiative corrections to Eq. (45) may then be 
obtained simply by calculating the radiative correc¬ 
tions to the charged lepton currents m 7 x( 1 — 75 )^ and 
^ 7 x(l— 7 s)c, without any reference to the neutrino 
currents. 

Now, application of standard electrodynamic pertur¬ 
bation theory shows that the effect of the radiative 
corrections to the charged lepton currents is to replace 
the matrix elements my\(1—Ys)mj ® 7 x(l— 75 ) e, (we use 
/i, e to denote spinors here) by 

jaz 2 (#,) [rx w -rx 5W >, eZ 2 <*>(Tx<*>-rx 6 <®>>. (45) 

In Eq. (46), IV^-® 5 and IV 1 ''*® 5 denote the proper vector 
and axial-vector vertices, while the wave-function 
renormalization factors Z 2 (m, ® ) come from self-energy 
insertions on the external lepton lines which run into 
and out of the proper vertices. From the usual electro¬ 
dynamic Ward identity for the vector part, we know 
that Zi^lV^ and Z 2 ( ® 5 rx (e) are finite. On the other 
hand, Eq. (40) tells us that 

Z 2 ^-®)rx 6 ^* e) = 7x75[l—|(aoA) 2 ln(A 2 /m 2 )] 

+aoXfinite+a 0 2 Xfinite+0(a 0 3 ), (47) 

which means that, on account of the presence of axial- 
vector triangle diagrams, the radiative corrections to v e e 
and scattering diverge in the fourth order of perturba¬ 
tion theory . This result contrasts sharply with the fact 
that the radiative corrections to muon decay or to the 
scattering reaction j^+e—> v 0 -hu are finite to all 
orders in perturbation theory. 7 The crucial difference 
between the two cases, of course, is that because of 
separate muon and electron-number conservation, the 
current A7x(l — 7s)« cannot couple into closed electron 
or muon loops, and thus the troublesome triangle 
diagram is not present. 

Two points of view can be taken towards the diver¬ 
gent radiative corrections in yfi scattering. One view¬ 
point is that we know, in any case, that the local 
current-current theory of leptonic weak interactions 
cannot be correct, since this theory leads at high energies 
to nonunitary matrix elements, and since it gives 
divergent results for higher-order weak-interaction 
effects. 9 Thus, it is entirely possible that the modifica¬ 
tions in Eq. (44) necessary to give a satisfactory weak- 
interaction theory will also cure the disease of infinite 
radiative corrections in vj, scattering. The other view¬ 
point is that we should try to make the radiative 
corrections to vil scattering finite, within the framework 
of a local weak-interaction theory. It turns out that this 

9 For recent discussions of the sicknesses of the local current- 
current theory and their possible remedies, see N. Christ, Phys. 
Rev. 176, 2086 (1968); and M. Gell-Mann, M. L, Goldberger, 
N. M. Kroll, and F. E. Low, Phys. Rev. (to be published). 


^ ® We omit the normal ordering signs. 
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is possible, if we introduce veU and y M e scattering terms 
into the effective Lagrangian, so that Eq. (44) is 
replaced by 

(G/V2)[>7x (1 “7 s)m—( 1—7s)«] 

X[Pm7 x ( 1—7s)^—*>e7 x (l—75)^J- (48) 

This works because the troublesome extra term in 
Eq. (30) is independent of the bare mass mo, so that it 
cancels between the muon and electron terms in Eq. 
(48), giving 10 

d 

-D* 7 x 75 M— ^7 \ 7 ee] = limf^jiy 5 M “ 2im 0 (e) ey 5 e . (49) 

dx\ 

Application of the Prepara ta-Weisberger argument to 
Eq. (49) then shows that the radiative corrections to 
Eq. (48) are finite in all orders of perturbation theory. 
Experimentally, it will be possible to distinguish 
between Eq. (48) and Eq. (44) by looking for elastic 
scattering of muon neutrinos from electrons. 

C. Connection Between y 5 Invariance and a 
Conserved Axial-Vector Current in 
Massless Electrodynamics 

Finally, let us discuss the effects of the axial-vector 
triangle diagram in the case of massless spinor electro¬ 
dynamics [Eq. (1) with f»o=0]. We will find that the 
triangle diagram leads to a breakdown of the usual 
connection between symmetries of the Lagrangian and 
conserved currents. As in our previous discussions, we 
begin by describing the standard theory, which holds 
in the absence of singular phenomena . 8 Let {$(#)} 
= {$i(^),$ 2 (^),*'and {d\$} be a set of canonical 
fields and their space-time derivatives, and let us 
consider the field theory described by the Lagrangian 
density 

(50) 

To establish the connection between invariance proper¬ 
ties of £ and conserved currents, we make the infinites¬ 
imal, local gauge transformation on the fields, 

*/(*) — $i(*)+A(*)Gj[{$(*)}], (51) 

and define the associated current J a by 

J«=-6£/8(djL). (52) 

Then, by using the Euler-Lagrange equations of motion 
of the fields, we easily find 11 that the divergence of the 
current is given by 

_ a a /«=-5£/5A. (53) 

10 What is happening here is that the muon triangle diagram 
and the electron triangle diagram contribute with opposite sign, 
and so regularize each other. 

11 For details, see S. L. Adler and R. F. Dashen, Current Algebras 
(W. A. Benjamin, Inc., New York, 1968), pp. 15—18. 
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In particular, if the gauge transformation of Eq. (51), 
with constant gauge function A, leaves the Lagrangian 
invariant, then 5£/5A=0 and the current J a is con¬ 
served. Thus, to any continuous invariance trans¬ 
formation of the Lagrangian there is associated a 

conserved current It is also easily verified that the 
charge Q0) = Jd?xJ*(x,t) associated with the current 
J a has the properties 

dQ(t)/dt=0 } (54a) 

[ft*i(*)] = «*(*)• (54b) 

Equation (54b) states that Q is the generator of the 
gauge transformation in Eq. (51), for constant A. 

Let us now specialize to the case of massless electro¬ 
dynamics, with Eq. (51) the gauge transformation 

$(x) -*■ [l+fYBA(*)3Ka:). (55) 

When A is a constant and mo=0, this transformation 
leaves the Lagrangian of Eq. (1) invariant, so that 
according to Eq. (53), the associated current J a should 
be conserved. But calculating we find 

- 5£/5(d«A) , (56) 

which according to Eq. (30) has the divergence 

d a J a = (ao /4tr)Ft*(x)F Tp (x)ct 9rp . (57) 

Thus, Eq. (53), which was obtained by formal calcula¬ 
tion using the equations of motion, breaks down in this 
case. We see that because of the presence of the axial- 
vector triangle diagram, even though the Lagrangian (and 
all orders of perturbation theory) of massless electro¬ 
dynamics are y 6 invariant , the axial-vector current 
associated with the 75 transformation is not conserved . 

However, it is amusing that even though there is no 
conserved current connected with the 75 transformation, 
there is still a generator Q 6 with the properties of Eq. 
(54). To see this, let us consider the quantity j 6 defined 
by 

, ao dA r (x) 

Ji> 5 (x) = j f ?(x) - A ( (x) -(58) 

w dx p 

referring to Eq. (30), we see that 

d 

— in B (*)=0. (59) 

dx^ 

Although ^ 6 is conserved, it is explicitly gauge-dependent 
and therefore is not an observable current operator . But 
the associated charge 

Q 5 = j&x jo 5 (x) 

— J —A.VXA] (60) 
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is gauge-invariant and therefore observable. According 
toEq. (59), Q 5 is time-independent, and its commutator 
with \p(x) (calculated formally by use of the canonical 
commutation relations) is 

[0 6 ,^(z)]= -yii(x)=i[iy s \p(x)2- (61) 

Comparison with Eq. (59) then shows that Q* is the 
conserved generator of the 75 transformations. 12 

After this manuscript was completed, we learned that 
Bell and Jackiw 18 had independently studied the 
anomalous properties of the axial-vector triangle graph, 
in the context of the <r model. In the Appendix we 
discuss certain questions raised both by the paper of 
Bell and Jackiw and in conversations with Professor 
S. Coleman. 

Note added in proof. (1) All field quantities appearing 
in the paper denote unrenormalized fields, with the one 
exception that in Eqs. (A29), (A30), and (A34), <f> r 9 
and <t> n denote, respectively, the renormalized pion and 
17 fields. 

(2) It is our claim that Eq. (30) is an exact result, 
valid to all orders in electromagnetism, and similarly 
that the <r-model analog, Eq. (A22), is exact to all orders 
in both the electromagnetic and strong couplings. These 
conclusions follow in our diagrammatic analysis from 
the fact that electromagnetic or strong radiative correc¬ 
tions to the basic triangle always involve axial-vector 
loops with more than three vertices, which satisfy the 
normal axial-vector Ward identities. A more detailed 
discussion of this question will be given by the author 
and W. A. Bardeen (to be published). 

(3) Field-theoretic derivations of Eq. (30) have been 
given by C. R. Hagen (to be published), R. Jackiw and 
K. Johnson (to be published), B. Zumino (to be 
published), and R. A. Brandt (to be published). Jackiw 
and Johnson point out that the essential features of 
the field-theoretic derivation, in the case of external 
electromagnetic fields, are contained in J. Schwinger, 
Phys. Rev. 82, 664 (1951). 

(4) In Eq. (Al) we state that the general form of the 
triangle diagram is R app , Rosenberg’s gauge-invariant 
expression, plus an arbitrary multiple of e r a PP (ki—k 2 ) r ; 
we infer this form for the extra term by studying how 
the triangle graph is changed by shifts in the integration 
variable. It is easy to see that this is the only allowed 
form for the ambiguity, by noting that the extra term 
must satisfy the following conditions, (i) The extra 
term must have the dimensions of a mass; (ii) the extra 
term must be a three-index (<rp/i) Lorentz pseudotensor; 

(iii) the extra term must be symmetric under inter¬ 
change of the photon variables and (£ 2 ,p); 

(iv) the extra term must have no singularities in any 
of the variables ki*, k 2 2 , kvk 2 and mo, since the dis- 

u Because of an implicit photon field dependence of ja B (x) 
implied by Eq. (30), Q fi does commute with all the photon field 
variables. The details of showing this are complicated, and will 
be given elsewhere. 

w J. S. Bell and R. Jackiw (unpublished). 


continuities of the triangle diagram across its singulari¬ 
ties involve no linear divergences and hence are un¬ 
ambiguously contained in Rosenberg’s expression R vpfi . 

(5) The statement in Ref. 20, that the simultaneous 
presence of iso scalar and iso vector vector mesons 
affects the >-27 prediction, is not correct. There 
will, of course, be an extra term of the form 

dB*(I= \)/dx v dB r (I=$)/dxfo vrp 

in the PCAC equation. However, the matrix element 
of this term relevant to the?^—► 27 low-energy theorem, 
when expressed in terms of Fourier transforms of the 
vector-meson fields, is proportional to 

jd'k(y(k ht ,)y(k^) | £* +il W(/= 1 )£_*'(/= 0 ) 10 ) 

X (^ 1 +^ 2 )^^^^- 

Because of photon gauge invariance, the matrix element 
(7(^i,ei)7(^,e 2 ) 12W*,*(/= l)£- fc '(/=0) 10> 

is proportional to k\k 2j and so the two-vector meson 
term is of order k\k 2 {k\-\-k^. Since the low-energy 
theorem involves only terms of order k\k 2i the two- 
vector meson contribution is of higher order and does 
not affect our result. This also means that the extra 
terms in the PCAC equation proposed recently by 
R. Arnowitt, M. H. Friedman, and P. Nath, Phys. 
Letters 27B, 657 (1968), do not in fact lead to a non¬ 
null PCAC prediction for 7 r°— ► 2y. 
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APPENDIX 

We discuss here the following questions raised both 
by the recent paper of Bell and Jackiw and in conversa¬ 
tions with Professor S. Coleman: (1) Is the expression 
R app [see Eq. (17)J which we have used for the triangle 
graph unique, or is it possible to redefine R app by a sub¬ 
traction in such a way as to eliminate the anomalies 
discussed in the text? (2) What is the connection 
between our results and the <r-model discussion of Bell 
and Jackiw, and between our results and the physical 
it"—► 2 y and ij —► 27 decays? 

A. Uniqueness of the Triangle Graph 

The expression for R app in Eq. (17) is obtained from 
Eq. (16) by the regulator technique of subtracting from 


250 


Adventures in Theoretical Physics 


177 AXIAL-VECTOR VERTEX IN 

Eq. (16) a loop with mo replaced by M, performing the 
r integration, and then letting Af—► *>. Clearly, any 
mass-independent terms in Eq. (16) will be lost in this 
process. That a mass-independent term is present can 
be seen from the fact that when we make the change of 
integration variable r—> r+aki+bkz in Eq. (16), the 
result is not left invariant, but rather is changed by 
multiples of e rap Jk\ r and e r<rp) Ji 2 r > If we are careful to 
preserve symmetry with respect to the photon variables, 
the change will be proportional to e TaP[l (k\—k 2 ) r . The 
noninvariance of the triangle graph under changes 
of integration variable is of course just a result of the 
linear divergence in Eq. (16), and means that in a 
nonregulator calculation the results obtained for the 
triangle graph will depend on how the external momenta 
ki and k 2 are taken to run through the internal lines. 
We may express this ambiguity formally by writing 
that the general expression for the triangle graph is 

-^<rpA*Cf3 = -^<rpM+f € T<rpA.(^l—^2) T , (Al) 

with R afill the regulator value in Eq. (17). 

We easily find the following properties of 

(i) vector index divergence: 

ki*R„jLf]= ^ki <r k2 r e T<r p tly (A2) 

*a^wCf]=f*a p *i T er W ; 

(ii) axial-vector index divergence: 

= 2 maR afi + (8^- 2f)^i^2 T e € r«rp; (A3) 

(iii) asymptotic behavior: Writing ki = £<?, k 2 = —£<? 
+p'— p, as £—► co 

*,p,[f]-* -£(8**-2r)* f * w ; (A4) 

(iv) axial-vector meson to two-photon matrix ele¬ 
ment: If b (£ 1 +^ 2 )= ei'ki= e 2 ^h 2 = &i 2= ^ 2 ^ = 0, then 3 

^ei'e a ^ ir p #( Cf]=r^ei'^(fti-ft a ) T e rirP|1 ; (A5) 

(v) large mo behavior: 

lim 22 w [?]= fe«w,(ii-W T . (A6) 

WM)-*oo 

Referring first to Eqs. (A2)-(A4), we see that when 
f=0, which is the case discussed in the text, the triangle 
graph is gauge-invariant with respect to the photon 
indices but has an anomalous axial-vector Ward identity 
and anomalous asymptotic behavior. By contrast, when 
f = 47T 2 there is no longer gauge invariance with respect 
to the photon indices, but the axial-vector Ward 
identity and the asymptotic behavior as £—» 00 axe 
normal. Since the formal proof of gauge invariance for 
the triangle graph suffers from the same difficulties as 
does the formal proof of the axial-vector Ward identity, 
there is no a priori reason to demand gauge invariance 
with respect to the photon indices as opposed to a normal 
axial-vector Ward identity, or, for that matter, to 
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demand either. In other words, as long as we consider 
only the divergence properties of .R w [j£l|j there is no 
requirement fixing {*. 

There are, however, two additional restrictions on 
R aptl which force us to choose f=0. First of all, we recall 14 
that two real photons can never be in a state with total 
angular momentum 1, which means that the matrix 
element for an axial-vector meson to decay into two 
photons must vanish. In order for our triangle graph 
to satisfy this requirement, we must have ] 

= 0 when l is an axial-vector meson polarization vector 
satisfying b (^ 1 +^ 2 )= 0 and when the photon variables 
satisfy evk\= e 2 ‘k 2 =ki 2 =k2 2 =0. Referring to Eq. 
(A5), we see that this requirement forces us to choose 
f=0. [To check that, even with the constraints on 
/, e, etc., the expression l M ei <r e 2 f> (ki— k 2 ) r e r v fiP is in general 
nonvanishing, choose ki= (— 1,1,0,0), ei= (0,0,1,0), 
* a = (—2,0, 2,0), ««= (0,1,0,0), *i+**=(-3,l, 2,0), 
/= (0,0,0,1), Jfei—Jh= (1,1, —2, 0).] Secondly, it is 
physically unreasonable that a loop diagram such as 
our triangle graph should influence low-energy phenom¬ 
ena in the limit as the mass of the loop fermion becomes 
infinite. In other words, we expect 

lim i?<rp>.[f]=0, k 2 fixed (A7) 

mo-* 00 

which according to Eq. (A6) again requires f=0. Thus, 
there are strong physical restrictions which uniquely 
select the regulator value for the triangle graph ; in partic¬ 
ular, it is not permissible to make the choice {* = 471^ 
which eliminates the anomalies discussed in the text. 

B. Connection with Bell and Jackiw and with 

5c° —> 2f and ij —► 2f Decay 

In a recent paper, Bell and Jackiw discuss 7r°—► 2y in 
the <r model; they find and attempt to resolve a paradox 
arising from the presence of triangle diagrams. We briefly 
summarize their work, and then discuss our own inter¬ 
pretation of the paradox, which differs from theirs. 16 
Bell and Jackiw use a truncated version of the <r model, 
in which the charged pion and the neutron fields are 
omitted. Letting ip r <t >, and <r be, respectively, the fields 
of the proton, the neutral pion, and the scalar meson, 
the Lagrangian density is 8 

□ —mo+go(<r+^76)]^+2[(d0) 2 + (dtr) 8 ] 

-|MoV-KMo 2 +2X o //o 2 )(r 8 -Xo[(0 2 +<T 2 ) 2 

— 2/o^VfaH-* 2 )]—tftfpTriM 11 > (A8) 

with the coupling constant /o given by 
_ JWo/(2m 0 ). (A9) 

u C. N. Yang, Phys. Rev. 77, 242 (1950). 

16 Our results do not contradict those of Bell and Jackiw, but 
rather complement them. The main point of Bell and Jackiw Is 
that the a model interpreted in the conventional way, does not 
satisfy the requirements of PCAC. Bell and Jackiw modify the 
cr model in such a way as to restore PCAC. We, on the other hand, 
stay within the conventional cr model, and try to systematize and 
exploit the PCAC breakdown. 
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The axial-vector current is 

r d 

j/(x)=4f(x)y M y^(x)+2\ <r(x)—4>(x) 

L dx» 

d "I d 

-<f>(x) - a (x) -/o" 1 - <f>(x ), (A10) 

dx» J dx* 

and the divergence of the axial-vector current, as 
calculated by formal use of the equations of motion, is 

d 2 

— j»( x ) = —<t>(x)- (All) 

dXp / 0 

This is, of course, the usual operator PCAC equation. 

The paradox noted by Bell and Jackiw is obtained by 
applying Eq. (A11) to the calculation of 7r° —» 2y decay. 
Let us concentrate first on the left-hand side of Eq. 
(All). The matrix element Sflof the axial-vector 
current between the vacuum and a state with two 
photons has the following general structure, imposed by 
the requirements of Lorentz invariance, gauge in¬ 
variance, and Bose statistics [cf. Eq. (17)]: 


ef&Sawtyifs 2 ) > 

= Cl^i r eT(rp/*“f“^ , 2^2 T CT(rpAi“f“6^a^lp^l^2 T C{T(r^ 
+C4^2p^l^2 T C{T(TM“l"^5^1(r^l^2 r e{TpM 

+ C8^2«r^l^2 T e{rpM, 
Cl=^l'^2C3 + ^2 2 C4 , 

C2 = ^1 2 C5+^1*^2C8 , 

Ci(ki,ki) = — Ca(k2,ki), 

Ci(kl,k2) = —Cs(&2i&l) • 

As in Eq. (17), k\ and k 2 denote the photon four- 
momenta. The matrix element of the divergence of the 
axial-vector current is proportional to (^rf^)^*., 
and a straightforward algebraic rearrangement* using 
Eq. (A 12) shows that 

(k i“bkz) M ei a e 2 p S a pn (k ijfe) | ft iW 2 »=o 

= 2 (Pt~ C8)(^l+^2) 2 ^1^2 T ei* r e2 p e{T(rp' (A 13) 

Thus, if we write the matrix element for 7r°—»2y in 
the form 

Sm(7T°-» 2y) = kiik2 r e 1 *e2‘ > e i r* fi F, (A 14) 

then Eqs. (All) and (A13) tell us that in the <r model 
(or any other PCAC model), F vanishes when the pion 
mass (Jfei+Jh) 2 Is extrapolated to zero. This statement, 
of course, must hold in each order of perturbation 
theory. So let us check by calculating SflZ(7r°—» 2y) 
directly in the <r model in lowest-order perturbation 
theory, where the only diagram which contributes is the 
pseudoscalar coupling triangle diagram (i.e., Fig. 2 with 
Y m Y 6 replaced by the pion-nucleon coupling igoyo)* We 


find, comparing with Eqs. (19) and (20), that 


-fao 1 


20 T (9T® —* 2 Y)lowest order— Igo^l* £2 p Rtrp 

(2 


=k^k2 T ^i a ^ p e iTir p(2a^/rr)gomJoo(ki,k2 ), (A15) 


so that 


2ao 

Flowest order='- gonial oo(k\,k 2 ) | fc 1 s ™fc 2 l = 0 . 

7T 


(A 16) 


Setting (£i+£ 2 ) 2 =0 then gives 


ao go 


Floweet order| -, (A17) 

7T W 0 

which does not vanish, contradicting the conclusion 
obtained indirectly from PCAC. The nonzero value of 
Eq. (A17) is the paradox of Bell and Jackiw. 

Bell and Jackiw attempt to circumvent this contradic¬ 
tion by introducing a regulator nucleon field which is 
quantized with commutators rather than anticom¬ 
mutators. The coupling of the regulator field to the 
mesons is described by the interaction Lagrangian 
density 

$ig\(<r+i*faf>)'l'u (A 18) 

to maintain the PCAC equation the regulator coupling 
and mass must satisfy the relation 


gi/m=go/tno - 


(A 19) 


Thus, as the regulator mass approaches infinity, the 
regulator coupling to the mesons becomes infinite as 
well. As a consequence, even in the limit of infinite 
regulator mass the regulator field triangle diagram 
makes a contribution to the amplitude for tt° —» 2y 
decay, 

“0 gi oto go 

^regulator triangle diagram —- = - . (A20) 

7T Wi 7T Wo 


The total amplitude is the sum of Eqs. (A16) and (A20), 
and does vanish at (^ 1 +^ 2 )*= 0, in accord with the 
PCAC prediction. 

Unfortunately, however, the regulator procedure of 
Bell and Jackiw leads to grave difficulties when we 
turn to purely strong interaction phenomena. Let us, 
in particular, consider the regulator loop contribution 
to the scattering of 2 n <r particles. In the limit of large 
regulator mass, this loop is proportional to 


gi 2n f d 4 r Tr j f —-— 1 }«»!*(—) , (A21) 

J Lr— WiJ J \wi/ 


and thus, on account of Eq. (A19), becomes infinite as 
wi—» co. This means that the regulator procedure of 
Bell and Jackiw introduces unrenormalizable infinities 
into the strong interactions in the <x model, and therefore 
is not satisfactory. 
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We now suggest a different resolution of the paradox, 
utilizing the ideas developed in the text. 16 As we saw, 
when triangle graphs are present we cannot naively use 
the equations of motion to calculate the divergence of 
the axial-vector current. Rather, we must infer the 
correct divergence equation from perturbation theory, 
which tells us that the extra term of Eq. (30) is present 
In the a-model, the efFect of this extra term is to replace 
Eq. (All) by 


Let us now make the standard PCAC assumption that 
F is slowly varying as the pion mass ( ki+kz ) 2 is varied 
from m 2 to 0, so that we can use Eq. (A26) for the 
physical ir°-decay matrix element. We also replace 
g T (0) by the on-shell coupling constant g r . Using the 
physical values for g r , gA, 17 we find for the pion 

lifetime 

r -i= (ju 3 /647r)7 f ’ 2 = 9.7 eV, (A27) 


d Mo 2 ao 

—i, B 0*)—-*(*)+— Fi*F'>e iarp . (A22) 

dx p /o 4 tt 

In other words, the PCAC equation must be modified 
in the presence of electromagnetic interactions . As a 
result, the argument leading to the conclusion that F 
vanishes at (JferHfe2) 2 =0 must be modified. As before, 
we conclude that the matrix element of the left-hand 
side of Eq. (A22) between vacuum and two photons 
vanishes at (£i+£2) 2 =0. But instead of implying that 
£nx(^°—» 2y) vanishes, this now tells us that 

£nx(^°—» 2y) = Z 3 “ 1/2 Xmatrix element of (ju 2 <f>) 

= —m 2 (/o/mo 2 )Z3~ 1/2 X matrix element 

of ^(ao/47r)E^ <r i ?Tp e{ ffTp ] 

M 2 /a g 0 \ 

=- Zr m [ - )k 1 ik2 r efe 2 ^ irafi ; (A23) 

Mo 2 \ v 

in other words, 

M 2 / a go \ 

F\ =—Z%M -) . (A24) 

Ho 2 \ it mo/ 


[In Eqs. (A23) and (A24), Zz is the ir° wave-function 
renormalization constant.] To lowest order in perturba¬ 
tion theory, Eq. (A24) agrees with Eq. (A17), so our 
modified PCAC equation leads to no paradox. In 
addition, Eq. (A22) yields a bonus: From the derivation 
of Eq. (A24) it is clear that Eq. (A24) is not just a 
lowest-order perturbation theory result, but in fact is an 
exact statement in the <r model. We can reexpress 
Eq. (A24) in terms of physical quantities using the 
equation 16 


*LlL Zr wJ®L±- 

nt 0 juo 2 ™n gJi 


(A25) 


where m N> g r ( 0), gA are, respectively, the renormalized 
nucleon mass, the renormalized pion-nucleon coupling 
constant (evaluated at pion mass zero), and the nucleon 
axial-vector coupling constant in the <r model. Thus 
Eq. (A24) becomes 

a gr(fl) 

F | (fc 1+ fc 2 )*=o=-• (A26) 

_ v tn N gA 

16 M. Gell-Mann and M. Ldvy, Nuovo Cimento 16, 705 (1960). 


in good agreement with the experimental value 18 

Texpf^ (1.12d=0.22)X10“ sec- 1 

= (7.37=tl.5) eV. (A28) 

So we see that the <r model, as interpreted with Eq. 
(A22), gives a reasonable account of ir° —» 2y decay. 19 
This also makes it clear that the use of regulators to 
cancel away the triangle graph contribution to F up to 
terms of order u 2 /^n 2 will tend to give much too small 
a value for the ir°—» 2y matrix element. 

The above ideas are readily extended to other field 
theoretical models, and hopefully, to the physical 
axial-vector current as well. Let $z 5X be the third 
component of the axial-vector octet. (It corresponds 
to in the model discussed above.) Let us suppose 
that the world is really described by a field theory, and 
that there are only spin-0 or spin-! elementary fields. 20 
We then make the following two assumptions: 

(i) The usual PCAC equation, 
d 

—JF3 6X =CVm 2 &t0, C T =m N gA/g T (0) , (A29) 

d# x 


17 We take g r »13.4, gx«l,18. If we used gA** 1.24, then we 

would get r - 1 =8.9 eV. We can also evaluate Eq. (A26) by using 
the relation g r ( 0 )/ {tmtgA) /*, with /, the charged-pion 

decay amplitude and ju + the charged-pion mass. (See S. L. Adler 
and R. F. Dashen, Ref. 11, pp. 41-45.) This gives F| (ki+krf-o 
= — (a/aOvSji+V/,. Using the experimental value /,«0.96 ju+ 3 , 
we find from Eq. (A27) that r -1 = 7.4 eV. 

18 A. H. Rosenfeld el al. r Rev. Mod. Phys. 40, 77 (1968). 

19 Comparing Eqs. (A26) and (A17), we see that apart from a 
factor of gA~* f our PCAC expression for the 7 r° lifetime is the same 
as the expression obtained from the pseudoscalar coupling triangle 
graph if one uses the physical nucleon mass and pion-nucleon 
coupling rather than the bare mass and coupling appearing in 
Eq. (A17). That the triangle graph, evaluated using physical 
quantities, gives a good value for n* —> 2y decay has been noted by 
J, Steinberger, Phys. Rev. 76, 1180 (1949); and J. Steinberger 
(private communication). 

20 This assumption is not strictly necessary for the calculation 
of the t 0 —»■ 2y rate. If there is also a single elementary neutral 
vector-meson field B\ then there will be an additional term in 
Eq. (A30) proportional to F^dB T /dx fi ^ ffTfi . However, because the 
gauge-invariant coupling of a massive vector boson to a physical 
photon vanishes [G. T. Feldman and P. T. Matthews, Phys. Rev. 
132, 823 (1963)], this term makes no contribution to the physical 
n 0 —> 2y decay. In general, there will be no change in the ^—>27 
prediction if only isoscalar vector mesons or only isovector vector 
mesons are present. If both isoscalar and isovector vector meson? 
are present, there will be additional terms Kke dB*(I=l)/dx& 
dB r (l~0)/dx fi e^Tfi7 which do affect the v°—> 2y prediction. 
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should, on account of triangle graphs, be replaced by 

d ao 

- (F 3 s "=CA.+5-(A30) 

dx x 4 tt 

with S a constant . 21 

(ii) If SF 8 5X is expressed in terms of the elementary 
fields by 

3 : 3 6 X = 5 Z^/y x 7 Vy+ m eson terms, (A31) 

9 

3 

then S is given by 

S=Hg&\ (A32) 

1 

where the charge of the Jth fermion is Qje o. Equation 
(A32) means that we count only triangle graphs of the 
elementary fermions, but do not include triangles 
involving nonelementary bound states. It may be 
possible to decide in model calculations whether this 
rule, which we conjecture, is really correct. 

Using Eq. (A30) to calculate the ir °—>27 matrix 
element then gives 

— (< */Tr)2S(g r /mNgA ). (A33) 

The experimentally measured ir° lifetime corresponds 22 
to | S | =0.44; for comparison, S in the <r model is 
|1 2 — 20 2 = 2 > while S in the quark model is 2 (f ) 2 

More generally, in any triplet model in 
which the electromagnetic current is a U -spin singlet, 
the triplet charges will be (Q P ,Qn,Qx)= (Q> (5—1, Q — 1) 
and we have S= JQ 2 —£(Q— 1) 2= Q — b That is, in 
triplet models we have S=(Q) av , where (Q) flV is the 
average charge of the triplet particles taking part in 
both the AS=0 weak V—A current and the |AS| = 1 
weak V—A current. This means that the condition 
(Q)ftv= — I, necessary 23 for the radiative corrections to 
the AS=0 and |AS| = 1 weak currents to be finite, 
also predicts a ir° —» 2y rate in good accord with 
experiment . 24 


81 In Eq. (A30), 4 > w 0 does not necessarily mean a canonical pion 
field, but only a suitable interpolating field for the pion. For 
example, in the quark model, would be proportional to J/y^rnp. 
The separation of dx3 r * 6X into two terms in Eq. (A30) is made 
unique by the requirement that <f > w 0 and the photon field be 

dynamically independent , in the sense that \jj>*<hA x] = \j>ra,A x]=0 
at equal times. 

22 If we use instead of Eq. (A33) the formula (a/ir)(2S) 
X (v2ju + V/,), as in Ref. 17, then the experimentally measured tt® 
lifetime gives \S\ =0.50. 

“ N. Cabibbo, L. Maiani, and G. Preparata, Phys. Letters 25 B, 
132 (1967); K. Johnson, F. Low, and H. Suura, Phys. Rev. 
Letters 18, 224 (1967). 

24 This result was noted previously, in the context of the vector 
dominance model, by N. Cabibbo, L. Maiani, and G. Preparata, 
Phys. Letters 25B, 31 (1967). 


The two-photon decay r)—*2y can be treated in a 
similar manner. The analog of Eq. (A30) for 9 : 8 6X is 

d 1 ao 

— S— (A34) 
dx x \3 4 7 t 

where S is the same constant as in Eq. (A30) and where 
the factor 3~ 112 appears because the electromagnetic 
current is a &-spin singlet . 26 If there were no ij — X° 
mixing, then fa would be the ij field; in the presence of 
mixing, fa would be a mixture of the ij and X° fields. 
In the SUz limit, one has, of course, C n =C r . To get a 
prediction for the rate from Eq. (A34), we 

sandwich Eq. (A34) between the ij state and a two- 
photon state and make the following three approxima¬ 
tions: (i) We neglect rj—X° mixing; (ii) we takeC,,= C,r; 
(iii) we ngelect the left-hand side of Eq. (A34), which 
makes a contribution of order /x , 2 [equivalently, we 
assume that the exact prediction E,(ju 1 I 2 = 0 )= — (a/V) 
X(25/\3)(l/C,,) can be smoothly extrapolated from 
Mi! 2 =0 to the physical rj mass]. These approximations 
give the standard SU 3 prediction 26 

2 7 ) = ^ WM)T(ir°-» 2y)= (165±34) eV, (A35) 

about a factor of 8 smaller than the experimental 
value of 

2?)= (1210±260) eV. (A36) 

In view of the approximations made, the discrepancy is 
not too disturbing; in particular, the terms of order fi v z 
are by no means negligible, and could easily make a 
contribution to the ij—» 27 matrix element as important 
as the S/y/S term which we have retained. 27 

26 The correctness of the factor 1/vJ is easily verified in the 
triplet model. 

28 The factor (ja ,//*)* comes from phase space. 

27 We discuss briefly two other electromagnetic decays to which 
current algebra methods have been applied: w —> ifiy and t? —> 37 t. 
In the case of w —»ifiy it has been argued by D. G. Sutherland 
jNud. Phys. B2, 433 (1967)] that the usual PCAC equation 
.Eq. (All)] implies vanishing of the decay amplitude at zero 
our-momentum. This conclusion, however, is erroneous, and 

results from the use by Sutherland of an insufficiently general form 
for the axial-vector-current-vector-meson-photon vertex. The 
most general such vertex is given by Eq. (A12); an examination of 
the reasoning leading to Eq. (A13) shows that Eq. (A13) is vaKd 
only when £i a =£3 a =0. When one of the vectors is massive, as in 
the case of w decay, we find instead that 

(kiA-k*)**!*tfSffpnikijkz) [ 

= CC*+G«“£(Cj+C«)]£2 2 £l^2 r *l r *2 P «{Tffp?^0, 

contradicting Sutherland’s conclusion. This equation also means 
that our modified PCAC prediction for w 0 —► 2y will be altered 
when one of the photons is virtual, as is the case in the Primakoff 
effect. 

In the decay n —» 3r, the only point which we wish to make is 
that the triangle graphs which we have considered (involving 
either photons or strongly interacting vector mesons) cannot alter 
the usual PCAC predictions. The reason is the presence in all 
matrix elements coming from our extra term of the factor k\^k% T 
X which vanishes at zero four-momentum for the 

axial-vector vertex. (In the v° —» 2y case we were always talking 
about the matrix element left after removal of this factor.) 
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TT 


DECAY 


Stephen L. Adler 
Institute for Advanced Study 
Princeton, New Jersey 


I wish to describe some recent theoretical work on tt ° ^ 2 Y 
decay, which helps to resolve puzzling questions which have arisen 
over the years, and which may shed light on the nature of possible 
fundamental constituents of matter, such as quarks. Purely kine¬ 
matic considerations tell us that the matrix element and the decay 
rate for this process are 

-» Zy) = k k € € P € F , 

L 1 Z 5T(Jp 


1 


-1 3 2 

t = (p. / 64 tt)F 


( 1 ) 


with (k^, c^), (^2 ,€ 2^ momen ^ um an d polarization four-vectors 
of the two photons, \i the pion mass, and F an intrinsic coupling 
constant. The job for the theorist, of course, is to try to calcu¬ 
late F. An important step towards this goal was taken in 1949 by 
Steinberger, ^ who considered a model in which the tt^ dissociates 
(via pseudoscalar coupling) into a proton-antiproton pair, which 
emit the two photons and then annihilate. In lowest order pertur¬ 
bation theory there are only two Feynman diagrams, the triangle 
diagram in Fig. la and the corresponding diagram with the two 
photons interchanged. Although this diagram appears to be linear¬ 
ly divergent, the presence of the y^ in the Fermion trace causes 
all divergent terms to vanish identically, and a straightforward 
calculation gives (neglecting small terms of order p^/m 2) 


F 




a 


g. 


tt m 


( 2 ) 


N 
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Fig. 1(a) Triangle diagram with pseudoscalar coupling, (b) Tri¬ 
angle diagram with pseudovector coupling, (c) A virtual meson 
correction to the triangle diagram. 


a = fine structure constant = e / 4tt ~ 1/137 
g = pion-nucleon coupling constant ~ 1 3. 6 
rn T = nucleon mass . 

N 


-1 


Substituting Eq. (2) into Eq. (1), one finds a decay rate r ~14 eV, 

in fairly good agreement with the experimental rate t”^ , = 

7 6 6 iI i v expt 

(1. 12 ± 0. 22) x 10 ib sec' 1 = (7. 37 ± 1. 5) eV. That such a naive cal¬ 
culation should work so well is, in fact, rather puzzling, since we 
know that Eq. (1) is just the first term in ? power series in the 
strong coupling g r , and there is no obvious reason why one should 
be able to get away with the neglect of all of the higher terms. 


A second puzzle also emerged from Steinberger' s calculation. 

In addition to calculating tt^ -> 2y decay using pseudoscalar 
coupling, Steinberger also repeated the calculation with pseudo¬ 
vector (axial-vector) coupling, by evaluating the diagram shown in 
Fig. lb. This diagram _is actually linearly divergent, and must be 
evaluated by regulator techniques to insure photon gauge invariance. 
On the basis of the pseudoscalar-pseudovector equivalence theorem, 
one would expect Fig. lb to give the same rate as Fig. la, but actu¬ 
al calculation shows that Fig. lb gives a tt^ Zy amplitude F 
smaller than that of Eq. (2) by a factor p^/6m^^. In the limit of 
zero pion mass, the pseudovector amplitude F actually vanishesl 


During the last ten years, extensive and very successful calcu¬ 
lations on soft pion emission have been done using the partially- 
conserved axial-vector current (PCAC) hypothesis. This hypothe¬ 
sis states that, apart from certain equal-time commutator terms 
(which do not enter into our problem), soft pions behave as if they 
were coupled to nucleons by pseudovector , rather than pseudo- 
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scalar, coupling. When we turn to Steinberger's calculation, 

PCAC thus leads us to the troublesome conclusion that the answer 
F ~ 0 should be chosen over the numerically reasonable answer 
for F given by Eq. (2)1 This conclusion is independent of the per¬ 
turbation theory model used by Steinberger, and is easily derived 
in a completely general fashion.^ All we need do is to sandwich 
the PCAC equation 

VS -«„'■*>* 0 ■ (3) 

TT 


f = charged pion decay amplitude 


TT 


between the two photon state < ^(k^, c )y(k^, €^)| an d the vacuum 
| 0 >. The matrix element of the right-hand side of Eq. (3) is pro¬ 
portional to ^(tt^ —> 2y), while a purely kinematic analysis of 
< \(k , ^ ^ > shows that the matrix element of the 

left-hand sicfe oFEq.(3) is proportional to k^k^ T ej’'°' € 2^ € ^Tcrp * 
(k^k )^. Thus, in a model-independent way, PCAC predicts 
F oc fkj+k 2 ) = , just as was found from the pseudovector tri¬ 

angle graph in perturbation theory. 


To summarize, the theory of 
three puzzles: 


tt^ decay presents the following 


(1) Naive calculation using the lowest order pseudoscalar coupling 
triangle diagram, and neglecting possible strong interaction re¬ 
normalization effects, gives a surprisingly good result. 

(2) The pseudoscalar-pseudovector equivalence theorem breaks 
down. Pseudovector coupling predicts that tt^ -* Zy decay is 
strongly suppressed. 

(3) PCAC implies, in a model independent way, that tt^ -* Zy de¬ 
cay is strongly suppressed. 

3 

Recent theoretical work, which I will now briefly describe, 
has helped to resolve these puzzles. The key observation is that 
when very singular diagrams (e.g. triangle diagrams) are present, 
formal field-theory results such as Ward identities, the pseudo¬ 
scalar-pseudovector equivalence theorem, and the PCAC equation 
itself, break down. Consider, for example, the pseudoscalar- 
pseudovector equivalence theorem, which asserts that the dia¬ 
grams of Figs, la and lb should give identical results. The theo¬ 
rem is formally derived by taking the vacuum to two photon ma¬ 
trix element of the divergence equation 


R17 


257 


650 


Vll - PROPERTIES OF PlONS AND MUONS 


(g r /2im N )3 x j 5X - g r j 5 ; 

j 5X = +\ X . j 5 = 4^,- + • 


(4) 


[We can neglect meson terms in Eq. (4) because no virtual mesons 
appear in Figs, la and lb.] The matrix element of the right-hand 
side of Eq. (4) corresponds to Fig. la, while the matrix element 
of the left-hand side of Eq. (4) corresponds to Fig- lb, and Eq. (4) 
asserts that they should be equal. This formal derivation breaks 
down because the local product of operators is singu¬ 

lar, and the naive manipulations of equations of motion which lead 
to Eq. (4) are incorrect. The correct answer can be obtained by 
regarding the axial-vector current and the pseudoscalar current 
as limits of nonlocal, gauge-invariant currents, ^ 

X + € 

- ie J d£ ■ A (£. ) 

X- € 


j^\x) = lim 4 j (x+€)*y^ l *y c) exp 

€ -> 0 


j"*(x) = lim + exp 

€ -> 0 b 


X+€ 

-ie J d£ 

x- e 


A(l ) 


(5) 


A - electromagnetic field, 


giving, after a careful calculation 

o. J = 2im_ T j + (al 4tt)F F e t , 
\ N &o-rp 


F - ' 0 ' = Q^A^ - Q^A 0 ” = electromagnetic field-strength tensor . 


( 6 ) 


The matrix element of the extra term in Eq. (6) precisely accounts 
for the difference between Figs, la and lb as calculated by Stein- 
bergerl An alternative procedure"^ is to directly calculate the dif¬ 
ference of Figs, la and lb in momentum space. If One freely trans¬ 
lates the loop integration variables one "deduces" that the differ¬ 
ence is zero, but if one pays careful attention to the fact that in 
linearly divergent integrals the origin of integration cannot be 
freely shifted, one reproduces the right-hand side of Eq. (6). We 
see, then, that the pseudoscalar-pseudovector equivalence theorem 
breaks down for triangle diagrams because of the presence of 
singular (linearly divergent) integrals, and the breakdown is com¬ 
pactly summarized by Eq. (6). 


Clearly, the phenomenon which modifie s Eq. (6) will also affect 
the PCAC equation, Eq. (3). Let us consider a particular field- 
theoretic model, the cr-model of Gell-Mann and Levy. This model 
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consists of a neutron n and a proton p interacting with the pions 
(tt + , tt^ , tt" ) and with a scalar, isoscalar meson cr via SU^ ® SU 
symmetric couplings. In the absence of electromagnetism, Eq.(3) 
is satisfied as an operator identity, with <j> q the canonical pio^ 
field. In the presence of electromagnetism, the singularity 
triangle diagram changes Eq. (3) to read 


5\ 




8.= (f N 2)4» 0 + \{al 4^)F b F H e 
A. 3 TT ^ 9CTT 


(7) 


In other words, the PCAC equation for the neutral pion must be 
modified in the presence of electromagnetic interactions. Just as 
we did above, let us take the matrix element of Eq. (7) between 
the vacuum and the two-photon state. In the soft pion limit, the 
matrix element of the left-hand side makes no contribution, but in¬ 
stead of deducing that the tt^ -* 2y amplitude F vanishes, we now 
find that F is proportional to the matrix element of the extra 
term in Eq. (7), 7 

a_ \TZ |jl 


F 


(k 1+ k 2 ) - 0 


TT 


( 8 ) 


TT 


Because Eq. (8) has been derived without recourse to perturbation 
theory, it is an exact low energy theorem for tt^ decay . Using 
the experimental value f ~ 0. 96 and substituting Eq. (8) into 
Eq. (1), we find the decay rate t“^ =7.4 eV, in excellent agree¬ 
ment with experiment. It is interesting to compare Eq. (8) with 
Steinberger 1 s lowest order perturbation theory result [Eq. (2)] by 
using the Goldberger-Treiman relation, 


nT2 


J£ 


g. 


r*o 

r*o 


TT 


m N g A 


(9) 


g = nucleon axial-vector coupling constant ~ 1. 22, 


to rewrite Eq. (8) in the form 

F } 

1 (k +k r= o 


r*o 

r*o _ 


a 


g. 


tt m. 


N g A 


( 10 ) 


We see that the effects of higher orders of perturbation theory are 
entirely contained in the factor g^, which is numerically close to 
unity; as a result Steinberger 1 s calculation, which neglects the fac¬ 
tor g^, is a fairly good first approximation. 

We see, then, that the modified PCAC equation resolves the 
puzzles noted above. At the same time, however, some new 
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questions and problems are raised. Let us now consider these 
problems, as well as some of the experimental implications of Eq. 
(7). 

1. We have just gone through some rather subtle reasoning to avoid 
the prediction, following from the unmodified PCAC equation, that 
F is suppressed by a factor However, one can always 

ask how one knows that tt^ decay is not really a suppressed decay. 
One argument is the theoretical one, that with the extra term PCAC 
gives a good answer for tt^ decay, which means that without this 
term, the rate would be much too small to agree with experiment. 
There is also an interesting experimental test, ^ which strongly 
suggests that tt^ decay is not suppressed. To see this, let us re¬ 
turn to the suppression argument following Eq. (3), in the altered 
situation in which one of the photons is off-mass - shell, say k^ ^ 

0. Some simple kinematics shows that the vacuum to two photon 
matrix element of is now proportional to k^k 2 T e^* 

et rcr p[p.^+pk^], with p of order unity. We see that while the On- 
shell part of the amplitude is suppressed by a factor the off - 

shell dependence is not suppressed . Since the off-shell amplitude 
is measured in the reaction tt^ —* e + e"y, our suppression argu¬ 
ment predicts that the ki^ dependence of this process will have 

p p J- 

the form 1 + (p/p )k^ , which has a much larger slope than the 
form 1 + (p/m ^)k 2 expected in the absence of suppression of the 
tt^ —* Zy decay. A measurement of this slope has been reported 


by Devons et al. , 7 who find a matrix element 1 + a kj , a = 


(0. 01 ± 0. ll)/p 

suppression. 


Clearly, this is strong evidence against tt^ —* Zy 


2. The argument that Eq. (8) is an exact low energy theorem is not 
as simple as we have made it sound. To be sure that Eq. (8) is 
exact, we must be sure that strong interaction modifications of the 
triangle diagram, such as illustrated in Fig. lc, do not renormal¬ 
ize the extra term in Eq. (7). This can in fact be demonstrated, 

to any finite order of perturbation theory . The reason that virtu¬ 
al meson corrections do not modify Eq. (7) is that they always in¬ 
volve Fermion loops with more than three vertices (Fig. lc in¬ 
volves a 5-vertex loop), which satisfy normal Ward identities be¬ 
cause they are highly convergent. There is still the possibility 
that Eq. (7) is modified by nonperturbative effects, such as con¬ 
tributions from triangles involving bound states of the fundamental 
fields. Our neglect of possible nonperturbative modifications is 
pure assumption. 

3. The spectacularly good agreement of Eq. (8) with experiment is 
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somewhat fortuitous, ^oth because of the large error in the experi¬ 
mental ti-0 Zy rate and because of the usual 10-20 percent ex¬ 
trapolation error involved in PCAC arguments. For example, use 
of the Goldberger-Treiman relation to replace Eq. (8) by Eq. (10) 
alters the theoretical prediction by 20 percent, to t" 1 = 9-1 eV. 

4. The constant \ appearing in front of the term (a/4Tr)F^ (r F r P x 
€ |o-rp ■^ c l* arises from our particular choice of fundamental 
Fermion fields, and differs in different field-theoretic models. 
Quite generally, if i s expressed in terms of fundamental 

fields by 


J 

then the modified PCAC reads 

5\ 


— X . 

3r ^ - E g.MJ.y y y. + meson terms 

-> i J J 5 J 


(ID 


9= (f /n/ 2)4> 0 + S(ci'/4iT)F^ cr F TP G t 
A. 3 TT c,crT 


( 12 ) 


S - ? g.Q. , 

J J J 

where the charge of the fermion is Qj. All we are doing, of 
course, is adding up the contributions of the triangle diagrams in¬ 
volving the various Fermions. The tt^ —* Zy low energy theorem 
derived from Eq. (12) is 


F 


a \T2 


Ul 


2S 


r*o _ 


a 


g. 


1 


( k l+ k 2 ) = 0 


TT 


TT 


tt rri g A 
N 6 A 


2S 


(13) 


which reduces to our previous result when S = 


The comparison which we have made above with the experi¬ 
mental tt^ decay rate tells us that | S| ~ 0. 5, but does not de¬ 
termine the sign of S. However, there are a number of different 
ways of determining the sign of S, all of which, fortunately, seem 
to agree! The first method is by analysis of tt + -* e+vy decay, 
the vector part of which is related by CVC to F and the axial-vec¬ 
tor part of which can be estimated by hard pion techniques. Using 
the experimentally measured vector to axial-vector ratio for this 

o 

process, Okubo° finds that S is positive. A second method is to 
make use of forward tt^ photoproduction, where one can observe 
the interference between the Primakoff amplitude (which is pro¬ 
portional to F) and the forward purely strong interaction amplitude. 
The sign of the latter can be determined by finite energy sum rules 
from the known sign of the photoproduction amplitude in the (3, 3) 
resonance region; the analysis has been carried out by Gilman, ^ 
who finds S positive. A third method consists of comparing Eq. 
(13) with an approximate expression for the tt^ —* 2y amplitude de- 
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rived by Goldberger and Treiman^ (as corrected by Pagels^). 
These authors applied a pole dominance argument to proton Comp¬ 
ton scattering dispersion relations, obtaining the relation 


r*o 

r*o 


- 4ttqt 


K 

_E 


m 


(14) 


N 


K - proton anomalous magnetic moment = 1. 79, 

^ 0 

which gives a tt —* 2y rate of 2.0 eV, in fair agreement with ex¬ 
periment. Comparison of Eq. (14) with Eq. (13) again gives S posi¬ 
tive. A fourth method which has been proposed^ is to use Compton 
scattering data on protons to try to measure the interference of the 
pion exchange piece (proportional to F) with the nucleon and nucleon 
isobar exchange pieces. The problem with this proposal^ is that 
one does not know whether to take the pion exchange piece in its 
Born approximation form, tF/(t-p^), or in the polology form, 
p^F/(t-p^). Since t is negative in the physical region, this un¬ 
certainty leads to a sign ambiguity and renders the method dubious. 
In any case, with fair certainty one learns from the first three me¬ 
thods that S is positive. 


Armed with the experimental knowledge that S = + 0. 5, we can 
now use Eq. (12) to test various models of the hadrons which have 
been proposed. One very popular model is the triplet model , con¬ 
sisting of an SU^-triplet of Fermions (p, n, X) interacting by meson 
exchange. The charges of (p,n ,\) are (Q,Q-1,Q-1) and the corres¬ 
ponding axial-vector couplings are (g , g , g^) = (j, -2,0). One im¬ 
mediately finds S = j Q - ^(Q-l)^ = Q - and so S = \ requires 
Q = 1 [i. e. , integral triplet charges (1, 0, 0)]. Note that the frac¬ 
tionally charged quark model has Q =2/3, S = 1/6, and so the 
quark hypothesis is strongly excluded. Another integrally charged 
triplet model which is allowed is the Han-Nambu-Tavkhelidze^ 
model, which has three triplets, S, U, B, with respective charges 
(1,0,0), (1,0, 0), (0, -1, -1) and with axial-vector couplings (\ t - j, 0) 
for each triplet. 


5. The ideas which we have developed can also be applied to the 
r\ -> 2y and the X^ -* 2y decays.^ Unfortunately, the experi¬ 
mental situation here is worse, and the theoretical situation is also 
worse, because the soft r\ and soft X^ approximations involve a 
much larger extrapolation from the physical region than does the 
soft pion approximation. Nonetheless, pursuing this track, Gla¬ 
show et al. ^ find a connection between the tt^ —* 2y, rj -» 2y and 
—> 2y decay rates, which predicts 


262 


Adventures in Theoretical Physics 


S. L. ADLER 655 

t -* Zy) ~ 350 keV quark model (Q - -j) , 

~ 120 keV integrally charged (Q = ± 1) 

triplet model . (15) 

Present experiments do not distinguish between the two alterna¬ 
tives in Eq. (15). 
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DISCUSSION 

A. Par : Recent measurements, by the Primakoff effect, of the tt° decay 
rate give t _ 1 = 11.2 eV (± 10%). S,L»A. : Using g 2 /4 tt = 14.6, the 
theoretical estimate is in the range 7.4 - 9.1 eV, but as indicated 
there is ^ 20% uncertainty in the PCAC extrapolation. Incidentally, 
the new width is in still poorer agreement with the fractionally- 
charged quark model. V. Telegdi : Would one expect to see, in 
n-decay, evidence for a non E.M. isospin symmetry breaking inter¬ 
action? S t L t A, : One can invent such an interaction to explain the 
3tt decay of the n. The extra terms I discussed will not affect this 
decay mode. 
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We consider matrix elements of the axial-vector current in spinor electrodynamics, and develop the 
change in the usual reduction formalism caused by the presence of the axial-vector-current-two-photon 
triangle diagram. When at most one photon is reduced in from the external states, we are able to charac¬ 
terize the anomalous behavior of the triangle diagram entirely in terms of a consistent set of anomalous 
field-current and current-current commutators. 


I T has recently been shown 1 that the axial-vector 
current in spinor electrodynamics does not satisfy 
the usual divergence equation 

B*i*(j*) =2im 0 j 6 (x ), 

where 

j„ B (x) =$(x)y M y 5 \f'(x ), f(x) =i(x)y$(x ), (1) 

expected from naive use of the equations of motion. 
Rather, because of the presence of the triangle diagram 
shown in Fig. 1, the axial-vector current satisfies the 
anomalous divergence condition 

B*i*K*) =2imof(x) + (ao/4Tr)Fi*(x)F"(x) , (2) 

with F* 9 the unrenormalized electromagnetic field- 
strength tensor. Because radiative corrections to the 
basic triangle diagram (Fig. 2) involve axial-vector 
loops with at least five vertices, and because these larger 
loops satisfy the usual axial-vector Ward identity, 
Eq. (2) is an exact equation, valid to all orders in 
perturbation theory. 2 

In the present paper we explore further consequences 
of the singular behavior of the triangle diagram in 
spinor electrodynamics. Although the anomalous 
divergence phenomenon appears in all matrix elements 
of the axial-vector current, we will consider explicitly 
only the axial-vector-current-two-photon matrix ele¬ 
ment which is described in lowest 

order by the graph of Fig. 1. (Here J^, k 2 and e l3 e 2 
denote, respectively, the four-momenta and polariza¬ 
tions of the two photons.) First, we develop the reduc¬ 
tion formalism for the triangle graph. When one photon 
is reduced in, we are able to characterize the anomalous 

1 S. L. Adler, Phys. Rev. 177, 2426 (1969). This paper will 
hereafter be referred to as I. See also J. Schwinger, ibid. 82, 664 
(1951), Sec. V; C. R. Hagen, ibid. 177, 2622 (1969); R. Jackiw 
aud it. Johnson, ibid. 182, 1457 (1969); B. Zumino (unpublished). 
As in I, we use the notation and metric conventions of J. D. 
Bjorken and S. D. Drell, Relativistic Quantum Fields (McGraw- 
Hill Book Co., New York, 1965), pp. 377-390. In particular, we 
use «oim=« 1S8, = 1. 

8 S. L. Adler and W. A. Bardeen, Phys. Rev. 182, 1515 (1969). 
Note that the anomalous divergence term can be rewritten in 
terms of finite quantities as (a/4r)F P t'Ff- r '«f*r» where FM is the 
renormalized electromagnetic field-strength tensor. 
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behavior of the triangle graph entirely in terms of 
anomalous commutators of the electromagnetic field 
with the axial-vector current (“seagulls”) and of the 
electromagnetic current with the axial-vector current 
(“Schwinger terms”). We check that the various com¬ 
mutators which we obtain are consistent with each 
other, with the equations of motion, and with the 
electromagnetic-field canonical commutation relations. 
These formal considerations indicate that the equations 
obtained from explicit study of the matrix element 
{ 0 \jp 6 \k u e 1 ;k 2 ,ei) can be applied unchanged to the 
matrix element {A\j^\B) y with A and B arbitrary, 
when at most one photon is reduced in from the external 
states. Using the anomalous commutation relations, 
we complete the heuristic verification that the quantity 
Q* introduced in I is the chiral generator in massless 
electrodynamics. Finally, we show that when both 
photons are pulled in, one cannot represent the triangle 
graph by a reduction formula containing a time-ordered 
product with the usual properties. 

To study the reduction formula for the triangle graph 
with one photon pulled in, we use the equation 3 

(0| Ji» 6 (0) ^2,e2)C(27r) 3 2^io(27r) 3 2^2o] 1/2 

= —hi 9 j d*xe~ ikl ' x 

XD,<01 T(jy(0)A v (x)) |^e2)C(2T) 3 2^o] 1/2 

= —iexUt^ee?/(2ir) Ar \R apVl (kxyk 2 ), (3) 

where A a is the photon field and Ra pll (k u kt) is the 
explicit expression for the lowest-order triangle graph 
given in Eqs. (17) and (18) of I. Bringing inside the 
time-ordered product (using the usual rules 4 for 
differentiating time-ordered products), we find 

d*x 1 T(j*(f))A a(x))\k it e 2 ) 

_ =Ai iir kio+B p<r +Ci it ,(kio ), (4) 

8 Since in Eqs. (3)-(7) we work to lowest order only, we omit 
the wave-function renormalization factor from Eq. (3). 

4 S. L. Adler and R. F. Dashen, Current Algebras (W. A. 
Benjamin, Inc., New York, 1968), Eq. (2.7). 

1740 
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A | [[A «rW,jV 6 (0)] \kt,et ), 

Bpa=j d i xe Al ’ x 8(xo){0\[A a (x) t j^(0)2\k^}, 
Cfia(kio) =eo 01 T(jf(0) j a (x)) |£ 2 ,e 2 ), 


(5) 


A *(x)=—A a(x ), j ff (x) =$(x)y<$(x ). 
dxo 

Provided that the time-ordered product in Cp, is not 
too singular, in the limit as £i 0 —► «, the function io) 

has the Bjorken 6 -Johnson-Low 7 behavior 

m f 

Cp w (k io)=-/ dhc e <kl ' T b(dbj) 

kio J 

X<0|[>«,;V 6 ^ (6) 

indicating that the equal-time commutators 
D4„(z),.7„ 6 (0)], [i,(»:),;' j, 6 (0)], and Zj,(x), jV 6 (0)] are 
to be identified, respectively, with the parts of 2? w 
behaving like k X o, 1, and J^o" 1 as k x o becomes infinite. 
From Eqs. (17) and (18) of I, we find 

et p R*pr(k iM) 

= 47T 2 (£ 2 T e 2 * ^ 2 ^C 2 t ) {t T (rp#i“f"g(rOtOrp^ £p0€0r<i>i 

^iro)(^2(rCOrp^“f"^ 2 Ot(rrp^i) 

“f"^(ro(l &»o(l &io)kl , *iiT‘r}i 

+(terms which vanish when c-=0 or m=0)]} 

+O[(ln^ 0 )^io 2 ]. (7) 

Comparing Eq. (7) with Eqs. (5) and (6), we find the 
equal-time commutation relations 8 


[• a .(«),j' f . 6 (y)]=[^oW,jV 6 (y)]=o, 

[ir(a),;'o 6 (y)] =(-2 iao/ jt)5 s (x - y)B r (y), 
[_A r (x),j. 6 (y) ]= (ioto/ t)<5 3 0- y)t r “E‘(y), 
Zjo(x),jo 5 (y)l =(—ieo/2tr i )B(y) ■ V x 5 3 (x-y), 
[;'r(a),;o 6 (y)] =(—ieo/4ir J )[E(a;)X V y 5 3 (x-y)]’ 
[j'oW,j'. 6 (y)] =(te 0 /4ir ! )[E(y)X V x i 3 (x—y)]*, 


( 8 ) 


on ki 


Fig. 1. Axial-vector triangle diagram which 
leads to the extra term in Eq. (2). 


6 We have suppressed the dependence of A M , t .. , t C f 
and kt, 

•J. D. Bjorken, Phys. Rev. 148, 1467 (1968). 

7 K. Johnson and F. E. Low, Progr. Theoret. Phys. (Kyoto) 
Suppl. 37-38, 74 (1966). 

0 We remind the reader that since we have deduced the com¬ 
mutators of Eq. (8) from the triangle graph alone, without 
considering other graphs, we have not yet ruled out the presence 
of additional terms in the field-current or current-current commu¬ 
tators of higher order than at or £q, respectively. However, the 
consistency argument of Eqs. (23)-(30) below suggests that such 
terms, if they occur at all, are at worst Schwinger terms and sea¬ 
gulls of the usual type, which cancel against each other when 
vector or axial-vector divergences are taken. 



with 


5^)=[VXAW] ( =e r,( - A*(x ), 

daf 


E t (x) = -A‘(x) - A\x), 

dx* 

e 123 =l. 


(9) 


We have only listed the current-current commutators 
containing at least one time component, since these are 
the only ones which appear when divergences with 
respect to the vector or axial-vector indices (<r or fx) are 
brought inside the time-ordered product in Eq. (5). All 
of the nonvanishing commutators in Eq. (8) are 
anomalous in the sense that if they are calculated by 
naive use of canonical commutation relations they 
vanish. 

It is easy to check that the anomalous commutation 
relations of Eq. (8), together with the reduction 
formula of Eqs. (4) and (5), correctly reproduce the 
known divergence properties of the lowest-order 
triangle diagram. Consistent with our assumption that 
the time-ordered product C M , is not too singular, and 
obeys the Bjorken-Johnson-Low asymptotic formula, 
we use the usual formulas 4 for differentiation of the 
time-ordered product, 

VWM >(*))= WW(y)U*)) 

+ 5(y 0 -*°)[jo 6 (y) ,./„(*)], 
dx r TUS(y)j*(*))=T(j/(y)dx r jJx)) 

+Kx°-y°)Uo(x),u & (y)l- 

To check gauge invariance for the photon which has 
been reduced in, we multiply Eq. (4) by k x v . Using 
vector-current conservation (d <r J <r =0) and Eq. (10) to 
evaluate ki'C^k io), we find 

ksj d*x x <01 T(j, 6 (0)A „(*)) I *««) 


=£ 1 ' / d 4 ^e <kl * I 5(jf 0 )(0|[A, r (^),;V 6 (0)]|^ 2 ,€2) 


-ieo f «*»**«(ai®)<01 jV(0)] | . (11) 


W* W* 





Fig. 2. Typical second-order radiative corrections 
to the triangle diagram. 
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Using the commutators of Eq. (8), one can easily see 
that the right-hand side of Eq. (11) vanishes. To check 
the axial-vector divergence of the triangle, we multiply 
Eq. (4) by —Using the axial-vector-current 
divergence equation (2) and Eq. (10) to evaluate 
(^i+W^^io), we find 

-(*.+**)'■ [d^e-^oM 7XjV(DM,(*)) |fe,e 2 > 


= -ie„J d i xe~ iki ' x {0 1 T(£2im 0 j s (0) 

+(ao/4 1 r)F^(0)F-'(0)e { . r ,]>W) i *»,«> 

+/ d*x e' kl ' z 5(xo) 

+«« 0 < 0 1[>(*),Jo 6 (0)] ffe,es>} . (12a) 

Since we are only working to lowest order (order «o 2 ), 
the anomalous divergence term proportional to 
e^cxoF^F^e^ makes no contribution. However, the 
anomalous commutator terms in curly brackets may 
be evaluated from Eq. (8), and they give 

jd i xe*'-5(xo){ - (Ai+*») |1 <0 ] iA ,(*),;7(0)] [*,,«> 

= —^ p (e^/2ir i )ki i kt T ^ ffTP , (12b) 

When multiplied by € 1 % Eq. (12b) is identical with 
the matrix element (0| (ot^/^K)F^ a F TP ^ aTP \kx^x’,k 2 ^ 2 ) 
XC(27r) a 2^ 10 (27r) 8 2^ 2 oI| 1/2 which comes from the anoma¬ 
lous term in Eq. (2) if we calculate the divergence of 
( 0 |y„ 6 |£i,ei;£ 2 ,e 2 ) directly, before reducing in one 
photon. We see then that the reduction formula of 
Eqs. (4) and (5), combined with the anomalous com¬ 
mutators of Eq. (8), correctly characterizes the anoma¬ 
lous axial-vector index divergence of the triangle 
diagram. As Jackiw and Johnson 1 have particularly 
emphasized, in the reduction formula the anomalous 
divergence term rp arises from the failure of the 

“Schwinger term” p'^Jo 6 ] and the “seagull” [_A ff ,j P 6 2 
to cancel. {As a point of consistency, we note that the 
pseudoscalar-two-photon triangle R ap [defined in 
Eq. (19) of has the asymptotic behavior £ 2 )—>0 

as ^ 10 —»oo. Thus the usual equal-time commutation 
relations 

LA . (x), j 5 (y)J = [A,(x),/(y)J= 0 (13) 

remain valid, and no extra seagull terms are picked up 
when the one-photon reduction formula is applied to 
the matrix element (0\2itnoj 5 \k 1 ,e 1 ;k 2 ,e 2 ).} 

We proceed next to check whether the commutation 
relations of Eqs. (8) and (13) are formally consistent 


with each other, with the equations of motion, and with 

the usual electroniagnetic-field canonical commutation 

relations. In the Feynman gauge, the electromagnetic- 

field equations of motion and commutation relations are 

•« 

UA P —A p = £o jp j 

LA x (x) ,A"(y)2\ X M ,A’(y) ] U°-n° =o, (14) 
LA x (*),i'(y)]|.°-!, 0 = -ig x, S 3 (x—y). 

We also need the diveigance equations satisfied by the 
currents j P (x y t) and j p 6 (x,t), 

d 

-;o+V-j=0, 

a/ 

(15) 

d 

—;o 5 +V-j 5 =2m 0 ; 5 +(2a 0 /T)E-B, 
di 

with E and B given, of course, by Eq. (9). We proceed 
to combine Eqs. (14) and (15) with Eqs. (8) and (13). 
All the commutators which we write down are at equal 
time, with x°=y°=t. 

(i) From [_A v (p$>jif(yY]=Q> we deduce 9 

LA„(x),joKy)l+LA«(x),(a/dt)jo ! ‘(y)l=o. (16) 

On substituting Eq. (15) for ( d/dt)j 0 6 (y ) and using 
LA ,(x),f(y) ] = D4 ,(x) ,j‘(y) ]=0, we find 

[i.(») ) jo 6 (y)]= -D4»(*),(2a 0 /ir)E(y) - BOO], (17) 

Using the canonical commutation relations of Eq. (14), 
we then get 

[io(a),;o 6 (y)]=0, (18) 

[i r (z),Jo 6 (y)> (-2iao/ir)d\x-y)BXy), (19) 

in agreement with Eq. (8). 

(ii) From [A 0 (^),Jo 6 (y)]=0, we deduce 

[i 0 W,jo 6 (y)]+[ioW,(5/3/)jo 6 (y)]=0. (20) 

Substituting Eq. (15) for (d/dt)jtf(y) and Eq. (14) for 
Aq(x), and using the commutators [^oW,io 5 W] 
= [i„(a:),j 6 (y)] = [io(«),j 6 (y)]=0, we find 

[«oJo(*),Jo 6 (y)]= -[io(«),(2ao/ir)E(y) -B(y)] 

= (—2fa 0 /ir)B(y) • V x 5 3 (x —y), (21) 

that is, 

[jo(«),Jo 6 (y)] = (-*'«o/2T J )B(y) • Vi5 3 (x—y), (22) 

in accord with Eq. (8). 

(iii) From [i r (^) ) ;o 6 (y)]= ~(2ia 0 /ir)8\x-y)B r (y), 
we find 

LAr(x),jo & (y)l+LA r (x),(d/dt)joHy)l 

= ( — 2iao/ir)5 i (x — y )6 T (y) 

_ =(2^oA)5 3 (x-y)[V y XE(y)J. (23) 

9 We use here the method of D. G. Boulware and L. S. Brown, 
Phys. Rev. 156, 1724 (1967). 
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Substituting for A r (x) and (d/dl)jo 5 (y) as before, we find 

leoj\(x)JoKy)l-LA r (x),v 7 -i 5 (y)l 

= (2ia 0 / tt) 5 3 (x — y) [’Vj X E (y) ] r 

—[A r (^),(2a 0 A) E (y) • B(y)] 

=(—2iof 0 A)[E(^)XVy5 3 (x—y)] r . (24) 

Using Eq. (8) to evaluate [eoj r (x)Ja 5 (y)l an< ^ 
—[A r (a;),V* j 6 (y)], we see that Eq. (24) is satisfied. 

(iv) From tjo(x),jo *(y)] = - (ieo/2w*)B(y) ■ V x <5 3 (x- y), 
we find 

l(d/dt)jo(x)J 0 *(y)l+[j 0 (x)Xd/dt)jV(y)l 

= (-ie< i /2ir i )B(y)-V I 8*(x—y) 

= (ieo/2ir%VjX E (y)3- V x 5 3 (x-y). (25) 

Substituting Eq. (15) for (d/dt)jo(x) and (d/dt)jo 6 (x) 
gives 10 

-[Vx-jWJo^D-D'oWjVyjHy)] 

= ( ieo/2ir*)[y 7 X E(y)] ■ Vx8*(x— y ). (26) 

Using Eq. (8) to calculate the commutators on the 
left-hand side, we find that Eq. (26) is satisfied. 

(v) Finally, to check the consistency of quantization 
in the Feynman gauge, we must verify that 

iWo+V-A (27) 

and L remain dynamically independent of the axial- 
vector current. That is, we must verify that 

[L(*),j/(y)]«0 (28) 

and that 

[L(z),^(y)]=0. (29) 

Equation (28) follows immediately from the first line 
of Eq. (8). To check Eq. (29), we substitute Eq. (14) 
for Ao and use [ J 4o(^),y/(y)]=0, giving 

[£ (x) ,j„ 6 (y) ] = [>o;o (x) ,j^(y) ] 

+[V I -A(*),;7(y)]. (30) 

Substituting commutators from Eq. (8) then shows 

that the right-hand side of Eq. (30) vanishes. 

We conclude that the commutation relations of 
Eq. (8), which were obtained from the triangle graph 
in lowest-order perturbation theory, are consistent with 
the equations of motion and canonical commutation 
relations of Eqs. (14) and (15). Moreover, the fact that 
Eq. (19) for [4 r (a),; 0 6 (y)] and Eq. (22) for[j 0 (a:),jo 6 (y)] 
were deduced from simpler, exact commutators 11 and 

10 We have used Q/ 0 (:x;),E(y) ■ B(y)] =0, which follows from 
L/o(*),i! r (y)]:=0. Note that [;o(:r),.4 r (y)]—Ocan 
be derived from Q/’o (x) r (y) ]=0 and the divergence equation 
djo(x)/ Vx• j = 0, in the same way that we derived Eq. (19). 

r/K ?o““ ut ation relations D4 r (*),j/(y)A[X r (*) t j‘(y)] 
“ LAo(*),/M*(y) J“ L-doteL^Cy)]—0 can be proved to all orders in 
perturbation theory by the Bjorken-Johnson-Low method, using 
the Weinberg asymptotic rules discussed in Chap. 19 of Bjorken 


equations of motion 2 suggests that Eqs. (19) and (22) 
are themselves exact to all orders of perturbation 
theory. 12 The values given in Eq. (8) for [A r (%),j**(y)l, 
Jr(*),/o 5 (y)], and [/o(*),y. 6 (y)] cannot, on the other 
land, be deduced from the consistency argument of 
Eqs. (23)-(30). To see this, we note that Eqs. (24), 
(26), and (30) [as well as the reduction formulas (11) 
and (12)] are all unchanged if we modify these commu¬ 
tators to read 

iao 

ZA r (x),j'*(y)]=— 5 3 (x-y)e T4l £ e (y) 

T 

—fe 0 i 3 (x—y)5”(y), 

—ieo 

[y r (*) ,y o 5 (y) ] = — ;-[E(*) X v y 5 3 (x - y) ] r 

4tt j 

+*—[5 3 (x-y)S”(y)], (31) 
dy* 

ieo 

CM*) >y * 5 (y) ]=—[E(y) XV,i 3 (x-y)]' 


-i —[5 3 (x-y)S r *(y)], 
dx? 

with S T *(y) a pseudotensor operator. In other words, 
the consistency check of Eqs. (23)-(30) does not rule 
out the possibility that higher orders of perturbation 
theory may modify Eq. (8) by adding Schwinger terms 
and seagulls of the usual type, 13 which cancel against 
each other [as in Eqs. (11) and (12)] when vector or 
axial-vector divergences are taken. It is expected 13 on 
general grounds that the commutator [dr(#),y« fi (y)] 
does not involve derivatives of the 5 function and that 
the commutators \j r (%) 9 jo 6 (y)2 and \jo(%),j*(y)2 do 


and Drell (Ref. 1). Let T 9l afi(k j,...) be an arbitrary amplitude 
Involving an external photon of polarization <r and four-momentum 
ku an axial-vector current with four-momentum —jfej-HA, 
2/ external fermions, and b additional external photons. Because 
of charge-conjugation invariance, we cannot have 2f=b=0. 
When />0 or b>l, the asymptotic coefficient a associated with T t 
as £io —* 00 , can never be greater than zero. When /=0 and b— 1, 
the superficial asymptotic coefficient is 1 (the graph is linearly 
divergent), but gauge invariance implies that the photon b must 
couple through its field-strength tensor, and this reduces the 
effective a to zero. Thus a for T can never be greater than zero, 
and since T is arbitrary, this statement holds for all subgraphs 
of T as well. We conclude that • * • )~(ln£io) jS as ifeio —>• 00 , 

and since ki*T 9lt 2 fb(ki,. . .)«0 by gauge invariance, this means 
that T'o M s/*(&i f .. O'^KfHln&o)*. Comparing with Eq. (5), we 
conclude that An identical 

argument holds with replaced by j h . 

12 We believe that Eqs. (19) and (22) are exact when sandwiched 
between normalizable states {a | and | fc). We make no claims about 
matrix elements involving non-normalizable states such as 
j 9 (x)\a) or yV(y)|a) and, in particular, we do not demand that 
the commutators of Eq. (8) satisfy the Jacobi identity. (They 
do not.) For a discussion of Jacobi-identity breakdown, see Johnson 
and Low (Ref. 7). 

13 See Adler and Dashen (Ref. 4), Chap. 3; BouLware and Brown 
(Ref. 9); D. G. BouLware, Phys. Rev. 1 f2, 1625 (1968). 
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not involve derivatives of the 5 function higher than 
the first. Under this assumption, Eq. (31) represents the 
most general form for these commutators consistent 
with Eqs. (14) and (15). 

Using Eq. (19), we can easily complete the argument 
sketched in I that the operator 


Q 5 = / d*x[j Q 5 (x) + (oto/ir)k(x)- V x XA(a;)] (32) 


is the conserved generator of 75 transformations in 
massless electrodynamics. In I it was shown that 


-“Q 6 =0, (33) 

dt 


We now show that Q* commutes with the photon field 
variables. From the first line of Eq. ( 8 ) we find 


[&A ,(>>)]=KJ'UoCy)]=0. 


(34a) 


while from Eq. (19) we find 14 


K 6 4 r(y)]=[y 

+[/ d*. 


d s x jo 5 (x) 


Mi] 


+ f d 3 x(ao/ir)A(x)-V I X\(x) 


,Ar(y)] 


2icto 2iao 

=- B’(y) - B'(y)= 0, (34b) 

T T 

as promised. 

Finally, we will show that when two photons are 
pulled in, the triangle graph cannot be represented by 
a reduction formula containing a time-ordered product 
with the usual properties. When two photons are 
pulled in, Eq. (3) is replaced by 16 


J d A xd*ye- ik * x e~ ikt v 

Xn x n v (0\T(j\*(0)A <r (x)A p (y))\0) 

= ileo 2 /(2T)^R aptl (kM- (35) 

Bringing and □„ inside the time-ordered product 
on the left-hand side of Eq. (35) gives 18 

14 Equation (34) and Eqs. (16) and (19) may be combined into 
the simple observation that C<2 8 »-4r]=0 and dQ*/dt=0 implies 
MVir> 0 . 

11 Again, we neglect the photon wave-function renormalization. 
18 We have suppressed the dependence of and S p9 p on ki 
and k». 


J d A xd A ye~ ikl ’ x e~ ik *‘ v 

Xn*n y (0\njf(0)A v (x)A p (y))\0) 

== Cjiffp(^ 10 )^ 2 o) (km,kzo) , (36) 


10 j^ 2 o) 


=<?o*y 


,— **1 •*-—»**• v 


d A xd*y e 


X(0|r(;V(0)i.Wip(y))|0). 


The “time-ordered product” C Mffp contains all of the 
dynamical singularities of the matrix element, but in 
addition there is a polynomial in k\ and k 2 , which we 
have labeled S pap , arising from anomalous commutators 
of A and A with the currents. If the time-ordered 
product Cpo- p were of the usual type, then it would have 
the Bjorken-Johnson-Low behavior in the limits as 
£ 10 , £ 20 , or £ 10—£20 become infinite. That is, we would 
have 


C^ ffp (^io,^ 2 o) 


—ie o 2 


fcl0~»°°.fc20 fixed fe. 


d A xd A y 


Xe'^dixo)?-**- v (0 1 F([>(^),; M 6 ( 0 )]; p (y)) | 0 ) 

+ 0 (O n ^io)V^io 2 ) j 


U P ff P (^ 10 j^ 2 o) 




* 20 -«o.*i 6 fixed ^ 20 


d A xd A y 


Xe-^e'^-niy^ I nU,(yW(o)]j,(*)) I o> 

+O((ln£ J 0 )W), (37) 


C H a p (k lOjfeo) 


*U>~*30-* so .*i0+*20 fixed 


—ie o 2 r 
■IQ—kzoJ 


d K ocd A y 


C*+v)^.'(ki-ka). (a* —y 0 )) 

+OC(ln(^io — ^ 2 o))V(^io — ^ 2 o) 2 ]* 

According to Eqs. (36) and (37), all terms in R apti 
winch either approach constants or diverge linearly in 
the three limits must be contained entirely in the 
polynomial S . In Eq. (7) we saw that as £ 10 —, with 
kio fixed, Ra pp approaches a nonzero finite limit and, by 
Bose symmetry, the same statement holds for the 
limit £ 20 —»°°) with km fixed. In I it was shown that in 
the limit km—km —»°°, with £ 10+^20 fixed, R trpM diverges 
linearly (i.e., behaves as finite coefficient times km— £ 20 )- 
Clearly, these three limiting behaviors cannot be de¬ 
scribed by a polynomial in km and £ 20 , which means 
that C M(rp cannot vanish in all three of the limits in 
Eq. (37). Thus, the time-ordered product appearing in 
the two-photon reduction formula is not of the usual 
type. 

We wish to thank L. S. Brown, R. Jackiw, and S. B. 
Treiman for helpful conversations. 
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Absence of Higher-Order Corrections in the Anomalous 

Axial-Vector Divergence Equation 
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We consider two simple field-theoretic models, (a) spinor electrodynamics and (b) the a model with the 
Polkinghome axial-vector current, and show in each case that the axial-vector current satisfies a simple 
anomalous divergence equation exactly to all orders of perturbation theory. We check our general argument 
by an explicit calculation to second order in radiative corrections. The general argument is made tractable 
by introducing a cutoff, but to check the validity of this artifice, the second-order calculation is carried out 
entirely in terms of renormalized vertex and propagator functions, in which no cutoff appears. 


I. INTRODUCTION 

I T has recently been shown 1 - 2 that the axial-vector 
current in spinor electrodynamics does not satisfy 
the usual divergence equation 

d»j tl 5 (x)=2imoj 5 (x), 

h\ x ) =^My>.Y^M , j*( x ) =${x)y${x ), 

expected from naive use of the equations of motion, 
but rather obeys the equation 

*//(*) = 2m 0 y5(^)+( ao /4T)^^(^^(^)e UTp , (2) 

with the electromagnetic field strength tensor. 
Similarly, it was shown that in a simple version of the 
Gell-Mann-L6vy <r model 3 coupled to the electromag¬ 
netic field, the axial-vector current does not satisfy 
the usual PCAC (partially conserved axial-vector 
current) equation 

&jA*) = — (MiVgo)ir(z), 

jA x ) =${x)\yf/ b \(/(x) +<r(x) d„r(x) (3) 

—r(x)d l> o-(x)+gr 1 d tt T(x), 

but instead obeys the modified PCAC condition 

&jA x ) = “ (miVSoM*) 

+\{^/ ^)F^{x)F Tp {x)t ^ Tp . (4) 

In both theories, the extra term in Eqs. (2) and (4) 
arises from the presence of the axial-vector triangle 
diagram shown in Fig. 1. This diagram has an anoma¬ 
lous property; when it is defined to be gauge-invariant 
with respect to its vector indices, it does not satisfy the 
usual axial-vector Ward identity. 

* Research sponsored by the Air Force Office of Scientific 
Research, Office of Aerospace Research, U. S. Air Force, under 
AFOSR Grant No. 68-1365. 

1 S. L. Adler, Phys. Rev. 177, 2426 (1969), hereafter referred to 
asl. As in I. we use the notation and metric conventions of J. D. 
Bjorken and S. D. Drell, Relativistic Quantum Fields (McGraw- 
Hill Book Co., New York, 1965), pp. 377-390. 

2 See also J. Schwinger, Phys. Rev. 82, 664 (1951), Sec. V; 
C. R. Hagen, ibid . 177, 2622 (1969); R. Jadriw and K. Johnson, 
ibid. 182, 1459 (1969); R. A. Brandt, ibid. 180, 1490 (1969); 
B. Zumino (unpublished). 

3 M. Gell-Mann and M. L6vy, Nuovo Cimento 16, 705 (1960). 
We actually study a truncated version of the 9 model proposed 
by J. S. Bell and R. Jackiw, ibid. 60A, 47 (1969). 
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An essential conclusion 4 of I was that Eqs. (2) and 
(4) are exact. In other words, the anomalous term 
F^ ar F Tp e^ frTp does not receive additional contributions 
from radiative corrections to triangles, such as shown 
in Fig. 2 (the wavy line denotes either a photon or a 
meson). This conclusion follows naively from the ob¬ 
servation that radiative corrections to the basic tri¬ 
angle involve axial-vector loops (such as the five-vertex 
loop shown in Fig. 2) which, unlike the lowest-order 
axial-vector triangle, do satisfy the usual axial-vector 
Ward identity. The purpose of the present paper is to 
support this naive reasoning with more detailed calcu¬ 
lations, and in particular, to show that the fact that 
radiative corrections to triangles involve the usual re- 
normalizable infinities causes no trouble. 

The plan of the paper is as follows. In Sec. II we con¬ 
sider the two models discussed in I—spinor electro¬ 
dynamics and the <r model—and develop a general argu¬ 
ment which shows that Eqs. (2) and (4) are exact. In 
Sec. Ill we give an explicit calculation of the second- 
order radiative corrections to the triangle. We find, in 
agreement with our general arguments, that when all of 
the second-order radiative corrections are summed, 
their contributions to the F**F tp g$ 9Tp term exactly 
cancel. In Sec. IV we briefly summarize our results and 
compare them with the conclusions reached recently by 
Jackiw and Johnson. 2 


Fig. 1. Axial-vector triangle diagram which 
leads to the extra term in Eqs. (2) and (4). 



w* V>5 Ws 



Fig. 2. Typical second-order radiative corrections to the triangle 
_ diagram in spinor electrodynamics. 


4 For example, in order for the low-energy theorem for i r° —* 2y 
derived in I to be valid, it is essential that there be no strong- 
interaction corrections to the anomalous term in Eq. (4). 
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n. GENERAL ARGUMENT 

We develop in this section a general argument, valid 
to any finite order of perturbation theory, which shows 
that Eqs. (2) and (4) are exact. The basic idea is this: 
Since Eqs. (2) and (4) involve the unrenormalized fields, 
masses, and coupling constants, these equations are 
well defined only in a cutoff field theory. Consequently, 
for both of the field-theoretic models discussed, we con¬ 
struct a cutoff version by introducing photon or meson 
regulator fields with mass A. In both cases, the cutoff 
prescription allows the usual renormalization program 
to be carried out, so that the bare masses and couplings 
and the wave-function renormalizations are specified 
functions of the renormalized couplings and masses, 
and of the cutoff A. In the cutoff field theories, it is 
straightforward to prove the validity of Eqs. (2) and 
(4) for the unrenormalized quantities; this is our 
principal result. From Eqs. (2) and (4) we obtain 
exact low-energy theorems for the matrix elements 
( 2 y 12 moi B 10 ) and ( 2 ?| (—MiYgoVlO); the latter of 
these yields the t° —» 2y low-energy theorem discussed 
in I. 

Having summarized, in a very condensed way, our 
method and results, we now turn to the details in the 
various models. 

A. Spinor Electrodynamics 

We consider first the case of spinor electrodynamics, 
described by the Lagrangian density 

£(#) =\p(x) —\F iiy {x)F^ v {x) 

—e Q }p(x)y l jp(x)A fi (x) , (5) 

F fUt {x) = d v A il {x) —d lx A y {x ), d^y^d^. 

We introduce a cutoff by modifying the usual Feynman 
rules for electrodynamics as follows. 

(i) For each internal fermion line with momentum 
p we include a factor i(p —wo+fe ) -1 and for each vertex 
a factor —ie<fY m with mo and e 0 the bare mass and charge. 
For each internal photon line of momentum q, we replace 



(b) 



fc) 




Fig. 3. (a) Two-vertex photon vacuum polarization loop, (b) 
Larger vacuum polarization loops, (c) Vertex and self-energy 
parts which are made finite by the photon propagator cutoff and 
gauge invariance of Loops. 


the usual propagator — igiip(q i ~hie)~ 1 by the regulated 
propagator 

... r i 1 \_~ i &- 

, ) • ( 6 ) 

\£ 2 +ie q 2 — A 2 +te/ £ 2 +ie q 2 — A 2 +t€ 


(ii) Let II ^^{q) denote the two-vertex vacuum 
polarization loop [Fig. 3(a) 3 


n ^*( 4 ) 



d*k r 1 1 

-Tr - y„ - 

(2t) 4 L k— Wo+ie k+q— tno+ie 



Wherever appears, we use its gauge-invariant, 

subtracted evaluation 6 

n<«(o)=o. (8) 

All vacuum polarization loops with four or more vertices 
[Fig. 3(b)3 are calculated by imposing gauge invariance; 
this suffices to make them finite without need for further 
subtractions. 

(iii) As usual, there is a factor J'd 4 l/{ 2tt) 4 for each 
internal integration over loop variable l and a factor 
— 1 for each fermion loop. 

(iv) We use the standard, iterative renormalization 

procedure 6 to fix the unrenormalized charge and mass 
e 0 and mo and the fermion wave-function renormaliza¬ 
tion Z 2 as functions of the renormalized charge and 
mass e and m and the cutoff A. For finite A, the renor¬ 
malization constants eo, mo, and Zt will all be finite. 
The reason is that regulating the photon propagator 
(plus gauge invariance for loops) renders finite all vertex 
and electron self-energy parts and all photon self¬ 
energy parts other than [Examples of such ver¬ 

tex and self-energy parts are given in Fig. 3(c).3 The 
self-energy part II^ (2) has already been made finite by 
explicit subtraction. 

(v) We include wave-function renormalization fac¬ 
tors Z^ 12 and Z 3 1/2 for each external fermion and photon 
line. (We recall that Zi=e 2 /e o 2 .) 

This simple set of rules makes all ordinary electro¬ 
dynamics matrix elements finite. We may summarize 
the rules compactly by noting that they are the Feyn¬ 
man rules for the following regulated Lagrangian density: 

<£*(#) =£o B (s)+£r B (aO, 

JGo B (a;) 0“ wo)^ (x) —J F ltP (x)Ff lP (x) 

+iF^ R (x)F R ^(x) —^A. 2 A ti R (x)A Rfi (x) , (9) 

£i R (x) = —ef$(x)y$(x)[A li (x)+A Rli (x)] 

+C««[F |ff (*)+F |W *(*)XF^(a:)+F*»*(*)], 

where A yfi is the field of the regulator vector meson of 
mass A, and F ftv R (x) = d v A v R (x)—d lx A v R (x) is the regu¬ 
lator field strength tensor. The regulator field free 


6 Bjorken and Drell (Ref. 1), Chap. 19. 
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Lagrangian density is included in «Co B (aO with the op¬ 
posite sign from normal; hence according to the canon¬ 
ical formalism, the regulator field is quantized with 
the opposite sign from normal—that is, 

[A „ R (x),A v R {y )~\| a .»_ v o=ig„^ > (x-y), 


Fig. 5 ; Arbitrary Feynman 
amplitude involving 


whereas 


2 f b 



Ld„(x),A,,(y)l | *•=*• = -ig„r8 3 (x-y) 

—giving the regulator bare propagator the opposite sign 
from the photon bare propagator. The interaction terms 
in £t r (x) treat the regulator and the photon fields 
symmetrically. The term proportional to C (2) is a 
logarithmically infinite counter term which performs 
the explicit subtraction in the two-vertex vacuum 
polarization loop so that II f2 >(0)=0. 

Having specified our cutoff procedure, we are now 
ready to introduce the axial-vector and pseudoscalar 
currents j/(x) and j 6 (x), and to study their properties. 
First, we must check whether all matrix elements of 
these currents are finite when calculated in our cutoff 
theory. The answer is yes, that they are finite, and 



Fig. 4. Basic loops involving one axial-vector or one 
pseudoscalar vertex. 


our attention first to the type-(a) contribution pictured 
in Fig. 6(a), which can be written 


in -1 wr 1 “| 1 1 

E II y li > - Y (i) - YiAfs— - 

*-i L p-i-pj—miiJ p+pk—niQ P +£&— Wo 


2n-i r in 

X n y (}) — - 7 (S»)(. • •), (10) 

L P'+Pi —m Q J 

Q=p-p', 

Q = p - p' 



2 f-2 


(a) 


b 


follows immediately from the fact that all of the basic 
fermion loops involving one axial-vector or one pseudo¬ 
scalar vertex (Fig. 4) are made finite by the imposition 
of gauge invariance on the photon vertices, without the 
need for explicit subtractions. Thus, we can turn im¬ 
mediately to the problem of showing that Eq. (2) is 
exactly satisfied in our cutoff theory. 

Let us consider an arbitrary Feynman amplitude 
involving //, with 2/ external fermion and b external 
boson lines (Fig. 5). Proceeding as in I, we divide the 
diagrams contributing to the Feynman amplitude into 
two categories, which we call type (a) and type (b). 
The type-(a) diagrams are those in which the axial- 
vector vertex y^Ts is attached to one of the / fermion 
lines running through the diagram; a typical type-(a) 
contribution is shown in Fig. 6(a). By contrast, the 
type-(b) diagrams are those in which the axial-vector 
vertex yfys is attached to an internal closed loop; in 
Fig. 6(b) we show a typical contribution of type (b). 
In both Figs. 6(a) and 6(b), we have denoted by Q the 
four-momentum carried by the axial-vector current. 

To study the divergence of the axial-vector current, 
we multiply the matrix element of by iQ*. We turn 


2f b 

.. .'•v 



(b) 

Fig. 6. (a) Contribution to Fig. 5 in which the axial-vector 
vertex is attached to one of the / fermion lines running through 
the diagram, (b) Contribution to Fig. 5 in which the axial-vector 
vertex is attached to an Internal closed loop. 
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where we have focused our attention on the line to 
which the 7^75 vertex is attached and have denoted the 
remainder of the diagram by (• • ■). As shown in I, when 


we multiply the propagator string in Eq. (10) by iQ* 
and do an algebraic rearrangement of terms, we obtain 
the following identity: 


E 


2n—1 [" 

c n 7 

-1 y-i _ 


(j) 


1 


1 


7 


(&). 


-7m7s 

P+Pj—m J P+Pk—m P 


1 2 n —1 p 

:- n 7 

-\-pk —^0 /“*+! L 


U). 


1 


P f +pj—m J 


7 


(2 n) 


2 n —1 k—\ p 

= e n 7 

fc-i j-i L 


1 


U). 


p+pj-tno-i 


1 


7 


(AO. 


P+Pk—wio 




1 


277—I 

n 


7 


U). 


1 


2«—1 p 

—i II j ^7 


U). 


p'+pic—tno ?= k + l L p'+pj—tnoJ 
1 


7 


(2n) 


p+pj—mo-i 


7 


2 m— 1 r 1 

C 2 n) 7 5 —ijo n \y U) —— 
>=1 L p +pj— 


w 0 J 


7 


( 271 ) 


(ID 


Since the integrals over the four-momenta of the photon 
propagators joining the fermion propagator string 
to the “blob” in Fig. 6 (a) (i.e., the integrals over 
pi,” ’,p 2 n-\) are all convergent in our regulated field 
theory, it is safe to do the algebraic manipulations im¬ 
plicit in Eq. (11) inside the integrals. The first term on 
the right-hand side of Eq. (11) gives the type-(a) con¬ 
tribution to the Feynman amplitude for 2imoj s , cor¬ 
responding to replacing 7^75 by 2im<rfh in Fig. 6 (a). 
The two remaining terms in Eq. (11) are the usual 
“surface terms” which arise in Ward identities from 
the equal-time commutator of jo 6 with the fields of the 
external fermions of momenta p and p f , 6 Thus, as far 
as the type-(a) contributions to the Feynman amplitude 
are concerned, the divergence of is simply 2 intoj 5 , 
with no extra terms present. 

We turn next to the type-(b) contribution pictured 
in Fig. 6 (b), which we write as 


F(Q) 7/»75j Ph ' ' ' iPin— l) ( ' ' ' ) j 

£(Qi T; Pw ' 'j p2n~l) 


= d*r Tr 


in k— 1 [” 

:n 7 

1 j'-i _ 


U). 


1 


r+pj—ntoJ 


1 


1 


X7 


(A). 


r+pk—m 0 r+pk—Q— w 0 

1 

x n i7 w 


in p 

n 7 

y=A+i _ 


r+pj—Q—tn 0 J 


, (12) 


where we have focused our attention on the closed loop 
and have again denoted the remainder of the diagram 
by (■■■). As was shown in I, some straightforward 
algebra implies 


L{Q\ iQ*Jiijbt p\,' ■ • ipin-\) 

— L(Q; 2imoj5j p\r ■ ',pi n -\) 


+i 


dV Tr 


75 



1 1 
O’)- 

r-\-pj — nto- 


—75 


2n p 1 - 

n 7 o) -—— 

y-i L r+pj—Q—nto-i 



6 Y. Takahashi, Nuovo Ciinento 6, 370 (1957), 


For loops with n>2 (i.e., with four or more vector 
vertices) the residual integral in Eq. (13) is sufficiently 
convergent for us to be able to make the change of 
variable r ~» r-\-Q in the second term, causing the two 
terms in the curly brackets to cancel. Thus, the loops 
with n>2 satisfy the usual Ward identity 

«(?*Y*7b, P\,' ' " iPin —l) 

=£((?; 2m»o76; ph’ • ’,pin~\). (14) 

Again, since the integrals over pi, '’-,pt n -i are all 
convergent in the regulated field theory, the manipula¬ 
tions leading to Eq. (14) can all be performed inside 
these integrals. This means that the type-(b) pieces 
containing loops with n> 2 all agree with the usual 
divergence equation d ti jp 5 (x) = 2itnoj 5 (x). 

Finally, we must consider the case n = 1, that is, the 
axial-vector triangle diagram illustrated in Fig. 7. As 
was shown in I, when the triangle is defined to be gauge- 
invariant with respect to the vector indices, it does not 
satisfy Eq. (14) for the axial-vector index divergence. 
Instead, there is a well-defined extra term left over 
which comes from the failure of the two terms in the 
curly brackets in Eq. (13) to cancel. The analysis of I 
shows that the effect of the extra term is to add to the 
normal axial-vector divergence equation the term 

(a 0 /hr)[Ft*(x)+F R t*(x)l 

X\iF Tp (x)+F RTp (x)2^ 0 r P . (15) 


2 f b 



w* 

Fig. 7. Contribution of the axial-vector triangle 
diagram to the Feynman amplitude of Fig. 5. 
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To summarize, our diagrammatic analysis shows that 
the axial-vector divergence equation in the regulated 
field theory is 

d*jr 5 (x)=2im 0 f(x) + (a 0 / / h-)Fi or (x)F T f>(x)ti:*rp 

+(a 0 /4r )£Ft*(x)F RT *(x)+F*t*(x)F T >(x) 

+F R ^(x)F Rr f , (x)y ^ Tp . (16) 

Equation (16) is identical with Eq. (2), apart from the 
terms involving F R which arise from our explicit in¬ 
clusion of a regulator field. The crucial point is that the 
coefficient of the anomalous term is exactly a 0 /4i r and 
does not invoke an unknown power series in the coupling 
constant coming from higher orders in perturbation theory. 

The diagrammatic analysis which we have just given 
may be rephrased succinctly as follows: If we use the 
regulated Lagrangian density in Eq. (9) to calculate 
equations of motion, and then use the equations of 
motion to naively calculate the axial divergence, we find 

5 p yV (x) —2imoj 5 (x) . (17) 

Extra terms on the right-hand side of Eq. (17) can only 
come from singular diagrams where the naive deriva¬ 
tion breaks down. In the regulated field theory, all 
virtual photon integrations converge and therefore, can¬ 
not lead to singularities giving additional terms in Eq. 

(17) . Hence breakdown in Eq. (17) (if it occurs at all) 
must be associated with the basic axial-vector loops 
shown in Fig. 4. But, as we have seen, the axial-vector 
loops with four or more photons satisfy Eq. (17), so 
the basic triangle diagram is the only possible source of 
an anomaly. 

Having derived our basic result, we turn next to the 
low-energy theorem for 2imoj 5 (x) which is implied by 
Eq. (16). Taking the matrix element of Eq. (16) be¬ 
tween a state with two photons and the vacuum gives 

F(kyh)~G(k v k2)+H(kvk 2 ), (18) 

where 

<Y(£i,eOY(£2,e 2 ) ] 10) 

= (4^io^2o)“ 1/2 ^1^2 T ei* <r e2* p e£T <rpF(ky ki) , 

<Y (h h ci)y(^ 2 j e 2 ) 1 2i m 0 f | 0) 

— (4^io^ 2 o)~ 1 ^ 2 ^i^ 2 T ei* <r e 2 * p ef Tffp G(^i' ^ 2 ) , 
(a 0 /47r)(Y(^i,ei)Y(^2,e 2 ) | (F^+F^^T^+E^^e^p |0) 
= (4:kiQkzo)- 1 li ky*kz T £**^2* p e$T<rpH(kl' kz) . (19) 

We wish, in particular, to study Eq. (18) at the point 
kyki~0. As has been shown by Sutherland and 
Veltman, 7 F(kykt) cc£ r £ 2j so the left-hand side of Eq. 

(18) vanishes at kyk 2 — 0, giving 

G(0) = -tf(0). (20) 

There are two types of diagrams which contribute to 
as illustrated in Fig. 8, where we have used 
the symbol 0 to denote action of the operator 


(a) 



(b) 


Fig. 8. (a) Diagram in which the operator (ao/4ir)(F^+F Jl ^) 
X(F Tp H-F firp )€{ < rrp, denoted by®, attaches directly to the external 
photon lines, (b) Diagram in which there is a photon-photon scat¬ 
tering between ® and the two external photons. 

(a 0 /47r)(F f,r -|-F Kf ' r )(F Tp +F' STp )ef < r T p. In the diagrams in 
Fig. 8(a), the field strength operators attach directly 
onto the external photon lines, without photon-photon 
scattering. The effect of the vacuum polarization parts 
and the external-line wave-function renormalizations 
is to change ao to a, giving 

tf(0)(«>=2a/V. (21) 

In the diagrams in Fig. 8(b), there is a pboton-pboton 
scattering between 0 and the free photons. As a result 
of the antisymmetric tensor structure of the anomalous 
divergence term, the vertex 0 is proportional to ky+k». 
Also, the diagram for the scattering of light by light is 
itself proportional tokykt, since photon gauge invariance 
implies that the external photons couple through their 
field strength tensors. 8 Thus, the diagrams in Fig. 8(b) 
are proportional to kyk 2 (ki+k 2 ) and are of higher order 
than the terms which contribute to the low-energy 
theorem, giving us 

H( 0)< 6 >=0. (22) 

Combining Eqs. (20)-(22), we get an exact fow-energy 
theorem for the operator 2imoj 5 y 

G(0) = —2a/V. (23) 

So far in our discussion we have kept the cutoff A 
finite, so that G(0) is a matrix element calculated with 
our modified Feynman rules. Let us now indicate 
briefly the form which Eq. (23) takes when the cutoff 
A becomes infinite. A straightforward analysis of matrix 
elements of the operator j h shows that divergences as 
A —» oo are associated only with the electron propagator 

8 R. Karplus and M. Neumann, Phys. Rev. 80, 380; 83, 776 
(1950). 


7 D. G. Sutherland, Nucl. Phys. B2, 433 (1967). 
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Sp f (p), the photon propagator Dr(q)p. Vj the photon 
vertex part Tp(p,p'), and, in addition, the pseudoscalar 
vertex part r *(p,p f )j defined by 

SF , (P)T b (p,p , )Sr(f r ) = — Jd*xd*y 

X(T(P(x)fm(y)))o. (24) 

In matrix elements of ntoj b , the vertex part Y & (p,p f ) 
will clearly always occur in the combination MoY 5 (p,p'). 
Let us now introduce the usual electrodynamic re¬ 
normalizations 

zr\s F ’(p)=s F '(p), 

Z,- 1 D F '(q) III ,=D F '(q) ftr , 

Zir„(p,pi-r l .(p,p / ), ( ; 

eo—Zf~ lli e j 

plus the usual wave-function renormalizations on ex¬ 
ternal lines. The effect of these rescalings is to replace 
m 0 r *(P,P')j wherever it occurs, by maZiY h {p,p f ). But, 
as we will now show, this latter quantity is finite (i.e., 
cutoff-in dependent as A—»oo). To see this, we note 
first that Y 6 (p,p f ) is multiplicatively renormalizable 9 ; 
therefore, if Zt>T & (p,p) is finite, then so is Z^{p i p / ). 
Next, let us write the Ward identity for the axial-vector 
vertex, 

(p-pW(p,p') =2mcT 6 (p,p') -i(a a /4*)F(p,p') 

+S Apr^+y^S/(?)-', (26) 

where IV and F are defined by replacing j 6 (0) in Eq. 
(24) by 7V(0) and F ia (0)F Tp (p)e^Tpj respectivdy. When 
p =p f j the left-hand side of Eq. (26) obviously vanishes. 
It is also easy to see that P(p,p) =0 as a result of the 
antisymmetric tensor structure of this term. So the 
axial-vector Ward identity at p becomes the simple 
equation 

0 = 2m Q Y\p J p)+S F t {pr^+y 5 SF , {pr y , (27) 

which immediately implies that m^ZiY b (p,p) is finite. 
If we introduce a renormalized pseudoscalar vertex 
part by writing 

moZzY 6 (Pjp f ) =mT h (p,p') , (28) 

then Eq. (28) may be rewritten as the equal-momentum 
boundary condition 

o= 2 »r Kp^+s/ipr^+y^Apr 1 • (29) 

The results of this analysis may be summarized as 
follows: If we calculate an arbitrary matrix element of 

# We recall that multiplicative renormalizability of the usual 
vector vertex Y p follows from the fact that T p satisfies the integral 
equation J*Y 0 Sf , SFK [see Bjorken and Drell (Ref. 5)]. 

More generally, G. Preparata and W. I. Weisberger [Phys. Rev. 
175, 1965 (1968)] have observed that in spinor electrodynamics 
the vertex Yo(pjp') of with O a product of 7 -matrices, 
satisfies the integral equation r 0 =0— J'YqSf'Sf'K, and there¬ 
fore is multiplicatively renormalizable. 


*»o/ B in our cutoff theory, and then let A—we get 
the same answer as if we calculated all the skeleton 
diagrams for the matrix element and replaced the elec¬ 
tron and photon lines and the vertex parts appearing 
in the skeleton by the renormalized quantities Sp f (p), 

j and mP b (p,p f ). These quantities can 
all be calculated without recourse to cutoffs by using 
dispersive methods; in the case of the pseudoscalar ver¬ 
tex the subtraction constant in the dispersion relation 
can be fixed by using Eq. (29) as a boundary condition. 

Returning to our low-energy theorem, we see that in 
the limit A —>00 the pseudoscalar-photon-photon 
matrix element G(kykt) becomes the renormalized 
matrix element G(kyk 2 ) calculated by the recipe we 
have just outlined, and the low-energy theorem tells us 
that 

g(0) = -2a/T. (30) 

In other words, all order a 2 , a 3 , • • •, contributions to 
G{kyk%) vanish at kykt—Q. In the next section, we 
will verify by explicit calculation that the order a 2 terms 
do cancel. 

In conclusion, we remark that the arguments which 
we have given in this section for spinor electrodynamics 
apply, with only trivial modification, to the neutral- 
vector-meson model of strong interactions. In particular 
the low-energy theorem analogous to Eq. (23) will hold 
to all orders in both a and the neutral-vector-meson 
strong coupling. 

B. cr Model 

We turn next to the case of Gell-Mann and Levy’s 
<r model. 3 As in I, we consider a truncated version of the 
<r model which contains only a proton (^), a neutral 
pion (t), and a scalar meson (<r), but omits the charged 
pions and the neutron. Our exposition will differ some¬ 
what from the previous case of spinor electrodynamics, 
where we first introduced a cutoff procedure to remove 
infinities from the theory, and then afterwards pro¬ 
ceeded to discuss the properties of the axial-vector cur¬ 
rent. In the case of the <r model, we will have to consider 
the axial-vector current simultaneously with our intro¬ 
duction of the cutoff, in order to ensure that the cutoff 
preserves the usual PCAC equation pEq. (3)]] when 
electromagnetic interactions are neglected. Once we 
are sure that the axial-vector divergence equation in 
the absence of electromagnetism has no abnormalities, 
we can then determine how Eq. (3) is modified when 
electromagnetic effects are taken into account. 

We begin by writing the Lagrangian for the <r model 
and discussing some of the formal properties of this 
theory. We have 

£ —— Go(go“ 1 +o' -\-irry 5)^ 

+Xo[4o- 2 +4g 0 <r((r 2 +T 2 )+go 2 ( 0 ' 2 + 7r2 ) 2 ]] 

+ 2 Mo 2 C 2 g 0 ” V+ <r 2 +t 2 ] 

+iC(dT) 2 +(5(r) 2 ]-W(T 2 +(r 2 ), (31) 
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where we have chosen the fully translated form of the a 
model with 

(<r)o=0 (32) 

to all orders of perturbation theory. The axial-vector 

current is generated by the chiral gauge transformation 

xj/-* (1 

t —>T—v(gir 1 +<r ), (33) 

go" 1 +0- —> go” 1 + 0 -+VT , 

giving 

ji? = —5£/5(3^) -T^a+go” 1 ^, 

d*jp = —ScC/Sv = — (mi 2 /£o)t . (34) 

The terms in Eq. (31) have the following significance: 
(i) Go is the unrenormalized meson-nucleon coupling 
constant; (ii) the quantity go is related to the bare 
nucleon mass mo by 

Go/go—tnoj (35) 

and may be expressed directly as a vacuum expectation 
value, 

l/g° = i^y<2 3 *io 6 M,7r(0)J| y ; (36) 

(iii) jui 2 is the bare meson mass which appears in the bare 
propagators A F a (q) and A f t (?), 

A^--(j)-1/(9*-ah*+«); (37) 

(iv) the term \ 0 C4a 2 +4goa(a 2 +7r 2 )-|-g() 2 ((r 2 +7r 2 ) 2 [] is a 
chiral-invariant meson-meson scattering interaction; 
and (v) the term 2Mo 2 Pgo”V+(r 2 +7r 2 [] i s a chiral-in¬ 
variant counter term which is necessary to guarantee that 

(&£/&<r)o = d\(&£/&(d\<r))o =0, (38) 

as is required by the Euler-Lagrange equations of 
motion and translation invariance. Equations (32) and 
(38) fix mo 2 to have the value 

Mo 2 = (Gogo^ — XoC4go 2 (3(T 2 +7T 2 )+4goV(<r ! +7r 2 )[])o. (39) 

The effect of mo 2 , which is formally quadratically di¬ 
vergent, is to remove the “tadpole” diagrams of the 
type shown in Fig. 9, so that the condition (a)o—0 is 
maintained in each order of perturbation theory. It is 
easily seen that the mo 2 counter term simultaneously 
removes the quadratically divergent parts of the ir- 
and a-meson self-energies, so that the remaining bare 
quantities appearing in the Lagrangian (Go,go, mi) j as 
well as the wave-function renormalizations, are at most 
logarithmically divergent. 


Fig. 9. Tadpole diagram. 

I 

J 

I 



An important feature of the Lagrangian density in 
Eq. (31) is that it is not normal-ordered; the omission 
of normal ordering is essential in order for the axial- 
vector current to satisfy the PCAC equation (34). To 
see this, let us consider the effect of normal ordering on 
the chiral-invariant meson-meson scattering term, 

&MM “£AfAf^ 2) +'£AfAf <3) +'£AfAf f4) , 

—4a 2 , =4goa(a 2 +7r 2 ), (40) 

The normal-ordered forms of the two-, three-, and four- 
meson scattering terms are defined by 

(: £&m C2:) :) o = {(S/ do ): Z M m 2:) :) — 0, 

<:£AfAf C3) :)o = <(d/da) :£ J wAf C8) :)o = ((3/d7r):£AfAf ca), .)o 

— {(d 2 /d<r 2 ): C3) : )o 

= ((d/d<r)(d/di 0: J5jirjir C8> : >o 

=<(5 2 /d7r 2 ):£W 8) :)o=0, 1 ; 

(:£ J w J w C4) : )o = {(d/da) :-£**-**- C4) : )o = ((d/dx): C4) : )o 

— ((3 2 /3a 2 ):£AfAf C4); )o= ■ ■ ■ 

= <(5 3 /5t3):W 4) :)o = 0. 

These conditions may easily be satisfied by introducing 
counter terms to remove the vacuum expectation values 
of the various derivatives, 

'.<£AfAf C2) '. =4a 2 — (4a 2 )o, 

:<£Afjv C3) : =4g 0 a(a 2 +T 2 )—4goa(3<r 2 +T 2 )o 

-4go(a(a 2 +7T 2 )) 0 , 

:£AfAf (4> : ^go 2 ^ 2 ^-^ 2 ) 2 —4g 0 V(a(a 2 +T 2 )) 0 (42) 

— 2g 0 V 2 (3a 2 +tt 2 )o— 2goV 2 (a 2 +3T 2 )o 
—go 2 ((<r 2 +7r 2 ) 2 )o+2go 2 (o- 8 )o(3a 2 +7r 2 ) 0 

+ 2go 2 (7r 2 )o(a* 2 + 3 t 2 ) o, 

giving 

4goa(3a 2 +T 2 ) 0 —4g 0 V(a(a 2 +7r 2 ))o 
—2goV 2 (3a 2 +7r 2 )o — 2go 2 T 2 (a 2 +3T 2 )o 

+const. (43) 


Clearly, the normal-ordered interaction :-£**-**-: will be 
chiral-invariant only if the counter terms combine to 
be proportional to o 2 +T 2 +2o/go, that is, only if 


(3a 2 +7r 2 )o = (o 2 +3t 2 )o = (3a 2 +7r 2 )o+go(o-(a 2 +T 2 ))o, 

(44) 


which requires 


<<r 2 )o = <7r 2 )o, 
(a(a 2 +T 2 )) 0 = O. 



These conditions would be satisfied if t and a were free 
fields, but they are not true in the presence of the inter¬ 
action terms of Eq. (31). Thus, the normal-ordered 
form is not invariant under the chiral gauge 

transformation of Eq. (33) and, if used in the Lagran¬ 
gian instead of spoils the PCAC equation. The 
way out of this difficulty consists in noting that the 
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normal-ordering conditions of Eq. (4) are not necessary 
for the consistency of the Lagrangian field theory of 
Eq. (31); all that is necessary is the single condition 
(5£/5(r)o=0. As we have seen, this condition can be 
satisfied by including the chiral-invariant counter term 
proportional to juo 2 , without any use of normal ordering 
in the Lagrangian. 

The fact that mo 2 removes the quadratic divergence 
from the ir and <r self-energies can be expressed in a 
simple equation, which will be very useful in what 
follows. Let A F vf (q) denote the full pion propagator, 
given by 

A F v, {q) — —i f d A x e iq ' x (T(ir(x)T(0)))o 


the presence of electromagnetism , since the antisymmetric 
tensor structure of the extra term in the PC AC equation 
causes the contribution of this term to Eq. (48) to 
vanish at q—0. 

This completes our survey of the formal properties 
of the tr model. We proceed to introduce a cutoff (with 
electromagnetism included) by modifying the usual 
Feynman rules as follows. 

(i) For each internal fermion line with momentum p 
we include a factor i(p— Wo+*0” 1 , with mo~Go/go. 
For each internal photon line of momentum q , we re¬ 
place the usual propagator by the regu¬ 

lated propagator 


= (46) 

where S T ( q 2 ) is the pion proper self-energy. According 
to Eq. (46), 

A^(0)—l/^+S'tO)]. (47) 

An alternative expression for A F vf (q) may be obtained 
by substituting the PCAC equation (34) into Eq. (46), 

go f 

& F v, (q)=si — / d A x e iq ' x {T{{d/dxf}j^{x)T{(i)))^ 

Ml 2 J 


go 


q» / d 4 *tf*«-*<rOV 5 (a:)T(0)))o 


Mr 


1*0 


d*x c-^ ,3t <[;o 5 W^(0)]| * 0 =o)o, (48) 


Mr 


which at q ~0 becomes 
igo 


-1 


A,*'(0) 


d*x <[y 0 5 W^(0)]| xo .o)o=- 


Mr 


Mr 


(49) 


Comparing Eqs. (47) and (49), we obtain the desired 
result 

S'(0)«0. (50) 


Since the differences S T (^ 2 ) —S T (0) and 2*(g 2 ) — 2 T (0) 
are only logarithmically divergent, Eq. (50) tells us 
that the t and <r self-energies 2 T (^ 2 ) and 2*(g z ) are them¬ 
selves only logarithmically divergent. 

So far, we have discussed the tr model in the absence 
of electromagnetism. To include electromagnetism, we 
add to the Lagrangian density of Eq. (31) the terms 

- —effa$A **. ( 51 ) 


We expect, because of the presence of triangle diagrams, 
that electromagnetism will modify the PCAC equation 
by the addition of a term proportional to F^F Tp ^ aTf) . 
However, it is easy to see that all of the other formal 
properties of the tr model which we have derived above 
are unchanged. In particular, Eq. (50) is still valid in 



1 1 
q*+k ? 2 -A 2 +fe 


-«&./ -A 2 

£ 2 +ie\£ 2 — A 2 +ie 



For internal t or a lines, which are not attached at either 
end to the axial-vector current , we replace the usual 
propagator i(q 2 —by the regulated propagator 


1 


1 


mi 2 +& q 2 — A 2 +ie, 


-A 2 +mi ! 


q 2 — jtz^+z'eV^ 2 —A 2 +ze- 


(53) 


For the photon-nucleon, meson-nucleon, and meson- 
meson vertices, we include the factors shown in Fig. 10, 
with eoj go, Co, and \o the appropriate bare couplings. 

(ii) For the axial-vector-current-nucleon and axial- 
vector-current-meson vertices, we include the factors 
shown in Fig. 11. For the pion propagator immediately 
following the axial-vector-current-pion vertex, we use 
the unregulated propagator i(g* — Mi 2 +*e) -1 * while we 
replace the product of pion and <j propagators im¬ 
mediately following the axial-vector-current-pion-<r 


- - ^ 

- - r 

-- c 

VERTEX. . FACTOR 



T 


a■ 

1 

-iGo 

tt ifnfr 

i 

i 

G o r 5 

a<r 

— 

QiX 0 

crcrcr 

—-< 

24ig 0 X 0 

airir 

'' -<* 

* 


<r<r<r<r 

X 

wviK 

IT 7T 7T7T 

S «. * 

24igfX 0 

cr a 7T7T 

X 

0i 9l K 


Fig. 10. Feynman rules for the a model: photon-nucleon, meson 
nucleon, and meson-meson vertices. 
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vertex by 

i 


i 


q t — (?+Q) J — Mi*+7‘e 

i i 

-. (54) 

? 2 -A 2 +te(?+Q) 2 -A 2 +te 

(iii) We use the finite, renormalized values for all of 
the superficially divergent nucleon loop diagrams illus¬ 
trated in Fig. 12. These diagrams fall into six classes: 
(a) diagrams with external <r or t lines only, (b) dia¬ 
grams with one axial-vector vertex and external meson 
lines, (c) diagrams with external photon lines only, 

(d) the axial-vector-photon-photon triangle diagram, 

(e) diagrams with external photon and meson lines, and 

(f) diagrams with an axial-vector vertex and external 
photon and meson lines. In Appendix A we give explicit 
renormalized expressions for the diagrams of types (a) 
and (b), and show that they satisfy the usual axial- 
vector Ward identities. The diagrams of type (c) (pho¬ 
ton vacuum polarization loops) were considered in our 
discussion of spinor electrodynamics. The diagrams of 
types (d)-(f) are made finite and unique by calculating 
them in a gauge-invariant manner. As we have empha¬ 
sized, the triangle diagram of type (d) does not satisfy 
the usual axial-vector Ward identity. We show in Ap¬ 
pendix A that the axial-vector-photon-photon-meson 
box diagram prig. 12(f)] does satisfy the usual axial- 
vector Ward identity. 

(iv) We take account of the counter term proportional 
to mo 2 in the following way. First, we omit all <r-meson 
tadpole diagrams (Fig. 9). (The recipe in Appendix A 
sets the basic nucleon loop tadpole equal to zero, but 
now tadpoles involving virtual meson integrations are to 
be dropped as well.) Second, when calculating pion 
self-energy diagrams 2 T (<? 2 ) involving virtual meson 
integrations, a subtraction at $=0 should be performed 
to ensure that 

2*(0)=0. (55) 

pVe will check explicitly below that the derivation of 
Eqs. (46)-(50) is valid in the cutoff theory.] This sub¬ 
traction eliminates the formal quadratic divergence 
(which has become an actual logarithmic divergence in 
our cutoff theory) and leaves only formal logarithmic 
divergences, which are rendered finite by the cutoffs in 

“ AXIAL current 



Fig. 11. Feynman rules for the <r model: axial-vector-current— 
nucleon and axial-vector-current-meson vertices. 


To- l 


O 
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+ permutations + permutations 
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n r<r 







Fig. 12. Superficially divergent nucleon loop diagrams in the 
<r model. The six categories are described in the text immediately 
following Eq. (54). 


the meson propagators. The tr self-energy is to be calcu¬ 
lated from the pion self-energy by use of the equation 

S'($ 2 ) = [S'( ? 2 ) -2 T (0)]+2 T (0); (56) 

the quantity in square brackets is only formally loga¬ 
rithmically divergent, and hence finite in our cutoff 
theory. All other diagrams involving virtual meson 
integrations are automatically finite in the cutoff 
theory. 

(v) There is a factor 2 t) 4 for each internal 

integration over loop variable l , a factor —1 for each 
fermion loop, a factor \ for each closed loop with one 
or two identical meson fines prig. 13(a)], and a factor 
£ for each closed loop with three identical meson lines 
[Fig. 13(b)]. 

(vi) We use the iterative renormalization procedure 10 

10 We make no attempt to prove renormalizability of the <r 
model. Renormalizability of the <r model (with only the mesons 
present) has recently been discussed by B. W. Lee, Nucl. Phys 
B9, 649 (1969), and renormalization of the closely related <p 4 meson 
theory has been analyzed by T. T. Wu. Phys. Rev. 125, 1436 
(1962). 
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to fix the unrenormalized quantities e 0i go, Go, No, 
nto=Go/go, mi, and the wave-function renormalizations 
Z 2 (fermion wave-function renormalization), Z 3 r , Zf, 
and Z 3 ' y =(e/e 0 ) 1/2 . For finite A, all of these will be 
finite functions of A and of the renormalized quantities 
e , g, G , X, and m, with m the physical pion mass. (Alter¬ 
natively, we can take the independent physical quan¬ 
tities to be e, m, G , X, and m, with m the physical nucleon 


mass.) We include wave-function renormalization fac¬ 
tors Z 2 1/2 , (Z 3 ir ) 1/2 , (Zz a y /2 3 and (Z 3 7 ) 1/2 for each fermion 
pion, tr, and photon external line. 

As in the case of spinor electrodynamics, the cutoff 
rules in the o' model are compactly summarized by the 
statement that they are the Feynman rules for the 
regulated Lagrangian density 11 : 


£X(x)=Ki3-Go(g a -'+r T +iv T 7sW+(D< 2) +K)[.H<r^+4go<rH(^ 2 +(*n 2 )+go 2 ((r T ) 2 M* T ) 2 ) 2 J 

+i£C 2 )[(3T 7 ’) 2 +(3<r I ’) 2 ]+i(F< 2 )+Aio 2 )C2go-V I ’+(<r I ’) 2 +(T I ’) 2 ]+i[(3T) J +(3<r) 2 ]-|Aii 2 (T 2 +<r 2 ) 

-\t{dT R y+{dv R )^+^[X* R y+{* R )^-iF^+\F„ v R F R »'-WA» R A R »-e4yM A ' i + AR, ‘) 

+C™(F r ,+F ir r =T+T\ (r r =<r+(r B . (57) 


The axial-vector current, generated by the gauge trans¬ 
formation 

^(l+lm#, 

7r->T-<g ( r 1 +0'), 

go” I +0’ -* go“ 1 + o’+Wflr> (58) 

w R —> 7r K -^Z)o* K , 

<r R —> o'^+zjt^, 
is 12 

y/= —&£ R /&(d tt v) =^2Tm75^+0'3 a1 t— Trdp<r+g<r l dpW 

-^d^+ir R d ll ir R +E^ 
X(<r T dpT T —w T dr<r T +g{r 1 drTr T ). (59) 

LOOP FACTOR 



(b) 


Fig. 13. Meson loop diagrams and corresponding 
Bose-symmetry factors. 


11 Since (<t)q*= ( 0 ^) 0 *=0, the conditions (6£/8o)o~(6£/to a )o*=Q 
are identical. 

12 The modified Feynman rules for meson propagators attached 
to the axial-vector vertex [item (li) abovel follow directly from 
Eq. (59). The pion propagator immediately following the axial- 
vector-current-pion vertex is unregulated because go'^x appears 
in Eq. (59) without an accompanying gtT l d^ir B term. Similarly, 
the product of meson propagators following the axial-vector- 
current-pion-ff vertex is regulated as in Eq. (54) because the 
bilinear terms in Eq. (59) have the d)fference-of-products form 

o’d^ix— 


The counter terms proportional to D (2) , £ (2) , and F (2) 
perform the explicit subtractions in the loops illustrated 
in Figs. 12(a) and 12(b), just as C (2) provides the explicit 
subtraction in the basic vacuum polarization loop (see 
Appendix A for details). 

We are now ready to calculate the divergence of the 
axial-vector current in our cutoff theory. One way to 
do this is to proceed diagrammatically, as we did in 
Eqs. (10)-(14) in the spinor electrodynamics case. How¬ 
ever, because of the complexity of the <r model, this 
method will involve very complicated equations. There¬ 
fore, we will instead follow the second, more, succinct, 
method used in spinor electrodynamics. We note first 
that calculation of the axial-vector divergence in the 
regulated <r model by naive use of the equations of 
motion gives 

d^jff — — V 8v = — (pi 2 /go)n • (60) 

Extra terms on the right-hand side of Eq. (60) can arise 
only from diagrams which are so singular that the Ward 
identities break down. However, since we have cut off 
the photon and meson propagators, all virtual boson 
integrations are strongly convergent and cannot lead 
to singularities which are not present when the boson 
integrations are omitted. Thus, breakdown of Eq. 
(60) can only be associated with the basic axial-vector 
loops shown in Figs. 12(b), 12(d), and 12(f). (All other 
axial-vector loops have enough vertices, and hence are 
convergent enough, to satisfy the normal axial-vector 
Ward identities.) By explicit calculation, we have found 
that of these diagrams, only the axi al - vector-vec tor- 
vector triangle of Fig. 12(d) has an anomalous Ward 
identity, leading to the conclusion that, in the regulated 
<r model, the axial-vector-current divergence equation is 

— o/47r)(Ff'+F K f') 

XCF’M-F**)**,*. (61) 

This completes our verification that Eq. (4) is exact 
to all orders of the strong and electromagnetic couplings 
in the <r modd. 
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From Eq. (61) a number of consequences immediately 
follow. 

(i) We can check the consistency of our cutoff Feyn¬ 
man rules by verifying that Eq. (55) is really valid in 
the regulated theory. As in Eq. (46), we define 

- i f d*x e*>*(7>(*MO)))o, (62) 


and substitute Eq. (61) for w(x). Using Eq. (59) for 
//(#), and the canonical commutation relation 

Ed»T(x)+EWdW(x),T(0)l\ * 0 = 0 = -i5 3 (x), (63) 

we still find the result = — 1/mi 2 * "The relation 

between A f t '($) and the proper pion self-energy S T (g 2 ) 
in the cutoff theory is given by 


A f"(?) = 


11 11 / 1 1 \ 1 

f - ; SV)——+— -7—;- —W) 

\g 2 —Mi 2 q 2 — A 2 / 


2 2 -M i 2 ? 2 -Mi 2 


q 2 -IH 2 q 2 —tx i 2 


? 2 —Mi 2 


1 /I 1 \ / 1 1 \ 1 

F-S'(^)-W)-- p T ($ 2 )-— + ■ • • 

q 2 — Mi 2 \q 2 —tii 2 q 2 — A 2 / y? 2 —Mi 2 q 2 — A 2 / £ 2 —Mi 2 


1+S T (5 2 )(^ 2 —A 2 ) -1 


$ 2 -Mi 2 +S t (? 2 )(mi 2 -A 2 )($ 2 -A 2 )- 1 


(64) 


so that 


A,"(0) 


1-2*(0)A-* 

—Mi 2 —2*(0 )(mx 2 — A 2 )A ~ 2 9 



and therefore Eq. (49) still implies that 2 T (0)=0. 

(ii) In the absence of electromagnetism, Eq. (61) be¬ 
comes the usual PCAC equation d^j^= -(mi 2 /&o)?t. 
From this equation, it is straightforward to prove 18 that 
the coupling-constant, mass, and wave-function renor¬ 
malizations which make 5-matrix elements in the <r 
model finite also make all matrix elements of jp 6 and of 
G“i 2 /go)?r finite (i.e., cutoff-independent as A—»«>). In 
the presence of electromagnetism, the effect of the extra 
term in Eq. (61), as shown in I, is to induce an extra 
infinity in jp 5 which is not removed by the renormaliza¬ 
tions which make the S matrix finite. However, just as 
we found that ntoj 6 (x) is made finite by the fermion 
wave-function renormalization in spinor electrody¬ 
namics, we expect that, even in the presence of electro¬ 
magnetism, Gui 2 /goV will be made finite by the pion 
wave-function renormalization in the <r model. That is, 
we expect 

(mi 2 / go)(Zs T ) 1/2 = finite. ( 66 ) 

Since the pion field is multiplicatively renormalizable, 

T = (Z 3 T ) 1 / V :enorm , (67) 

to prove Eq. ( 66 ) we only need show that any particular 
nonvanishing matrix element of (mi 2 /&oV is finite. The 
natural choice is the vacuum to two-photon matrix 
element G T (kvk 2 ) f defined by 

(7(^i,ei)7(^2,e 2 )|(—MiV^oVfO) 

= r ei**e2* p et T *fi T (ki * £ 2 ) . (68) 


Precisely the same arguments leading to Eq. (23) show 
that 

G*( 0)—o/x, (69) 

proving Eq. ( 66 ). We are now free to let the cutoff A 
approach infinity, defining a renormalized matrix ele¬ 
ment G T (k 1 *^ 2 ), 

\imG*(k 1 -k 2 )=G*(k 1 -k 2 ) J (70) 


which satisfies the exact low-energy theorem 

#r( 0 ) = -a/T. (71) 

In Sec. Ill, we will explicitly check Eq. (71) to second 
order in the strong meson-nucleon coupling constant G. 

(iii) The low-eneigy theorem of Eq. (71) can be re¬ 
rewritten in a physically interesting form, as follows. 
We introduce the ir —>2y decay amplitude F*(k x*& 2 ) 
and the pion weak decay amplitude f T by writing 

<7(^ex)7(£2,e 2 ) I (□ 2 +M 2 V renorm |0> 

-(4*,ofe2*0" 1 /a ii**2 T 6 1 *^ 2 *^ ro F-(i,-i.), (72) 

and 

<*(ff) I iff 10) = (2?o)- 1 / 2 (-W)/,M • (73) 

Comparing Eq. (72) with Eqs. ( 68 ) and (71), we find 


-Mx a (Z.') 1/a 
-F t (0)=-, 

go M 2 * 



while taking the divergence of Eq. (73) and using Eq. 
(61) gives 

fr/^2 = { — Mi 2 /£o)(Z 3 t ) 1/2 

+(terms of higher order in a). (75) 


18 J. Bernstein, M. Gell-Mann, and L. Michel, Nuovo Cimento 
16, 560 (1960); Preparata and Weisberger (Ref. 9). 


Combining Eqs. (74) and (75) then gives the low-energy 
theorem relating the t—» 27 and ir weak decay ampli- 
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Fig. 14. (a) yB-y„-y fi skeleton triangle, (b) Second-order radiative 
corrections to the yt-yv-y p triangle in spinor electrodynamics. 


tudes, 

i ? '(0)-(-a/T)(^M 2 //0 

+ (terms of higher order in a ). (76) 

The fact that Eq. (61) is exact means that Eq. (76) is 
true to all orders in the strong interactions in the <r 
model. The experimental consequences of Eq. (76) are 
discussed in I. 

m. SECOND-ORDER CALCULATION 

We give in this section an explicit second-order calcu¬ 
lation to check our contention that Eqs. (2) and (4) are 
exact. Rather than calculating corrections to both the 
axial-vector and pseudoscalar or pion vertices and 
checking Eqs. (2) and (4) directly, we will check these 
equations indirectly by verifying the low-energy theo¬ 
rems (30) and (71) which they imply. In the case of 
spinor electrodynamics, we will calculate the second- 
order radiative corrections to the matrix element 
( 7 (^ 1 , 61 ) 7 ^ 2 , 62 ) \2itn$y$l/\0) f which arise from the six 
diagrams shown in Fig. 14(b). These diagrams [phis 
mass counter terms appearing in the basic 7s-7<r7 P tri¬ 
angle of Fig. 14(a)] make a contribution to 5(0) of 
order a 2 , which must, in fact, be zero for Eq. (30) to be 
correct. The vanishing of the a 2 term is clearly a test 
of the absence of a term proportional to aa 0 F^F T(> e^ Tf> 
in Eq. (2). 

Similarly, in the <r model we will calculate radiative 
corrections to (y(h,ei)y(jh,e 2 ) | (“MiVgoHO). It is con- 

“ i Gq^ 


Ytr 



Fig. 15. Nucleon bubble diagram, which in 
the <r model appears in the second-order radiative 
corrections to the yB-y<t^y p triangle. 


venient to rewrite this matrix element by substituting 
for fuhr the pion equation of motion obtained from Eq. 

(31), 

jLtlV = — □ V —^Go^75^ 

+X oPgtxr T+4go 2 7r(c' 2 +T 2 )] +Mo 2 tt . (77) 

The matrix element of DV makes a contribution to 
of order kvk 2t and thus can be neglected at 
ki-k 2 —0. If we work to second order in G 2 but to zeroth 
order in so that the physical pion and o masses re¬ 
main equal, the meson-meson scattering terms in Eq. 
(77) can also be dropped. Finally, let us recall that the 
effect of the counter term proportional to jLto 2 is to pro¬ 
duce a subtraction in the pion proper self-energy, giving 
S ir (0)=0. In particular, this means that the counter 
term in Eq. (77) combines with the nucleon bubble 
diagram involving — iGtfpyjp, shown in Fig. 15, to give 
a contribution to G*(ki-k 2 ) proportional to S^&i-fo), 
which vanishes at ki’k 2 ~0. Recalling that Go/go^nto, 
we may summarize the findings of this paragraph by 
the statement that to check the low-energy theorem of 
Eq. (71) to order G 2 } we need only calculate the matrix 
element ( 7 ^ 1 , 61 ) 7 (^ 2 , 62 ) \int^y^p\0) f omitting the bub¬ 
ble diagram of Fig. 15. The twelve diagrams which con¬ 
tribute have the form of those in Fig. 14(b), with the 
virtual photon line replaced by a virtual pion or a virtual 
<r line. These diagrams, plus mass counter terms in the 
basic triangle of Fig. 14(a), make a contribution to 
5*(0) of order G 2 a , which must vanish for Eq. (71) to 
be valid [Thereby verifying the absence of a term pro¬ 
portional to G 2 otoFt a F Tp e$a T p in Eq. (14)]. Thus we see 
that the second-order spinor electrodynamics and tr- 
model calculations will appear very similar. 

The calculations of the second-order radiative correc¬ 
tions to the triangle diagram will proceed in the follow¬ 
ing way. First, we calculate the renormalized quantities 
Vp(p,p f ), F b (p t p *), and5//(^) in spinor electrodynamics 
and the tr model, and substitute them into the 75 - 7 *- 7 ? 
skeleton triangle of Fig. 14(a). The constants 5(0) and 
5*(0) are determined by extracting the first nonvanish- 
ing terms of a Taylor series expansion of the amplitudes 
in the photon momenta k\ and k 2 . The Ward identities 
and integrations by parts are used to show that the self¬ 
energy corrections are exactly canceled by the unex¬ 
panded vertex corrections. The terms where an external 
momentum has been expanded from a vertex correction 
function are evaluated in two ways: by direct calcula¬ 
tion and by using a further integration by parts. Using 
either method, the sum of these terms is found to vanish. 
Therefore, the second-order radiative corrections to 
5(0) and S^O) are zero. 

Let fV 2) (A/0> Is {s) (pip% and S/ (2) (p) denote the 
renormalized second-order vector vertex, pseudoscalar 
vertex, and fermion propagator in either spinor elec¬ 
trodynamics or the <r model, as defined by Eqs. (25) and 
(28). (Explicit expressions for these quantities are given 
in Appendix B.) The matrix element which we want 
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is proportional to 

3R„=JdVT r [T 5 <»(r-fc 2 , r+i,)3/<»(r+*,) 

Xr,m(r+ 4 „ r)S/<®(r)r/«(r, r-k 2 ) 

xS/‘«(r-4i)]. (78) 

Since we are actually only interested in studying the 


part of Eq. (78) coming from the second-order radiative 
corrections, let us substitute 

W) =Y„+A ,{p,p'), 
WKP,P')=-t*+^{p,P'), (79) 

Sf ,(2 Kp) =t>-«-s(^)]-‘ 

into Eq. (78) and isolate the second-order part. This 
gives 


9TC<r P (2) = j d*rTT[_A5(r—k2 t r+ki)(r+k 1 —m)~ 1 ya(T—m)~ 1 yp(r—k2—tn)~ 1 (80a) 

+ 7 6 (r+fti —m)“ 1 S(r+ *i)(r+fti —- m)~ l y P {r-k t — m)” 1 (80b) 

+y 5 (r+k 1 —m)- 1 Aa{r+ki t r)(r —m)~ 1 y p (r—k 2 -m)~ 1 (80c) 

+yh{r+ki—tn)~ 1 y a {r—tny- 1 ' 2 (r)(T—tn)~ 1 yp(r—k 2 —tn)~~ 1 (80d) 

+75(r+ft 1 -m)- 1 7, r (r-m)” 1 A p (r, r-^ 2 )(r-fe-m)- 1 (80e) 

+y^r+k l —m)~ l y a {r—m)~^ P {r~k 2 —fny v L{r-k 2 ){r-k 2 -m)~ v ], (80f) 


with the six terms in Eq. (80) corresponding, of course, 
to the six diagrams in Fig. 14(b). To evaluate Eq. (80), 
we use the fact that although the integral over r is 
apparently linearly divergent, the linearly divergent 
parts of terms (80a)-(80f) vanish separately when the 
trace is taken. 14 This means that we can simplify the 
form of Eq. (80) by making separate translations of the 
integration variable in each of the pieces (80a)-(80f), 
as follows: 

(a) r->r+k 2} (d) r-»r, 

(b) r->r-k x> (e) r-»r, 

(c) r->r—k i, (f) r-*r+h. 

After making these translations, wherever 2 appears 


it has argument r, and the vertex parts A 6 , A*, and A p 
have r as the first argument. Next, we Taylor-expand 
with respect to k x and £ 2 , keeping only the leading term 
of order k x k 2 (because of the 75 , the terms of order 1, k Xy 
£ 2 , k x 2 , and & 2 2 vanish identically). Defining the vertex 
derivatives A5,f(r) and A,r,{(r) by 

A 6 (r, r+a) =A5(r,r)+a f A 5 ,f(r)+0(a 2 ), 

A,(r, r+a) =A a (r > r)+a*A a ,i(r)+0{a 2 ), 

we find 

9TV 2) = 9TCW 2) +9TW 2) 

+ (terms of higher order in momenta), (82) 

with [we abbreviate s=(r— m)* -1 ] 


'MA* P i2) =k X tk2 T< M A tT*p {2) 


and with 


jd*r T+A'>(r ) r)s(~k 1 )sy ff s(—k2)sy fi s+y 6 s2(r)sy ff sk 1 sy p sk2S+y 6 sA< r (r,r)sk 1 sy p sk2S 

+y&s(—k x )sy tr s^(r)sy p sk 2 S+y 5 s(-'k x )sy tr sA p (r ) r)sk 2 S+y & s(—k x )sy tr s( -fe)nv* s ( r ) J ] > (83) 


< M Bap ^k^k2 T m B ^p {2) 


jd A r r Ti£kitA5,i(r)sy tr s(—k2)sypS+yss(—kit)A < r l s(r)sy p sk2S+y&s(—k 1 )sy tr s(—k2 T )A PlT (r)s ']. (84) 


The tensors 9TLi{T<r P (2) and JF^r^ 25 must both have the 
structure 


3E J 4fT<rp (2) = e£ T(rp 3E J i (2) , 
9RBfra P (2) = e fT(rp ^ (2) , 



and therefore are completely antisymmetric in their 
indices. Clearly, each individual term in the sums in 
Eqs. (83) and (84) will also have the form of Eq. (85), 


14 Simple 7 -matrix counting shows that trace of each of the terms 
(80a)-(80f) is proportional to the fermion mass m, and is therefore, 


on dimensional grounds, at most logarithmically divergent. Be¬ 
cause of the over-all factor the logarithmic divergences vanish 
as well, so that the integrals of terms (80a)-(80f) actually converge. 
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once the integration over r is performed. This fact evaluating the trace gives 
greatly simplifies the following calculation. 

To evaluate 9R J ifT<rp <2 \ we eliminate the vertex parts ^ (2) 
from Eq. (83) by using the Ward identities 


= -4 i / d*r {m 2 E x {t*) 


A 6 (f,f) = (l/ 2 m)[ 7 6 2 (r)+ 2 (r) 75 ], 

A x(r ) r) = -d x S(r), (86) 

= (r— m)“ 1 7 x(r—m) _1 . 


Integration by parts in the terms involving d x 2(r) shifts 
the derivatives to the free propagators (r— m) -1 . The 
resultant amplitudes all involve a trace containing 7 
matrices, r, and 2(r). By anticommuting r through the 
7 matrices and using the total antisymmetry of 
3JLifT<rp (2) in its tensor indices, we find 


SHkfr a P (2) = 




—3 


XTr[i(75S(r)+S(r)76)7«7.r7r7j 


-mrlZMr 2 ) -Z)»(r J )]} (r>-m 2 )- 3 . (90) 
From Appendix B we find 


E x {r 2 )=-~ 

16t 2 


e 2 

-G 2 


X 


i 

zdz 


o 


—2m 

—rh{ 1 — z)+zm 2 +(l — z)/i 2 ’ 


1 

Z> 2 (r 2 )-Z> 3 (r 2 )=- 

16tt 2 


e 2 

-G 2 



X 


r 1 

' zdz 
0 


2(1-2) 

—r%(l — s)+sm 2 +(l— z)/x 2 * 


+ (r 2 —m 2 )~ 4 Tr[(r+m)S(r)(r+m)757«7*r7T7p] 

- (r 2 — m 2 )“ 4 8r,r Tr[2(r)(r+m)757f7T7p]}. (87) 

On substitution of the general expression S(r)=^4(r 2 ) 
- \-rB{r 2 ) and use of symmetric integration in r, we find 
that the right-hand side of Eq. (87) vanishes, implying 

9TLl f r,rp (2) =0. 

The evaluation of 9TC£f T «P (2) in Eq. (84) will be done 
using two different methods, each giving zero. The first 
method involves a direct calculation of the integrals. 
We recall that each term of Eq. (84) is totally anti¬ 
symmetric in the tensor indices, once the integration 
over r is performed. Using this total antisymmetry and 
reversing the order of the y matrix products in the third 
term in Eq. (84) yields 

arlfif r., <2) = jd*r Tr{A6,{(r)*7^( _ 7 t)^7p j 

+76jC-A.. t (f)+A., t (r) B ]i7 P i7 T 5}, (88) 

where A ff .f^ ) (r) B is obtained from A,r (f ( 2 ) (r) by reversing 
the order of all 7 -matrix products. From Appendix B, 
we find the expressions 


where the upper (lower) entry in { } refers to spinor 
electrodynamics (the tr model), and where / 1 2 is the 
virtual photon, pion, or <r mass. Inserting Eq. (91) into 
Eq. (90), we find that the integral in Eq. (90) is pro¬ 
portional to /(ju 2 /m 2 ), with 

r 1 r°° udu 1— u(l— z) 

/(o)s / zdz / --—--. (92) 

Jo Jo (m+1) 3 uz { 1 — z)+s+(1—s)o 

We show in Appendix C that this integral is identically 
zero, giving 9Ti^( 2) =0. 

The second method used to evaluate 9Tisf T(r p (2) in¬ 
volves the use of an integration by parts. Since the 
derivative on the vertex function removes the effect 
of the renormalization constants, the three terms in Eq. 
(84) may be written diagrammatically as shown in Fig. 
16(a). In the first term in Eq. (84), we use Eq. (86) to 
replace (r —m)~ l y p (r —m)- 1 by —d p (r—m)~ 1 ) and then 
integrate by parts, using the total antisymmetry of the 
amplitude to drop the terms in which the derivative 
acts on the propagators adjacent to k x and 7 *. This 
operation has the effect of replacing the left-hand dia¬ 
gram in Fig. 16(a) by the diagram in Fig. 16(b). The 
expression for 9 TC£{t<tp ( 2) becomes 

3frc^p <a) “fe f Jfes T 3fTCfi frap (2) 


As.fM =757{£i(0+75r{£2(0, 
Aa^)=y^iDi{t 2 )+ya<rrD2{r 2 )+ry a y i Dz(r z ) 

+g*&i(r 2 )+r*rsD5(r 2 )+r a rsrD 6 (r 3f ), (89) 

A«r,fW A =7^'^i(r ! )+r7<r7^2(r 2 )+7?7^3(r 2 ) 
+g*tDi(r 2 )+r<rr i D5(r 2 )+r a r i rD6(r 2 ) , 

where Ei f E 2 , D\, ■ ■ ■, are simple integrals over Feynman 
parameters. Substituting Eq. (89) into Eq. ( 88 ) and 


=J dV Tr[jy 5 $( -ki)skz T A a , T (r)s( - 7 p )s 

+y$s(—k x *)A ff 'S{r)sy p sk2S 

+y5s(-ki)sy ff s{—k 2 T )A PMr (r)s2- (93) 

Each term in the sum of Eq. (95) involves a trace 
containing 7 matrices, r, and the function A,r. f (f). By 
anticommuting r through the 7 matrices and by using 
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>5 * 



Fig. 16. (a) Diagrammatic representation of 311**/** in spinor 
electrodynamics, (b) Diagram obtained from the left-hand loop 
in (a) by integration by parts. 

the antisymmetry of 9TC£f T(r p C2) in its indices, one finds 

31W.p (2) =y d*r (r 2 —m 2 )~ 2 Tr{75A IiP (r)|[7 £ ,7 T ]} . (94) 

(The terms coming from the anticommutators exactly 
cancel because of the antisymmetry.) Substituting Eq. 
(89) into Eq. (94) immediately gives 3rc£f T(rj » C2) =0. It 
is not actually necessary to have the detailed form of 
Eq. (89) to see that Eq. (94) vanishes. Referring to 
Fig. 17(a), we see that in spinor electrodynamics A<r, p 
has the form 

A* tP =yaA<rJ(r)y a , (95) 

and when Eq. (95) is substituted into Eq. (94), the sum 
over virtual photon polarization states a cancels to 
zero. Similarly, from Fig. 17(b) we see that in the <r 
model Ahas the property 16 

A I . p (r)=-t 7 6 A I . p (r)i 7 6 , (96) 

which again implies that Eq. (94) vanishes. In this case, 
the virtual pion term is exactly canceled by the virtual 
a term. 4 

IV. SUMMARY AND DISCUSSION 

We summarize our results and briefly compare them 
with the recent findings of Jackiw and Johnson. 2 We 
have considered two models, spinor electrodynamics and 
a truncated version of the a model. In each model, we 
have studied a particular axial-vector current: in spinor 

-iM,^- -f-\>.r a 

* <□) 

-IG«^- -j -IGo ^ G.)^- -j<i 

y<r -Yp 

(b) 

Fig. 17. Diagrammatic representation of A,., in (a) spinor electro- 
_ dynamics and (b) the <r model. 

15 Comparing Eq. (96) with Eq. (89), we see that in the <r model, 
we must nave Da — Di=Q. This can be verified from the explicit 
formulas of Appendix B. 


electrodynamics, the usual axial-vector current of Eq. 
(1), and in the <r model, the Polkinghorne 16 axial-vector 
current of Eq. (3), which, in the absence of electro¬ 
magnetism, obeys the PCAC condition. By introducing 
cutoffs in the boson propagators we have shown that, 
in the presence of electromagnetism, the divergences 
of our axial-vector currents are modified in a simple well- 
defined way, to all orders of perturbation theory. The 
modification consists of the addition of a simple nu¬ 
merical mutliple of (ato/ 47 r)Ff‘ r F Tp e f(rTp to the naive 
axial-vector divergence. (The naive divergence is the 
one obtained by formal manipulation of equations of 
motion when subtleties arising from the singularity of 
local-field products are neglected.) From the anomalous 
divergence equations we obtained simple low-energy 
theorems for the vacuum-to-2y matrix element of the 
naive divergence. Although these theorems were derived 
with the cutoff A finite, we argued that in both models, 
even in the presence of electromagnetism, the naive 
divergence is a finite (cutoff-independent) operator as 
A—»oo. 17 This allowed us to pass freely to the limit 
A—»°o, obtaining a low-energy theorem for the re¬ 
normalized naive divergence operator. This low-energy 
theorem was checked explicitly to second order in 
radiative corrections in a calculation using only renor¬ 
malized (cutoff-independent) quantities, verifying our 
contention that no subtleties were involved in the A—»°° 
limit. 

Thus, in our calculation, use of the cutoff has been an 
artifice, and the cutoff does not appear in the physics. 
As is made clear by the discussion of Eqs. (66)-(70), 
this important feature can be traced directly back to 
the following property of the two axial-vector currents 
which we have studied: The naive divergences of the two 
axial-vector currents , as well as the axial-vector currents 
themselves , are midtiplicatively renormalizable . We con¬ 
jecture that in any renormalizable theory field with an 
axial-vector current satisfying this property, arguments 
analogous to those of this paper can be carried through. 

With these comments in mind, let us examine the con¬ 
clusions of Jackiw and Johnson. Jackiw and Johnson 
treat the electromagnetic field as an external (non- 
quantized) variable, but allow quantized strong inter¬ 
actions of the spinor particles, so their calculation 
applies, for example, to the a model. Rather than con¬ 
sidering the Polkinghorne current of Eq. (3), Jackiw 
and Johnson take as the axial-vector current the fer¬ 
mion part $ypy&p alone. They find that the effect of 
the strong interactions on the anomalous divergence 
term is ambiguous, and depends on precisely how a 
cutoff is introduced. It is easy to see that (even in the 
absence of electromagnetism) the current yfy p y&p is not 
made finite by the usual renormalizations which make 

16 J. C. Polkinghorne, Nuovo Cimento 8, 179 (1958); 8, 781 
(1958). 

17 That is, multiplication by the usual external-line wave- 
function renormalization factors makes Feynman amplitudes of 
the naive divergence finite. 
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the S matrix in the a model finite, and, by a reversal 
of the Preparata-Weisberger argument, 18 this means 
that the naive divergence of this current is not multi- 
plicatively renormalizable. In other words, the axial- 
vector current considered by Jackiw and Johnson and 
its naive divergence are not well-defined objects in the 
usual renormalized perturbation theory; hence the 
ambiguous results which these authors have obtained 
are not too surprising. 

The presence of two different axial-vector currents 
in the <r model poses, however, the following question: 
Which current should we take as the prototype for the 
physical axial-vector current? The answer is that there 
are two arguments in favor of using the full Polking- 
horne current, rather than its fermion part alone, as 
the current to which the semileptonic weak interactions 
couple: (i) We want the physical axial-vector current to 
satisfy the PCAC hypothesis. Although PCAC does not 
require that the divergence of the axial-vector current 
be a canonical pion field (as is the case for the Polking- 
horne current), it does require that it at least be a 
smooth interpolating field for the pion. However, a non- 
multiplicatively renormalizable operator, such as the 
divergence of the current will not be a smooth 

operator and therefore is not a pion interpolating field 
suitable for PCAC arguments, (ii) When charged fields 
and charged currents are added to the model, we want 
the axial-vector currents to satisfy the Gell-Mann 
current-algebra hypothesis. As Gell-Mann and Ldvy 8 
have shown, the Polkinghorne axial-vector current is 
the one which obeys the current algebra. The fermion 
part alone does not satisfy the current algebra. 

To summarize, then, in the a model (and also in the 
neutral-vector-meson model, which behaves like spinor 
electrodynamics), the current which is a prototype for 
the physical axial-vector current has a simple anoma¬ 
lous divergence term in the presence of electromag¬ 
netism. Other axial-vector currents can be defined which 


do not have simple anomalous divergence behavior, but 
these currents are not good prototypes for the physical 
current. 
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APPENDIX A 

In the first part of this Appendix, we give explicit 
renormalized expressions for the diagrams depicted in 
Figs. 12(a) and 12(b), and we show that they satisfy 
the usual axial-vector Ward identities. In the second 
part, we demonstrate that the axial-vector-photon- 
photon-meson box diagram of Fig. 12(f) satisfies the 
usual axial-vector Ward identity. 19 

A. Meson and Axial-Vector-Current-Meson Loops 

In order to give unambiguous values to the divergent 
loops which we encounter, we define the symmetric, 
cutoff integral symbol 

f - (AD 

J[C,M 2 1 

Equation (Al) is a shorthand for the following sequence 
of operations: (i) We do the usual Dyson rotation to the 
Euclidean region; (ii) we integrate symmetrically over 
the angle variables around the center r=C ; and (iii) 
we integrate the Euclidean magnitude squared x= —r 2 
up to an upper limit of M 2 . Using this symbol, we define 
unrenormalized meson and axial-vector-current-meson 
loops as follows 20 : 


Meson Loops 



T 99 (k) = -i2 9 (k), T vr (k)=-i2 v (k), 



d A r / 1 1 

-Tr(- 

io,m 2 i (2t) 4 \r—wor— k —wo 




Mk)=iG<? 


* d*r 
[0, m a j (2 t) 4 


Tr (r. ; 


1 1 

- n —T 

—wo r-k- 



18 Preparata and Weisberger (Ref. 9), Appendix C. 

19 For a derivation of the Ward identities satisfied by the general spinor loop coupling to scalar, pseudoscalar, vector, and axial- 
vector external sources, with full SU» structure, see W. Bardeen (to be published). The results of Appendix A are a special case of the 
general problem discussed by Bardeen. 

20 In this Appendix, Z y (&) and Z<r(&) denote, respectively, the pion and a proper self-energies, which were denoted by E*(k 2 ) and 
Z*(k 2 ) in the text. 
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r d*r / 1 

= / -Tr( >5- 

J [o,m 7 ] (2t) 4 \ r—fti— 


-75- 


! - -). 

2 —wo r—tnJ 


wo r+ft 2 —w 0 r—mo 


T aaaikxykz) — — 2Gq 3 


10,M 2 ] 


'V / 1 1 1 \ 

— Tr(-), 

t) 4 \r—fti—w 0 r+fe—w 0 r—w 0 / 

iV / 

— Tr(- 

!t ) 4 \ 


d*r / 1 1 

= -Go 4 / -Tr( 7 b- 

lO,Af a l (2 t) 4 \ r+fti+Z^+fts - Wo r+/2i+/22 — Wo 


X7« 


-75 -|-five permutations^, 


r+fti—wo t —wo 


/ 


T 4raa a(ki,k2,kz) = —Go 4 




Tr 


i'- 


1 


1 


lo 


,Af 2 J (2t) 4 \r*Tftl*Tft2*Tfts —trio T-\-k\-\-kz — Wo 


f ^ r ( 

T vv<ri =Go 4 / -Tr( 7 s 

J[0,M 3 } (2t) 4 \ 


1 


11 \ 

X-|-five permutations J, 

r+fti— wo r—mo / 


1 


-75- 

r*Tft \-\-ki~\-kz —Wo r+fci+fe—Wo 


11 \ 

X-|-five permutations J. (A2) 

T+ki —wo r—wo / 


j 4^(^) — —Go 


Axial-Vector-Current-Meson Loops 
d A r 


10.AT 2 ! 


= —Go 


[0, Af*« '*1 


dV / 1 1 \ iC 

*-Tr( 47^,75-7s-H— 

(2t) 4 \ r—wo r—ft—wo/ 0 

dV / 1 1 \ 

—" Trl §7^.75-75 ), 

(2t) 4 \ r—wo r—ft—wo/ 


(A3) 


r d A r / 1 1 1 \ 

= 2G 0 2 / --Tr( 75 - iy SYi> ---) 

J [o, ji/ s i (2t) 4 \ r—fti—wo r+ft 2 —wo r—wo/ 


The loops T aay T WW1 A w and B „ are at least linearly 
divergent, and so specification of the center for sym¬ 
metric averaging is essential. The three-meson and four- 
meson loops, on the other hand, are not linearly diver¬ 
gent, and so the origin of integration in these loops may 
be freely translated. (Terms which vanish as M —>00 
will be picked up from the translation, but may be 
ignored. Similarly, the two different expressions which 
we have given for A ? are not precisely equal, but differ 
by terms which vanish as M —»°°.) 

We have chosen identical upper limits M 2 for all loop 
integrals except A^ where the upper limit has been 
taken as MV' 2 (e is the base of natural logarithms). 
These choices of upper limit guarantee that the un¬ 
renormalized loops satisfy the usual axial-vector Ward 
identities. For example, in the case of A m a straight¬ 


forward calculation shows that 

r d A r / 1 1 \ 

^(~Go) TTT: Tr ( 2TVY6-n— -) 

J [ 0 , Af a J (2 t) 4 \ r —Wo r—ft —Wo/ 


Wo 


with 


/(*)= 


Go 


€2.W-Z,(0)]+*Go/(*), (A4) 


d*r / 1 1 \ 

-Tr(-) 

,m ! 1 (2t) 4 \r—k—mo r—mo / 

r d*r / 1 1 1 \ 

/ -Tr - k - k -) 

J[ o,Ar»i (2 t) 4 \r—k—mo r—mo r—mo/ 


— 2imok 1 




(A5) 
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Therefore, A M (k), as defined in Eq. (A3), satisfies the 
usual axial-vector Ward identity 

k*A&) = -(tno/Go)i& r (k) -2.(0)]. (A6) 

Similarly, the axial-vector triangle B^ki y kz) satisfies 
the Ward identity 

— (ki-\-ki) ft Bf t (k\jki) = (fno/Go)T irira r 

+&'(kt)-i2 w (kd. (A7) 


from the second-order ir and a self-energy diagrams to be 


mo t Go t , 

tM 2 \ 


Z)<»=-In 


(4tt) 2 ' 

\m^i 

-2Go 2 , 

(M 2 \ 

)+& m . 

£(2)=-U 


(4tt) 2 ' 

\m^i 


r 

d*r 

f <®=Z.(0)=*Gtf 



J to, jw a i (2 tt)' 


(A9) 


The axial-vector-current-three-meson box diagram of 
Fig. 12(b) is superficially logarithmically divergent, but, 
because Tr{7*7^(ljYOr(l/Y6)r(l,75)r)=0, this dia¬ 
gram actually converges, which is why we have not 
included it in the list of unrenormalized axial-vector- 
current loops in Eq. (A3). Introducing a cutoff at 
%■=• M 2 (which changes this diagram only by terms which 
vanish as M —) and then taking the divergence yields 
a linear combination of three- and four-meson loop 
diagrams, all with cutoff at x= M 2 as in Eq. (A2) 
Some of these loops may occur with the loop integration 
variable translated by a finite amount with respect to 
the standard forms in Eq. (A2), but as we have pointed 
out, this does not matter because none of the three- or 
four-meson loops is linearly divergent. We conclude, 
then, that the axial-vector-current-three-meson box dia¬ 
gram and the meson loop diagrams of Eq. (A2) satisfy 
the usual axial-vector Ward identity. Identical reason¬ 
ing shows that the axial-vector-current-four-meson 
pentagon, which is superficially convergent, is related 
by the usual Ward identity to a linear combination of 
the meson box diagrams of Eq. (A2) and to the conver¬ 
gent meson pentagon diagram. Note that because the 
axial-vector-current box and pentagon diagrams are 
finite , their Ward identities will necessarily involve linear 
combinations of the meson triangle and box diagrams in 
which the logarithmic divergences exactly cancel. 

Having defined the unrenormalized loop diagrams 
and.shown that they satisfy the correct Ward identities, 
we next construct the renormalized loops and show that 
they, too, satisfy the proper Ward identities. The re¬ 
normalized meson scattering and axial-vector-current 
loops are obtained from the unrenormalized loops by 
adding appropriate matrix elements of i£ counter and 
jp 6 counter with [see Eqs. (57) and (59)] 

counter =n( 2 ^4<r 2 +4:(Go/mo)<r(a 2 +ir 2 ) 

+ (Go/ mo 2 ) (a 2 + tt 2 ) 2 ]+ C2 > [( tbr) 2 +( da) 2 ] 

+hF^Z(2mo/Go)a+a 2 +ir 2 l, (AS) 

jp 6 counter =E (2) [o'5^7r—^5^0*+ (mo/Go)dfiTT ^. 

The subtractions Z> (2) , E C2) , and F C2) are determined 


XTr /75 - 75 -) . 

\ r—mo r—mo/ 

The finite constants £> (2) and _fi C2) are adjusted to give 
the physical pion mass and the meson-meson scattering 
constant the specified values / 1 2 and X. The renormalized 
loops, denoted by a tilde, are given by 

f*=T a +i(mo/Go)FM=0, 
l v (k) =1 v (k) -F< 2 > -£ 2 E< 2 >, 

2*(k) =J, v (k) - F < 2 > -8Z)< 2 >-£ 2 E< 2 >, 
T„ 9 =T m +U(G*/m)Dto , 
F <r aa~F <r <rv-\-24:i(Go/mo)D t2 ^ , (A10) 

T m * = T mr +2Ai(Gt/mf)D™ , 

fa < raa = Taa<r<r+2^i(Go 2 /m 0 2 )D^ } 

7 Trv<r<r = 'F vir<r<r -\-&i(Go*/ mo 2 )Z^ 2) , 

Ajfi) =Ap(k) -\-i(mo/ Go)jE^kp , 

S ll (ki. 3 kt)=B ll (ki J kt)+iE^(kt—k{), l . 

It is straightforward to verify that all of the tilde quan¬ 
tities approach finite limits as M—>oo, showing, as re¬ 
quired by chiral invariance, that the subtraction con¬ 
stants determined from the second-order loops make the 
triangle and box diagrams finite as well. 

From Eqs. (A6), (A7), and (A10) we find that the 
renormalized loops A? and satisfy the desired Ward 
identities 

k^AjtQt ) = —i(mo/Go)^ir(k ), 

—(kiA-k^B^kijkz) = (tno/Go)T T Tv (All) 

-\-iE a (kt) —z'2 t (^i) . 

Next, we recall that the axial-vector-current box and 
pentagon diagrams and the unrenormalized meson¬ 
scattering triangle and box diagrams are related by the 
usual Ward identities. This implies that the same Ward 
identities are satisfied by the axial-vector box and 
pentagon and the renormalized meson-scattering dia¬ 
grams. The reason is that the counter terms which sub¬ 
tract the divergences in the meson loops necessarily 
occur in each Ward identity in the same linear combina¬ 
tion as the logarithmic divergences, and therefore cancel 
among themselves in the Ward identity in the same 
manner as the logarithmic divergences do. This com- 
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pletes the proof that the renormalized basic loop dia¬ 
grams satisfy the normal axial-vector Ward identities. 

B. Axial-Vector-Current-Photon-Photon— 
Meson Loop [Figure 12(f)] 

The Ward identity for the axial-vector-current- 
photon-photon-meson loop of Fig. 12(f) relates it to a 
linear combination of the diagrams shown in Fig. 12(e), 
plus a possible anomalous term. To see that no anoma¬ 
lous term is actually present, we note that: (i) Gauge 
invariance makes the diagrams of Figs. 12(e) and 12(f) 
finite, so no renormalizations are needed to make the 
various terms in the Ward identity well defined; (ii) 
a possible anomalous term must be gauge-invariant 
with respect to both photon indices, must be odd in 
mjo, and must have the dimensions of a mass, since all 
of the other terms in the Ward identity have these 
properties; (iii) a possible anomalous term can have no 
singularities in internal masses or external momentum 


variables, since taking an absorptive part e limin ates the 
superficially logarithmically or linearly divergent loop 
integrals, and therefore the absorptive parts satisfy the 
usual Ward identities. According to (ii), a possible 
anomalous term must have the form 

anomalous term 

<^fnaF\Fi/g(tnoj external momenta), (A12) 

where F i and Ft. are the two photon field strength 
tensors and where g has the dimensions of (mass) 2 . 
However, because of the division by g, Eq. (A12) neces¬ 
sarily has singularities, and therefore (iii) forces the 
anomalous term to be zero. 

APPENDIX B 

We state here the renormalized second-order vector 
vertex, pseudoscalar vertex, and fermion propagator 
used in the calculation of Sec. HI. 



The quantities c , N x , N , Pi, Ps, /, gi, and gt are defined 
as follows: 

Spinor Electrodynamics 

c=e, 

N\= — 2m 2 7x—2[(1 —z+yz)p , —yzp] 

X Yx[(l — yz)p — (1 — y>£'U+4w[(l - 2 yz)p\ 

+(1 — 2z-\-2yz)p \~\, 

iV=4w 2 —4[(1 — yz)p — (1— y)zp f ~\- (B2) 

XUX-z+yzW-yzp'l+lmip-p') , 

P,=s 2 +2z-2, P s = l-2«, 

/= 1 , 

gi=^m—p , gt—4m—2p. 


<r Model 

c =G , 

N\=— 2m 2 7x—2[(1— yz)p— (1— y)zp’~] 

XYx[(l —z+yzW—yzp "}, 

N=-2m(p-p'), (B3) 

Pi=z 2 —2z+2, Ft=-(l-z)\ 

f= o, 

£i=-£, g2 = —2£. 


APPENDIX C 

We show that the integral 



udu 1— u(l— z) 

(w+1) 3 z)+z+(l— z)a 


(Cl) 
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is identically zero. We begin by observing that 1(a) is 
analytic in the a plane, apart from a cut along the real 
axis from 0 to — °°. The discontinuity across this cut 
at a = —A is proportional to 



udu 

(u+iy 


Xll-u(l-z)]6(i4Z+z/(l-z)-A) 



A/u+i) z(l—z)[yl —( J 4 + l)s][(2+ J 4)z—.<4] 
dz - 


[_Z^ + ^(l_z)33 


1 


.A/u+D d iz*lA-(A + l)z¥ 
dz — 


2A 


dz 


[-zH-^Cl-z)] 2 ) 


= 0 , 


(C2) 


which means that 1(a) is an entire function. Further¬ 


more, for Rea> 0, 


\I(a)\< f dz 

J o 


>00 


udu 

z-\-uz(l — z) 

• 

(w+l) 3 

uz(l—z)+z+(l—z)a 


and 
7(0) = 2 


.00 


udu 


H °° udu 1 

-- 

(w+1) 3 2 

dz 


=- (C3) 


0 (u + 1) 3 Jo u(l—z) + l 


>00 


udu 1 1 

=-=0. (C4) 


o (u+iy 2 2 


Since Feynman integrals like Eq. (Cl) never lead to 
functions of exponential type, Eqs. (C1)-(C4) show 
that 1(a) = 0. 




288 


Adventures in Theoretical Physics 


PHYSICAL REVIEW D 


VOLUME 4, NUMBER 11 


1 DECEMBER 1971 


Comments and Addenda 


The Comments and Addenda section is for short communications which are not of such urgency as to justify publication in Physical 
Review Letters and are not appropriate for regular Articles. It includes only the following types of communications: (l) comments on 
papers previously published in The Physical Review or Physical Review Letters, (2) addenda to papers previously published in The Physical 
Review or Physical Review Letters, in which the additional information can be presented without the need for writing a complete article. 
Manuscripts intended for this section should be accompanied by a brief abstract for information-retrieval purposes. Accepted manuscripts 
will follow the same publication schedule as articles in this journal, and galleys will be sent to authors. 


Low-Energy Theorem for 7 + 7-*7r + 7r + 7r 

Stephen L. Adler,* Benjamin W. Lee,f and S. B. Treimanf 
National Accelerator Laboratory, Batavia, Illinois 60510 

and 

A. Zee 

Institute for Advanced Study, Princeton, New Jersey 08540 
(Received 10 September 1971 ; revised manuscript received l October 1971) 

We use the hypothesis of the partially conserved axial-vector current (PCAC) to show that 
the matrix elements for y +7— 7r° + 7r () +7r° and y +y-*7r° +jt + vanish in the soft-7r° limit. 
This, combined with photon gauge invariance, implies low-energy theorems relating these 
matrix elements to the matrix elements for y + y— 7r° and y— 7r° + tt + + tt~. Since the magni¬ 
tude of the former is determined by the 7r° lifetime, while the ratio of the latter to the for¬ 
mer is determined in a model-independent way by isospin and low-energy-theorem argu¬ 
ments, a model-independent prediction for the y +y~- ir-hir + iT amplitude can be given. Our 
results agree with those of Aviv, Hari Dass, and Sawyer in the neutral case, but not in the 
charged case. We give a diagrammatic and effective-Lagrangian interpretation of our for¬ 
mulas which explains the discrepancy. 


The reaction y+y —tt + tt+ tt is of interest, both 
because it may be observable in electron-positron 
colliding-beam experiments, 1 and because it is 
relevant to theoretical unitarity calculations 2 of a 
lower bound on the decay rate of ifj - p, + p,“ In re¬ 
cent papers, Aviv, Hari Dass, and Sawyer 3 and 
Yao* have applied effective-Lagrangian methods 
to calculate the matrix elements for the neutral 
and charged cases of y +7 — 77 + 77 + 77 . The fact that 
Refs. 3 and 4 are in disagreement has prompted 
us to repeat the calculation by standard cur rent - 
algebra-PCAC methods. 5 Our results agree with 
Ref. 3 (but not with Ref. 4) in the neutral case 
y+y — 7T 0 + 7T° + 7T° f and disagree with both Refs. 3 
and 4 in the more interesting charged case 7 + 7 
— 7T° + tt' 1 ’ + 7T“. After briefly discussing our method 
and results, we explain the reasons for our dis¬ 
agreement with the earlier calculations. 

We begin with the simple, but powerful observa¬ 
tion that the matrix elements 

aiKyffc*)+y(fc 2 )^ TT°(q 0 )+v*{q+)+J) 

and 

4 


311 1900 ^ 311(7 (*,) +7 (k 2 )- *°(q 0 ) + TT°(q') + 7T°(^)) 

vanish in the single-soft-7T° limit q 0 — 0, with the 
remaining two pions held on the mass shell. To 
see this, we follow the standard PCAC procedure 6 
of writing the reduction formula describing 3K 04 ” 
or 9K 000 with the tt° off the mass shell, and then re¬ 
placing the tt 0 field by the divergence of the axial - 
vector current (Af//)" 1 3 x fF 5 \ [The normaliza¬ 
tion constant /is given by /-/ ff /(V2"Af ff 2 ) = 0.68Af ff , 
with/ ff the charged-pion decay amplitude.] Be¬ 
cause the corresponding axial charge F 5 com¬ 
mutes with the electromagnetic current, no equal- 
time commutator terms are picked up when the 
derivative d x is brought outside the T product in 
the reduction formula. Integration by parts then 
makes the derivative act on the 7 t° wave function, 
producing a factor q oX . Thus both 3R 0+ ' and 3R 000 
are proportional to q 0 , and since they contain no 
pole terms which become singular as q 0 — 0, they 
vanish in this limit. Note that this argument is 
unaltered by the divergence anomaly 7 in 
since when fF 5X is the only axial-vector current 
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present, its divergence anomaly vanishes when 
the associated four-momentum q 0 vanishes . 8,9 

In addition to the soft- 7 r° limit which we have 
just derived, we know that 311 <H " and 3H 0O ° must be 
gauge-invariant. That is, they are bilinear forms 
in and c 2 (the polarization vectors of the two 
photons) and vanish when either is replaced by 
k x or c 2 is replaced by k 2 . We can now invoke the 
standard lore of current-algebra low-energy theo¬ 
rems , 5 which tells us that since we know three in¬ 
dependent pieces of information about the low-en¬ 


ergy behavior of 311 and 3R 000 (the q 0 — 0 limit, 
gauge invariance for photon 1 , and gauge invari¬ 
ance for photon 2 ), we can determine 3R 0+ “ and 
311 000 from their pion-pole diagrams up to an error 
of order 0(^i k 2 ) at least . 10 In particular, the 
terms in 3TC CM '“ and 3R°°° quadratic in the momenta 
k i, k 27 Qo> and qjq%) are completely deter¬ 

mined. The relevant pion-pole diagrams are illus¬ 
trated in Fig. 1. The pion-pion scattering ampli¬ 
tudes which appear are evaluated from the current 
algebra expression 11,12 


911(77''- 77*(g a ) + ir'lqj + ir'fa)) = if~ 2 { 6^5^ [(?„ + q c f - M 2 ] + 6 bd 6 ac [{q b + q„) 2 - M 2 ] + 6^6^, [{q c + q d ) 2 - M 2 \ 

-x[(q b + q c ) 2 + (q b + q d f + (q c + q d f - + 6 c<1 6^,)} , (la) 


where x is a parameter proportional to the isotensor component of the u o term” and is related to the 7 = 0 
pion-pion S-wave scattering length a 0 by 

a 0 = (7/327T)/“ a Af ff (l-4x). (lb) 

The 7 + 7 — 7 t° and 7 — n 0 + tt*+ tt~ amplitudes are expressed in terms of coupling constants F* and F 3n defined 
by 


3Il(7(fe 1 )+7(fe 2 )-77“)=ife“fe 2 B €?€ 2 6 € aar6 F r , 

Jrc(y(fe,) - 77 ° + 77 + (9 + ) + 77 ~(qj) = ik?e f q^qU aByS F 3 " . 


( 2 ) 


The coupling constant F" is related to the 7 r° lifetime by 13 


r,.- 1 - (i»#.Ve47r)(^'J». 

Comparison with experiment gives |.F ff ) =(n'/j7)(0.66±0.08JW ff )' 1 , with a the fine-structure constant. While 
the coupling constant F 3 * has not been measured, both the theory of PCAC anomalies 14 and model-indepen¬ 
dent isospin and low-energy-theorem arguments (see below) predict 

eF*«=f- 2 F\ e = (47Ta) 1/2 . (4) 


Combining Eqs. (1) and (2) with the appropriate propagators to form the pion-pole diagrams, and adding 
the unique second-degree polynomial which guarantees gauge invariance and vanishing of the matrix ele¬ 
ments as q 0 ** 0, we get the following predictions for 311 t>H and 3R 000 : 


3H 000 = (1 _3x) 9K(<7 0> q' 0 ,q"), 

m( n a' n «) - t Y -2 F x f' a f- B r7r *r (l (9o + 9o) + (fl r o + < ?o) + ( < 7n +< ?o) — 

mq 0 , <?„, q B )- if F k 2 eje 2 € aar5 - \q B TqC+q" B Y-M 2 ) 


= if~ 2 F r k^k 2 ele 2 e aRy6 + k) 2 -M 2 ) ( when three final pions are on mass shell), (5a) 


M°*- = if~ 2 F x k t a k 


-2 


•2 fc l 


aSy£ 


(i _ (q.+g-) 2 -^. 2 

\ (q 0 +q++q.Y -Mf 


)- 


x 3 U(q 0 , q+, q~) 


- ieF 3 r ele£ 


17 (2g + ~k I \ 
\\k 2 -2q.-k 2 


fer(9.-* 2 )v_ 


(2 g_-K) h 

k 2 2 - 2 q_ ■ k 2 


Kq°M_-k 2 y) 


^ ayar 


+ (fe 2 -r— 6 )+ (*, -* 2 )X« 



(5b) 
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A: 


7 T + lq + ) 

Tr*(q.)\ | /*“(qj 

V I / 

¥ 


r<i|>/ \r< k 2 > 

(a) 

7T+-( q+ ) 




! / *"<q.>\ J A 
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ru 


,? 


r(k 2 ) 


l 


ru 2 > 


rtn 2 ) 


+ k, 


(b) 


FIG. 1. Pion-pole diagrams for (a) the neutral and 
(b) the charged cases. 


These equations are our basic results. 15 

Our expression for 9H 000 in Eq. (5a) agrees with 
that given by Aviv et al. We disagree with the re¬ 
sult for 9R 000 quoted by Yao, who has (through an 
apparent algebraic error) replaced -M, 2 in Eq. (5a) 
by -4Af T 2 . In the case of strictly massless pions, 
our on-shell result for 9H 000 is the simple state¬ 
ment that the terms in the matrix element quadrat¬ 
ic in the external momenta vanish. 16 This result 
can be immediately generalized to the reaction 
y+y —ntf 0 , as follows; The PCAC argument given 
above tells us that in the limit when any one tt° has 
zero four-momentum, with the other n - 1 7r°’s on 
the mass shell, the matrix element 3Jl(y +y — nir°) 
must vanish. In addition, gauge invariance implies 
that 3ft must vanish when either of the photon four- 
momenta, k t or fe 2 , vanishes. Taking four-momen¬ 
tum conservation into account, this gives us n +2 -1 
= n +1 independent conditions on the low-energy 
behavior of 3H. Since for massless, neutral pions 
the pion-pole diagrams (tree diagrams) sum to a 
constant, independent of pion four-momenta, the 
n + 1 conditions can be satisfied only if 3H(y+y 
— mt°) vanishes 17 up to terms which are at least 
of order (momentum)" + 1 . 

Our result for SHI 0 *"* in Eq. (5b) disagrees with 
the formulas quoted by Aviv et al. and by Yao, both 
of which overlook the class of pole diagrams pro¬ 
portional to F 3 *. The formula of Aviv et al. also 
has the 1 in the large round parentheses multiply¬ 
ing F* replaced by i. In order to better under¬ 
stand this latter discrepancy, it is helpful to have 
a diagrammatic interpretation of the various terms 
in Eq. (5b). This is given in Fig. 2, which illus¬ 
trates the lowest-order perturbation-theory con¬ 
tributions to 9R 000 and 9R 0 *" in the Gell-Mann-L6vy 
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FIG. 2. Lowest-order diagrams contributing to (a) 
3R° 00 and (b) 3H° + “ in the Gell-Mann - Levy o model. The 
single solid line propagating around each loop denotes 
the nucleon. In this order of perturbation theory, / -1 
~g r /M N} with g r the pion-nucleon coupling constant and 
with the nucleon mass. {The large black dot at the 
four-pion vertices denotes the pion-pion scattering am¬ 
plitude of Eq. (1). To lowest order in perturbation 
theory, this arises as the sum of a direct four-pion in¬ 
teraction [coming from the term (ir • H) 2 in the cr-model 
Lagrangianl and of pole terms involving isoscalar a 
mesons exchanged between pairs of pions.} 


o model. 1 * The first and fourth rows give lust the 
lowest-order contributions to the pole diagrams of 
Fig. 1. The o-pole diagrams in the second row can 
clearly be represented as matrix elements of the 
effective Lagrangian 

= > ( 6 ) 

with F a& the electromagnetic field-strength tensor. 
As a check, we note that 7T° 7 t - tt = (7r°) 3 + 27r°7r + 7r" JI and 
since the matrix element of (jt°) 3 has a Bose sym¬ 
metry factor of 6, the contributions of Eq. (6) to 
311°°° and to 9R 0 *' are in the correct ratio of 3:1. 
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Let us turn next to the five-point functions in the 
third row. Aviv et al. assume that these are rep¬ 
resented by the same effective-Lagrangian struc¬ 
ture as in Eq. (6). If this were so, a five-point 
contribution of -2/* 2 DlT to SHI 000 would imply a 
corresponding contribution of -f/* 2 Jll" to SHI 0 *", 
which would then combine with the a-pole diagram 
to give a total nonpole contribution of \f~ 2 SHI". 
This is the origin of the { in the formula of Aviv 
et al. In actual fact, however, we find that the 
five-point diagrams are not described by Eq. (6), 
but rather by the effective Lagrangian 

(7) 

Equation (7) still couples the three final pions 
through a pure I- 1 state, as required by G parity. 
In the charged-pion case, Eq. (7) obviously leads 
to the five-point contribution listed in the third 
row of Fig. 2(b). Although not gauge-invariant by 
itself, this contribution combines with the pole 
terms in the fourth row of Fig. 2(b) (which are 
also not by themselves gauge-invariant) to give a 
gauge-invariant sum. In the neutral case, using 


the fact that the matrix element of 3 6 7r°(7r 0 ) 2 is 
2i(^o + ^+0 = 2i(k l +k 2 ) and using Eq. (4) to elimi¬ 
nate F 3 " in terms of F", we find that Eq. (7) just 
gives the gauge-invariant contribution -2/~ 2 DU", 
as required. 19 Finally, we note that while Yao ob¬ 
tains the correct value of 1 for the constant term 
in the large round parentheses multiplying F", he 
gets this by using an incorrect effective Lagran¬ 
gian, which does not respect the 1 rule, to 
generalize from the neutral to the charged case. 
The moral is that effective Lagrangians must be 
handled with caution. When ambiguities arise as 
to the form of the effective Lagrangian, they must 
be resolved by reference back to the basic current- 
algebra relations, which the effective Lagrangian 
is supposed to represent. 20 
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breakdown of asymptotic sum rules in perturbation theory 

Stephen L. Adler and Wu-Ki Tung 
The Institute for Advanced Study, Princeton, New Jersey 08540 

(Received 5 March 1969) 

It is shown that all of the principal results of the Bjorken-limit technique break down 
in perturbation theory in the “gluon” model of strong interactions. 

Three years ago Bjorken 1 pointed out that the asymptotic behavior of a time-ordered product of two 
currents is related to equal-time commutators of the currents and their time derivatives, 


lim Jd xe 


4 -iq •% 


q fixed 


T(j“{x)j\Q)) = (iq Q ) VA 


\e iq ‘ X 6(x°)[j^ a (x),jJ > (0)] 


+ (iqj 2 fd 4 xe 


4 xe tq ' X 6(x°)[j^(x),jJ > (0)] + O(q 0 


'), ( 1 ) 


j,V) = (9/9*V “(*). 


This connection has been extensively applied to the study of radiative corrections to hadronic (3 decay 2 
and to the derivation of asymptotic sum rules 3 and asymptotic cross-section relations 4 for high energy 
inelastic electron and neutrino scattering. In all of these applications, it is assumed that the equal¬ 
time commutators appearing on the right-hand side of Eq. (1) are the same as the “naive commutators 
obtained by straightforward use of canonical commutation relations and equations of motion. That this 
is a questionable assumption was pointed out by Johnson and Low, 5 who independently discovered Eq. 
(1). They studied this equation in a simple perturbation-theory model, in which the currents couple 
through a fermion triangle loop to a scalar (vector) meson. They found that in most cases the results 
obtained by explicit evaluation of the left-hand side of Eq. (1) differ from those calculated from naive 
commutators by well-defined extra terms. Because of special features of the triangle graph model, 
however, these extra terms do not directly invalidate the applications of Eq. (1) mentioned above. 

We report here the results of a more realistic perturbation theory calculation, which shows that for 
commutators of space components with space components, the Bjorken limit and the naive commuta¬ 
tor differ by terms which modify all of the principal applications of Eq. (1). We consider a simple, 
renormalizable model of strong interactions, consisting of an SU(3) triplet of spin-i particles # bound 
by the exchange of an SU(3)-singlet massive vector “gluon.” The vector current in this model is J^ a 
= $Yli* a $, and the naive equal-time commutator of two vector currents is 


6 (*° -y °)[ J* (*), A (y )] = 6 4 (*-y )?>(* )C ip {x ), 

C = * {xa ’ xb}(y n Y 0 7 v- Y i7o Y n ) +*^A V oVVoV- 


( 2 ) 


We wish to compare the Bjorken-limit commutator with the naive commutator, to second order in the 
gluon-fermion coupling constant^, in the special case in which Eqs. (1) and (2) are sandwiched be¬ 
tween fermion states. To do this, we calculate the renormalized current-fermion scattering ampli¬ 
tude Tilv (p ,/>' ,q) and compare it, in the limit as with the renormalized vertex f(C;p,p') of 

the naive commutator.® The scattering amplitude can be expressed in terms of the renormalized vec¬ 
tor vertex T{y^\p f p ( ) and the renormalized fermion propagator S(p) by 

T^“ b (P>P’><l) = ; i : '{y^,P,P+q)S{P+q)f{y v -,p+q,p')\ a \ b + f{y v -,p,p-q')S{p-q')f(y^,p-q' y p')K b \ a 

+B n“ b(p,p '’ q) ’ (3) 


with B^ u at> (p ,/>' ,q) the sum of the two box diagrams illustrated in Fig. 1. We find, by explicit calcu- 
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lation, 

lim f “ b (P,P',<l) = « , 0 ~ 1 fr ( C >P >P') + *] + °^q~ 2 ln ^ 0 )> 
q 0 -i°° 

q, p, p' fixed 


A = (//l * 


l/ 0 )y 0 U a > A 6 ] + i(y v y 0 y -y y 0 y„ 





We see that the Bjorken-limit commutator and the naive commutator differ by the term labeled A, 
which is well defined and finite. We note that A vanishes when 4 = 0 or v = 0, indicating that for the 
time-time and time-space commutators, the Bjorken limit and the naive commutator agree. This re¬ 
sult can be independently deduced from the usual on-shell Ward identity 

the consistency between Eq. (4) and Eq. (5) provides a convenient check on the calculation leading to 
Eq. (4). When one or both currents J^ a t jJ* is replaced by the corresponding axial-vector current 
J^ a = ^r^r^ a ^ 9 a formula like Eq. (4) holds, with the appropriate change in C and with A modi¬ 

fied as follows: 


J 


a 


4 




j b -j u 

v v 


a-a, 5 , 





r 56 A 

— tj «=*A — 

v 





One may wonder whether our definition of T(C;p,p f ) could be changed by a finite rescaling in such a 
way as to absorb the term A. However, since y^yqy u + y v y^y^ 4 -£ 41^0 a“ d since y^yqy u 

-YvYOYil the vertex V(C;p f p f ) is a linear combination of vector and axial-vector vertices. 

Therefore, the normalization of this vertex is completely fixed by the time-component current algebra 
and Lorentz covariance, and rescaling is not permitted. 

In addition to studying the q 0 ~* term in Eq. (1), we have also calculated the q 0 ~ 2 term in the special 
case considered by Callan and Gross . 4 Specializing to forward scattering (p =p r , a- b) and Spin aver¬ 
aging, we find 


lim lim mp 


-2 2 

0 q o 


itr 


[( : 


• P+m \^ aa,_ . ^ 

2 —) T ii {p ’ p ’ q) \ 


=-2(6 i } - py ')(\ a ) 


2 +^[ 2 (ln^ 0 2 + const )(6 i; -^V) +/> V](* a )' 



The presence of ln^ 0 8 on the right-hand side of Eq. (7) indicates that the expression of Eq. (1) cannot, 
strictly speaking, be carried out to order q 0 ~ 2 , and that the coefficient (p\&(p®)[jj a (?) f Jj a (Q)]\p) of the 
q 0 ~ 2 term is logarithmically divergent. 7 Using naive commutators to evaluate this coefficient, Callan 
and Gross concluded that the double limit on the left-hand side of Eq. (7) should be proportional to the 
transverse tensor 6^ -pp. The presence of the additional term (g2/6n2)pif>?(\ a )2 in Eq. (7) indicates 
that their conclusion fails in perturbation theory. 

We next indicate how the various applications of the Bjorken-limit technique are modified by our re¬ 
sults. 



FIG. 1. Box diagrams contributing to B The 
dashed line denotes the virtual gluon. 


FIG. 2. Diagrams for the radiative corrections to 
the vector transition. The wavy line denotes the vir¬ 
tual photon. 
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(i) Radiative corrections to <3 decay . 2 —We consider the vector transition between the fermions 
and tp 2 . We introduce a cutoff A 2 and calculate the divergent part of the radiative corrections to this 
process, described by the diagrams of Fig. 2. Using the time-component current algebra alone, it has 
been shown that the first three diagrams in Fig. 2 sum to a universal, structure-independent fraction¬ 
al change in the decay amplitude 6M/M = ( 3 a/ 87 r)lnA 2 . The divergent part of the fourth diagram in Fig. 
2 can be evaluated to orders 2 from Eqs. (4) and ( 6 ), giving 6M/M = (3a/Sn)2Q InA 2 ( 1 —3^/1 6 rr 2 ), with 

Q the average charge of the doublet ip x 2 . The term proportional to g 2 comes, of course, from the iso¬ 
spin-symmetric term in A. The total divergent part of the radiative correction is thus, to order g 2 , 

( 6 M/M) tQtal - ( 3 <*/ 87 r)lnA 2 [l + 2Q(l-3^ 2 /l67r 2 )]. ( 8 ) 

We see that the choice Q = which removes the divergence to lowest order ing 2 , still leaves a re¬ 
sidual divergence in second order. 

(ii) Asymptotic sum rules and cross-section relations .** 4 —We introduce the variable w = - q 2 /p • q and 
define the spectral functions W^oj,? 2 ) and W 2 (oj,? 2 ) by 


disc 1 
_ w . 

-2 m 4 r 


y +m 
2m 



(P,P,q) 


= \ a \ b [W ,q 2 ){-g Jq 2 ) 

2 2 2 
+ W 2 (w,q ){p^-p-qqjq ){P v ~p-qqJq )], 


a) >0. 



In terms of these spectral functions, the asymptotic formula of Eq. (7) may be rewritten as the sum 
rule 


lim (••• )(6 y -^V) + ^y= lim 2 ftwdw [mW^-p l f> j ) + ( m W l +q 2 m W 2 /J 2 )f> l p j }, 

r 2 — OO /7 2 — _ OO 


( 10 ) 


q~ - 


first obtained by Callan and Gross. As these authors note, the quantity mW 1 +q 2 mW 2 / oj 2 is positive 
definite, differing only by positive factors frqm -^ 2 ^(a),^ 2 ), with the longitudinal electroproduc¬ 
tion cross section. Thus the presence of in Eq. (7) implies that, in the quark model, ? 2 a£(a>,? 2 ) 
does not vanish asymptotically, in disagreement with the conclusion of Callan and Gross. In a similar 
fashion, the SU(3)-antisymmetric part of Eq. (4) leads to the asymptotic sum rule 

l-^= lim (11) 

q 2 — -oo 


Apart from the term g 2 / 8 rr 2 on the left-hand side, which comes from the SU(3)-anti symmetric part of 
A, Eq. (11) is the backward-neutrino-scattering asymptotic sum rule of Bjorken . 1 The modification in 
the left-hand side of Eq. ( 11 ) is closely related to the nonvanishing of q 2 orj^(w f q 2 ). To see this, we 
write down the usual fixed-? 2 , time-component algebra sum rule 8 

1 =2 J*dwq 2 mW 2 /w 2 ( 12 ) 

and subtract it from Eq. ( 11 ), giving 9 

-g z /&ir 2 = lim -2f o 2 doj(mW 1+ q 2 mW 2 /oj 2 ). (13) 

q 2 — — oo 

Thus the SU(3)-antisymmetric term in A and the p l fP term in Eq. (7) are basically the same phenome¬ 
non. As an additional check on our arithmetic, we have calculated and W 2 directly, giving mW y 
+ q 2 mW i /wi 2 =£- 2 4>/32it 2 , in agreement with Eqs. (10) and (13). 

We have also studied the scalar (pseudoscalar) gluon model in perturbation theory, and find effects 
similar to those reported here. Full details of the calculations, and further discussion, will be pub¬ 
lished elsewhere. 

We wish to thank W. A. Bardeen and S. B. Treiman for helpful discussions, and Dr. Carl Kaysen for 
the hospitality of the Institute for Advanced Study. After this work was completed, we learned that 
R. Jackiw and G. Preparata had also discovered the breakdown of the Callan-Gross result in perturba¬ 
tion theory. 
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Note added in proof . —(i) We have been informed by J. D. Bjorken of a related paper by A. I. Vain¬ 
shtein and B. L. Ioffe {Zh. Eksperim. i Teor. Fiz.—Pis’ma Redakt. £, 917 (1967) [translation: Soviet 
Phys.-JETP Letters £, 341 (1967)]}. We will discuss this work in our detailed paper, (ii) In the case 
when one current is an axial-vector current [the first two lines of Eq. (6)], we have omitted an SU(3)- 
singlet contribution to the Bjorken limit coming from the triangle diagram discussed by Johnson and 
Low. Addition of this piece does not alter any of our conclusions. 


*J. D. Bjorken, Phys. Rev. 148, 1467 (1966). 

2 For references, see G. Preparata and W. I. Weisberger, Phys. Rev. 175 , 1965 (1968). 

3 For a survey, see lectures by J. D. Bjorken, in Selected Topics in Particle Physics, Proceedings of the Inter¬ 
national School of Physics “Enrico Fermi,” Course XLI , edited by J. Steinberger (Academic Press, Inc., New 
York, 1968). 

4 C. G. Callan and D. J. Gross, Phys. Rev. Letters 22, 156 (1969). 

5 K. Johnson and F. E. Low, Progr. Theoret. Phys. (Kyoto) Suppl. Nos. 37-38, 74 (1966). 

®The renormalized vertex f (C;/>,/>') is obtained from the unrenormalized vertex r(C;£,£') by multiplying by the 
fermion wave-function renormalization constant Z 2 , with no further finite rescalings. 

7 The term ln? 0 2 is present when p 0 is finite and is not a result of the additional p 0 ~~ 00 limit. 

8 See S. L. Adler and R. F. Dashen, Current Algebras (W. A. Benj‘amin, Inc., New York, 1968), Chap. 4. 

9 This connection was first noted by F. J. Gilman, Phys. Rev. 167, 1365 (1968). 
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We present detailed calculations illustrating the breakdown of the Bjorken limit in perturbation theory, 
in the “gluon” model of strong interactions. To second order in the gluon-ferrmion coupling constant in 
the scalar, pseudoscalar, and vector coupling models, we calculate the Bjorken-limit commutator of a 
pair of currents of arbitrary (vector, axial-vector, scalar, pseudoscalar, tensor) type. To fourth order 
in the coupling, in the scalar- and pseudoscalar-gluon models, we determine the leading logarithmic be¬ 
havior of the »SCi r 8-antisymmetric part of the vector-vector commutator. In the body of the paper we present 
the main results and discuss their various features and implications. The computational details are relegated 
to two appendices. 


I. HISTORICAL INTRODUCTION 

E QUAL-TIME current commutators have come to 
play a central role in particle physics. In his 
famous papers of 1961 and 1964, Gell-Mann 1 proposed 
that the time components of the vector and axial-vector 
octet currents satisfy a simple SU 3 ®SU s algebra. The 
exploitation of this postulate by the “infinite-mo¬ 
mentum” and “low-energy theorem” methods has led 
to important predictions, which agree well with ex¬ 
periment. 2 The beauty of these “classical” current- 
algebra methods is that they depend only on the 
postulated commutation relations together with such 
weak dynamical assumptions as pion-pole dominance 
and unsubtracted dispersion relations. They are inde¬ 
pendent of more detailed (and therefore, more dubious) 
dynamical assumptions. The experimental successes 
thus provide a strong argument that any future theory 
of the hadrons must incorporate the time- 

component current algebra. 

This requirement, of course, does not uniquely 
specify a model of the hadrons—there are many 
possible field-theoretic models which satisfy the Gell- 
Mann hypothesis. In an attempt to narrow the selec¬ 
tion, attention has been turned recently to the study of 
the space-component-space-component commutators, 
which can be used to distinguish between models which 
have the same time-component algebra. The problem 
of finding experimental tests of the space-space algebra 
is made difficult by the fact that the “classical” current- 
algebra methods of infinite-momentum limits and low- 
energy theorems cannot be made to apply in this case. 
However, in 1966 Bjorken 3 pointed out that the 
asymptotic behavior of a time-ordered product of two 
currents is simply related to the equal-time commu- 


(1964) Gell " Mann, Phys ‘ Rev * l2S > 1067 K 1962 ) jFhysics^l,^ 

2 For a survey, see S. L. Adler and R. F. Dashen, Current 
Algebras (Benjamin, New York, 1968). 

3 J. D. Bjorken, Phys. Rev. 148, 1467 (1966). 


tator of the currents, 

lim f dAxe** X T (J a) {x) / (2 ) (0)) 

go->t*oo;q fixed * 

= t?,f 1 / *** e iq z t>(*0 Uvi*), Jm (0)]+0 (qo~ r ) . (1) 

Equation (1) has been extensively applied to the study 
of space-space current commutators, leading to a new- 
class of asymptotic sum rules. 4 These sum rules have 
testable experimental consequences in inelastic electron 
and neutrino scattering reactions and important impli¬ 
cations in the theory of radiative corrections to hadronic 
j£? decay. 

In all of the applications of Eq. (1), an important 
assumption is made: It is assumed that the equal-time 
commutator appearing on the right-hand side of Eq. 
(1) is the same as the “naive commutator” obtained by 
straightforward use of canonical commutation relations 
and equations of motion. That this is a questionable 
assumption was pointed out by Johnson and Low, 6 who 
independently discovered Eq. (l). They studied this 
equation in a simple perturbation-theory model, in 
which the currents couple through a fermion triangle 
loop to a scalar, pseudoscalar, or vector meson. They 
found that in most cases the results obtained by 
explicit evaluation of the left-hand side of Eq. (1) 
differ from those calculated from naive commutators by 
well-defined extra terms. Because of special features of 
the triangle graph model, however, these extra terms 
did not directly invalidate the applications of Eq. (1) 
mentioned above. 

Recently, we have reported a more realistic perturba- 


4 For a survey, see lectures by J. D. Bjorken, in Selected Topics 
in Particle Physics , Proceedings of the International School of 
Physics u Enrico FermiCourse XU t edited by J. Steinberger 
(Academic, New York, 1968). 

6 K. Johnson and F. E. Low, Progr. Theoret. Phys. (Kyoto) 
Suppl. Nos. 37-38, 74(1966). Important early work on the validity 
of the Bjorken limit, in the context of the Lee model, has also 
been done by J. S. Bell, Nuovo Cimento 47A, 616 (1967). 
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tion-theory calculation , 6 which showed that for com¬ 
mutators of space components with space components, 
the Bjorken limit and the naive commutator do differ 
by terms which modify all of the principal applications 
of Eq. (1), In other words, asymptotic sum rules derived 
from the naive space-space commutators fail in perturba¬ 
tion theory . One is, of course, still free to postulate that 
nonperturbative effects conspire to make the asymp¬ 
totic sum rules valid when all orders of perturbation 
theory are summed, but the need for this assumption 
means that asymptotic sum rules do not just give a test 
of the space-space algebra, but involve deep dynamical 
considerations as well. 

In our previous work, we considered only vector and 
axial-vector current commutators in the quark model 
with a massive vector "gluon,” to second order in the 
gluon-fermion coupling constant g r . 7 In the present 
paper, we extend our results to arbitrary (vector, 
axial-vector, scalar, pseudoscalar, tensor) currents in 
the quark models with vector, scalar-, or pseudoscalar- 
coupled gluon. Working to second order in g T , we obtain 
results analogous to those found previously in the more 
restricted case. In addition, for the vector-vector 
commutator in the scalar- and pseudoscalar-gluon 
models, we obtain the leading logarithmic part of the 
gf term. In Sec. II we summarize our results and in 
Sec. Ill we discuss b riefly their significance. To facilitate 
reading, all computational details are relegated to 
Appendices. 

n. RESULTS 

We consider a simple, renormalizable model of the 
strong interactions, consisting of an SU$ triplet of 
spin-! particles p bound by the exchange of an SU 3 - 
singlet massive “gluon.” We assume that the gluon 
couples to the fermions by either scalar, pseudoscalar, 
or vector coupling. In order to treat simultaneously 
commutators involving vector (axial-vector, scalar, 
...) currents, we introduce the abbreviated notation 

1) = /<2) = $7(2)^, 

7(1) * 7/A°(7>.75X 0 , X°, ...), (2) 

7(2) = 7*X & (7i7sX 6 , X 6 ,...), 

according to whether the first or second current is a 
vector (axial-vector, scalar, ...) current. The naive 
equal-time commutator of the two currents is 

5(ir°—/)[/(!>(*),/ (2 ) {y)~\=&(x-y)${x)Cyp{x ), 

7oC7o7(i)> 7o7(2)] = 7(1)707(2)—7(2)7o7(i)- (3) 

We wish to compare the Bjorken-limit commutator with 
the naive commutator, in the special case in which 

6 S. L. Adler and W.-K. Tung, Phys. Rev. Letters 22, 978 
(1969). See also R. Jackiw and G. Preparata, ibid, 22, 975 
(1969), who have independently arrived at similar conclusions. 

7 In Ref. 6 we denoted the coupling constant g r by g. In the 
present work, g will always indicate a gluon (or its four- 
momentum). 



Fig. 1. (a) Lowest-order current-fermion scattering diagrams, 
(b) Diagrams obtained from the lowest-order ones by insertion 
of a angle virtual gluon. 

Eqs. (1) and (3) are sandwiched between fermion 
states. To do this, we calculate the renormalized cur¬ 
rent-fermion scattering amplitude ?(i)( 2 >* (P, P'y ?) 
in the limit qa—>i&>, and compare the coefficient of the 
qf l term with the renormalized vertex F(C; p , p r ) of 
the naive commutator. The asterisk on IT(i)( 2 )* in¬ 
dicates that it is the full covariant scattering amplitude, 
which differs from the renormalized T product, 
?<i)(i)(P, P', q), by a “seagull” term £ ( i)<*)(#, p', q) 
which is a polynomial in qo , 

^(i)<2 )*(A p\ ?) == 2t(D(2)(#, P\ q)+?(D(2 )(P, p\ q )« (4) 

Identity of the Bjorken limit and naive commutators 
would mean that 

lim Tmt)(p,p',q) = q<r 1 T‘{C;p,p') 

«-»■»«*;q.p.p' fixed 

+O(q 0 -‘ lnqo). (5) 

In the calculation which follows, we test the validity of 
Eq. (5) in perturbation theory. 8 

A. Second Order 

To second order in the gluon-fermion coupling 
constant g r , there are two classes of diagrams which 
contribute to f( i)( 2 )*. The diagrams of the first class, 
illustrated in Fig. 1, consist of the lowest-order current- 

8 A general discussion of the mechanism responsible for Bjorken- 
limit breakdown has been given by W.-K. Tung, Phys. Rev. 
188, 2404 (1969). See also R. Jackiw and G. Preparata, ibid. 185, 
1929 (1969). 
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Fig. 2. Diagrams containing fermion triangles. 


fermion diagrams and the second-order diagrams ob¬ 
tained from the lowest-order ones by insertion of a 
single virtual gluon. The diagrams of the second class, 
illustrated in Fig. 2, involve a fermion triangle diagram. 
We denote the contributions of these two classes to 
?(D( 2 )* by ?(i)( 2 )* Compt and f(i ) ( 2 ) * trlang , respectively. 

The first-class diagrams are evaluated by the stand¬ 
ard technique of regulating the gluon propagator with 
a regulator of mass X, which defines an unrenormalized 
amplitude ^(l)( 2 )* C ° mpt * To get the renormalized ampli¬ 
tude, one multiplies by the external fermion wave- 
function renormalization constant Z 2 and takes the 
limit \—> co | 

fiM)* Compt = limZ 2 r ( i) (J )*c° m P‘. (6) 

X-*-co 

In certain cases, as discussed below, this limit diverges 
logarithmically; in these cases, we take X to be finite 
but very large, dropping terms which vanish as X—>co 
but retaining all terms which are proportional to lnX 2 . 
The renormalized vertex 

T(C;p,p') = ]imZ i r(C;p,p') 

oo 

is calculated by the same techniques from the diagram 
of Fig. 3. Finally, we take the limit in our 

expression for 2 , (i)( 2 )* Compt and compare with f(C; p,p '), 
giving the results 

<r(i)(i) Coinpt (P, P\ q) = 0, (7a) 

lim f(i) ( j)* Compt (A p\ q) 

go-»3°o;q, p,p f fixed 

= lim fa)<J) Compt (^', ?) 

go-*|oo;q,p,p' fixed 

= tff+fXC; p, p r ) +A Com P t ]+0(^o' 2 lngo), (7b) 
^Compt = (g r 2 /327 r 2 ) {ln(X 2 /| q 0 | 2 )[- 7 ( 1 ) 707707707 ( 2 ) 
+2'YTr7(i)7o7(2)7 T Y 

— ^7(i)7o‘Y7r7(2)7 T Y- \ Y7r7(i)7 T ‘Y7o7C2)] 

— §7(1)7077o77o7(2) — \ 77o7(i)7o7(2)7oY 
+7(i)7o Y7o7(2)7o Y+Y7o7(i)7oY7o7(2) 

— 47(i)7oY7r7(2)7 T Y- i'Y7r7(i)7 T 'Y7o7(2) 
+iY[7T7(i)7 T 7(2)7o+7o7(i)7r7(2)7 T ]Y 

-( 1 )^( 2 )}. (7c) 
In Eq. (7), the notation y***Y is a shorthand for 


1 * • • 1 in the scalar-gluon case, £y 6 * * *^ 7 s in the pseudo¬ 
scalar-gluon case, and (— 7 p)’** 7 p in the vector- 
gluon case. Some details of the calculation leading to 
Eq. (7) are given in Appendix A. 

The second class diagrams (Fig. 2) have been cal¬ 
culated by Johnson and Low . 6 In our model, which has 
only SC/g-singlet gluons, these diagrams contribute 
only to the SZ7*-singlet part of the commutator. Taking 
the Bjorken limit, and comparing with the bubble 
diagram contributions to F(C; p t p r ) illustrated in 
Fig. 4, Johnson and Low find 

lim T iini) *™*»*(p,p',q) 

go-^&a.'q.P.p' /fxed 

= *a)»““•(£, P', ?)+ lim T mw ^(p, p’, q] 

flO-»*®o 

= ?<1 )(J) triang (i>, P\ ?)+?0- 1 [r(C; p, p^) bubble_J_^triangj 

+O( ? 0 - J ln?o). ( 8 ) 

We will not exhibit the detailed form of A triaug , but only 
remark that in all cases A 4 "* 11 ® vanishes when the three- 
momenta q and q'=q+p—p' associated with the 
currents i) and 7 ( 2 ) vanish, 

A trian *| q « q ^=0. (9) 

[Equation (9) is true when the triplet of fermions^ are 
degenerate in mass. Johnson and Low 6 also discuss the 
effect of mass splittings,] Thus, for the physically 
interesting case of the commutator of spatially inte¬ 
grated currents, the entire answer is given by Eq. (7). 
No cancellation between the SZ7j-singlet part of 
^compt an( j ^triang ; s possible, and we conclude that the 
Bjorken limit and the naive commutator in our models 
differ in second-order perturbation theory. 

To make contact with our previous work and with 
our fourth-order results, it is useful to write out two 
special cases of Eq. (7). We consider the commutator 
of two vector currents, taking 7 (i)= 7 M X°, 7 < 2 )= 7 ik ft . In 
the vector-gluon case we find 

A c omP t = (gi/l&s 3 ) { 2(g fi r gpogvo) 7oC^°» VO 

+1 (7»7o7p 7p7o7*) {X®, V] }, (10) 

in agreement with the result which we have reported in 
Ref. 6 . In the scalar- and pseudoscalar-gluon cases we 
find 

A Compt = (^2/16+) { ( guv— gfiOgvo) 7 o[X°, X b ] 

- § ( 7 » 7 o 7 p— 7 p7o7k) {X°, X 6 } [In (X :a /| q 0 | 2 ) — 1 ]}. (11) 

B. Fourth Order 

To fourth order in g r , the number of diagrams con¬ 
tributing to T(i)( 2 )* is so large that a direct calculation 



Fig. 3. Second-order correction to the 
vertex of the naive commutator C. 
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of the Bjorken limit, in analogy with our treatment of 
the second-order case, is prohibitively complicated. 
However, unitarity implies that the part of A pro¬ 
portional to £X°, X b ], and independent of the three- 
momenta q, q', p, and p' and of the fermion mass w, 
is related to an integral over the longitudinal current- 
fermion inelastic total cross section, 6 - 9 Applying this 
connection to the commutator of vector currents in the 
scalar- and pseudoscalar-gluon cases, we have cal¬ 
culated the leading logarithmic contribution to the 
[\°, X b ] term in fourth order, with the result 

Xln(| qo | 2 )+ gf Xconst] + (terms symmetric in a, b) 

+ (terms proportional to q, q', p, p', and m). (12) 

Details of the unitarity relation and of the total cross- 
section calculation are outlined in Appendix B. 


Fig. 4. Self-energy diagram which 
makes a second-order correction to the 
Ai/a-singlet part of C. 


Using the fact that finite quantities on the left- and 
right-hand sides of Eq, (7b) must match up, we see 
that 

^Compt— [Aj+ A(C) - A( 7 d)) — A( 7 ( j>) ]C+finite, (16) 

confirming that the lnX 2 dependence in A Compt results 
from a mismatch between the multiplicative renormali¬ 
zation factors on the left- and right-hand sides of Eq. 
(7b). To check Eq. (16) directly, we note from Eq. 
(A10) that 



in. DISCUSSION 

We proceed next to discuss a number of features of 
our results of Eqs. (7) and (10)-(12). 

1. We begin by noting that to second order in g r 2 , 
ACompt contains terms ln(X 2 /| qo | 2 ) which diverge 
logarithmically both in the Bjorken limit and 

in the infinite-cutoff limit X—»oo, It is easy to see that 
the lnX 2 divergences result from a mismatch be¬ 
tween the multiplicative factors needed to make 
7'(i)(»* Compt (p, p‘, q ) and r(C; p, p') finite (i.e., 
lnX 2 independent). As we recall, the renormalized 
quantities ?\i)( 2 )* Compt (A p' t q) and I s (C; p, p f ) are 
obtained from p', q) and r(C; p, p') by 

multiplying by the wave-function renormalization Z 2 and 
taking the limit keeping any residual lnX 2 

dependence. On the other hand, the finite quantities 
T(i)( 2 )* Compt (/>, p' r g) finite and T(C; p, p') fiaiUs are ob¬ 
tained by multiplying by appropriate vertex and 
propagator renormalization factors which completely 
remove the lnX 2 dependence, 

r (C;p, p r ) finite =^(C)r(C; p, p r ), 

r (1)(2) *compt (?) ? )fmi te=z(7(i))z(7(2))z -i ( 13) 

xr (1)(2) *com P t( ?) p* y 

In general, the vertex renormalizations Z(C), Z( 7 (i)), 
and Z( 7 ( 2 )) are not equal to each other or to Z 2 . If we 
write 

Z(C) = 1+A(0, 

Z 2 =l+A 2) (14) 

then we find, to second order, that 
f ( i)( 2 )* Compt (£, p\ q) = Twf^Kp, P\ * 

+ £2A 2 — A (7(d) — A(7( 2 j)] (15) 

r 1 1 i 

X 7(1) 7T-7(2) +7(2) —7 -, 7(D , 

L yp+yq yp-yq J 

f (C; p, p r ) = r(C; p , p ') flnite +fA 2 — A(C)]C. 

9 F. J. Gilman, Phys. Rev. 167, 1365 (1968). 


A(C)C— (gj/ 327r 2 ) iyy T Cy T y InX 2 , 

A 2 7o= (&- 2 /3 2tt 2 ) I V7t7o 7 t ‘Y h*X 2 , (17) 

which allows us to rewrite the square bracket in Eq. 
(16) in the form 


(gr 2 /32tt 2 ) lnX 2 {7 ( i)IV7r7o7 T V7(2) —7(2)iV7 r 7o7 T V7ci) 

+ 2Y7t[7(1)7o7(2) — 7(2)7o 7(1)]7 t Y 

- 2Yy , r7(i)7 T 77o7(2)+7(2)7oi77r7(i)7 T 7 

- 7(i)7oi77r7(2)7 T 7+iT7r7(2)7 T 77o7(i) }, (18) 


with the four lines coming from A 2 , A(C), — A( 7 ( d), 
and — A ( 7 ( 2 )), respectively. A little algebra then shows 
that Eq. (18) is indeed identical with the lnX 2 part of 
Eq. (7c). 

The presence of terms which diverge as In I go I 2 in 
Eq. (7c) indicates that, in the general case , the Bjorken 
limit does not exist. The fact that the In | qo | 2 and 
lnX 2 terms occur in the combination ln(X 2 /|go| 2 ) 
means that, to second order , the existence of the Bjorken 
limit is directly connected with the matching of renormali¬ 
zation factors on the left- and right-hand sides of Eq. 
(7b): When the renormalization factors match, the 
Bjorken limit exists; when the factors do not match, the 
Bjorken limit diverges. 10 Unfortunately, we shall see 
that this simple connection does not hold in higher 
orders of perturbation theory. 

To interpret the divergence of the Bjorken limit, we 
note that the renormalized T product can be written as s 


00 qo— qo 



10 This was first noted by A. I. Vainshtein and B. L. Ioffe, 
Zh. Eksperim. i Teor. Fiz. Pis’ma v Redaktsiyu 6, 917 (1967) 
[Soviet Phys. JETP Letters 6, 341 (1967)]. These authors con- 
ectured that when the renormalization factors match, the 
ijorken limit and naive commutator agree. Our calculations show 
that this conjecture is invalid. 
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Table I. Cases involving vector (V) and axial-vector {A) octet 
currents with finite Bjorken limit in second order. 


Model 

Current 

-fy) 

Current 

7(2) 

Piece of 
current $C\f/ 

Vector gluon 

V or A 

V or A 

V or ,4 

Scalar or pseudo- 

V 

V 

V 

scalar gluon 

V 

A 

A 


A 

V 

A 


where the spectral function p is defined by 

u(p)p(p, p', q,?o)«(/0 

= (2t) 3 E {p\Jm\ N)(N | J m | p'Mq+p-N ) 

-(2t) 3 E (?!•/(« I iWI I m-H-#-?'). 

N 

( 20 ) 

Provided that the spectral function does not oscillate 
an infinite number of times 11 (and it cannot have this 
kind of pathological behavior in perturbation theory), 
when the Bjorken limit of p\ q ) exists it is 

equal to the integral 

u(p) f dqo' p(p, p', q, qo')u(p') 

— — 00 

= (2t) 3 E (P I Jm | N)(N | /<„ | p')5 3 (q+p-N) 

N 

- (2 t) 3 E (? I Jm I iVXiV | J m \ p')8*( q+N-p'), 

N 

(21) 

which is just the usual sum-over-intermediate-states 
definition of the commutator. Conversely, in the cases 
in which the Bjorken limit diverges logarithmically, 
the integral and sum in Eq. (21) must diverge logarith¬ 
mically, 

2. There are a number of interesting cases in which 
the renormalization factors do match, and hence the 
Bjorken limit exists in second order. We have enumer¬ 
ated in Table I all examples of this type in which all of 
the currents involved, 7<i), 7(2), and \fC\p, are either 
vector or axial-vector octet currents. Specific formulas 
for A 00 ™ 1 * 1 in the case when 7<i) and 7(2) are vector 
currents were given in Eqs. (10) and (11) above, 
(To obtain the corresponding formulas when 7(i) 
and/or 7( 2 ) are axial-vector currents, in the vector- 
gluon case, one simply multiplies from the left or right 
by 75 according to the scheme shown in Table II.) 

The remarkable result that emerges from these 
examples is that, even when the Bjorken limit exists in 
second order, it does not agree with the naive commutator 
(that is, A 00 ™ 111 is finite but nonzero), According to our 

11 This pathological case is discussed in detail by R. Brandt and 
J. Sucher, Phys. Rev. Letters 20, 1131 (1968). 


previous discussion, this means that the Bjorken limit 
agrees with the spectral function integral of Eq. (21), 
but the naive commutator does not. Most of the prin¬ 
cipal applications of the Bjorken limit technique for 
space-component-space-component commutators as¬ 
sume the identity of the Bjorken limit and naive com¬ 
mutator, and therefore, according to our results, break 
down in perturbation theory. Further details of this 
breakdown are given in Ref. 6. 

3. From an inspection of Eq. (10) and Table II, 
we see that in the vector-gluon case, for all commu¬ 
tators involving vector and axial-vector currents, 
A Compt vanishes when either ji= 0 or 0. This result can 
be deduced directly from the Ward identity 10 satisfied 
by ?(i)( 2 j* Compt (p, p\ q) y which, in the case when 
/a) and 7(2) are both vector currents, states that 

Tmm*°° mv Kp, P\ q ) Iraw'VVro-r.** 

= rflV,X»] 7 ,; p,p'). (22) 

Multiplying by £ 0 “ 1 and taking the limit £<j—hoo gives 
immediately 

lim f(i)< S )* Comp *(/», P', q) Ito)=toX 0 .t( 2 )-t^ 4 

£Q-»{eo 

= ? 0 _1 f ([X°, X 6 ]7.; p, p') +0(?(f 2 ln?o), (23) 

confirming our explicit calculation, A similar derivation 
holds in the cases involving axial-vector currents, pro¬ 
vided that the divergence of the axial-vector current 
is “soft,” 18 as it is in the vector-gluon case. We thus see 
that the breakdown of the Bjorken limit which we 
have found is consistent with the constraints imposed 
by Ward identities. Therefore all of the results of the 
Gell-Mann time-component algebra, which are derived 
directly from the Ward identities, remain valid, 13 

4. We turn next to the order g r 4 result of Eq. (12), 
which gives the VV—>V commutator in the scalar- and 
pseudoscalar-gluon models (the second line in Table 
I), We see that even though the renormalization factors 
match , the Bjorken limit in this case diverges in fourth 


Table II. 

Substitutions to get axial-vector current results in 
the vector-gluon case. 

Current 

Current 


J a) 

7 (2) 

Change in Eq. (10) 

V 

V 

none 

A 

V 

^Compt—>_y £ /\Compt 

V 

A 

^Compt—>^C°mpty g 

A 

A 

^Compt—> .yg^Coiiiptyjj 


12 See Ref. 2, pp. 257-260, for a discussion of soft divergences. 
w There is one exception to this statement, which arises when 
Ward identity anomalies are present. See S. L. Adler, Phys. Rev. 
177, 2426 (1969); J. S. Bell and R. Jackiw, Nuovo Cimento 60, 
47 (1969); R. Jackiw and K. Johnson, Phys. Rev. 182, 1459 
(1969); S. L. Adler and W. A. Bardeen, ibid, 182, 1517 (1969). 
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order . We note, however, that the divergence behaves 
asg/ln) go | 2 , whereas in fourth order, terms behaving 
like g r 4 (ln | qo | 2 ) 2 could in principle be present. On the 
basis of this behavior and our second-order results, we 
make the following conjecture: When the renormalization 
factors needed to make 7'(d(2)*(?, p\ q) and T(C; p, p') 
finite are the same , the Bjorken limit in order 2 n of per¬ 
turbation theory contains no terms g r 2n (ln \ qo ( 2 ) n , but 
begins in general with terms g r 2n (ln | q 0 1 2 ) K 

We have only calculated results for the scalar- and 
pseudoscalar-gluon models because these models have 
the simple property that, when the unitarity method of 
Appendix B is used, each individual intermediate state 
makes a contribution behaving at worst as g T 4 In I q 0 I 2 . 
The situation in the vector-gluon model is more compli¬ 
cated, since here the individual intermediate states 
contain terms behaving as g,. 4 (ln | qo | 2 ) 2 , as well as 
terms g,. 4 ln|^o| 2 . If our conjecture is correct, the 
g,. 4 (ln | q 0 1 2 ) 2 terms from the various intermediate 
states in the vector-gluon case must add up to zero. 
We have not checked whether this happens; it would 
clearly be worth doing. 

5. As mentioned in Sec. I, one can try to save 
asymptotic sum rules by postulating that nonpertur- 
bative effects conspire to make asymptotic sum rules 
valid when all orders of perturbation theory are 
summed. A simple way that this could happen would 
be if our order g 2 terms in A Compt were the lowest-order 
terms in an expression 

A ex pf— Bg r 2 In | q 0 1 2 ], U>0 (24) 

which damps to zero as ^(j—H 00 . However, examination 
of our fourth-order result in Eq. (12) shows that 
exponentiation gives 



Xe x p ( 7 ^ln| ?0 |2 )+O(g, 6) ) (25) 

which blows up exponentially rather than damping. 
In other words, the simple damping mechanism of Eq, 
(24) cannot be correct, although our fourth-order 
calculation obviously cannot rule out more complicated 
damping mechanisms, 

6. In Eqs. (A 14) and (A15) of Appendix A, we 
indicate that when the Bjorken limit is taken 

before letting the regulator mass X go to infinity, one 
obtains just the naive commutator. Thus, it is tempting 
to try to “save” asymptotic sum rules by prescribing 
that, instead of using renormalized perturbation theory 
(limit o taken first), one should always work with 
the unrenormalized quantities, with X very large but 
finite. 14 We will now argue, however, that this is a 
spurious resolution of the difficulty. Let us consider the 


sum rule, derived in Appendix B, connecting the 
[X°, X>] term in Eq, (11) with an integral over the 
longitudinal current-nucleon cross section Lr (q 2 , to), 
with to= — q 2 /p*q* In the renormalized (X—»a>) theory, 
where there is Bjorken-limit breakdown, we find to 
second order that 


lim f(i)m* Co “ pt =?(f'i[>, X 6 ] 

X [/Wo7„+7/yo7M+ 2 {guv- gpagvohof] 


+ (term symmetric in a , b) +O($o 2 ln^ 0 ), (26) 



lim 2 



dwLn+g 



where the subscript »+g is a reminder that to second 
order we need only retain the single neutron plus 
gluon intermediate-state contribution in calculating Lr. 
Our explicit calculation shows that 


/=g, 2 /167T 2 , 


lim Ln+g- (q 2 , a>) = {g 2 / 64x 2 )co, (28) 


in agreement with Eq. (27). As we noted in Ref. 6, 
Eq. (27) indicates that the breakdown of the Bjorken 
limit in Eq. (26) is essentially the same phenomenon 
as the breakdown of the Callan- Gross relation, 16 which 
states that 


limW((f| W ) = 0. (29) 


Let us now consider the analogs of Eqs. (26)-(29) in 
the regulated (X-finite) theory. Since, in order gr 2 , 
matrix elements are always linear in the gluon propa¬ 
gator, to obtain the regulated matrix element in order 
g r 2 we simply subtract from the renormalized matrix 
element the corresponding expression with the gluon 
mass ju 2 replaced by the regulator mass X 2 . Since / in 
Eq. (28) is independent of ju 2 , we find that Eqs, (26)- 
(28) become 


lim r ( i)( 2 )* CoInpt =^o [X°, X b ](7/yo7»+7/yo7n) 




+ (term symmetric in a, b) +0{qo 2 ln?o), (30) 

0= lim 2 f da LtoC (^, to); 
a 2 -*- 00 l o 

lim Lt 0 r (q 2 , w) = 0, (31) 

(A*— 

with 

-Ltot {q 2 j to) L n + g (^j co) Lj^g (^ 2 , co) |n 2 .*x 2 ‘ (32) 


As expected, in the regulated theory the Bjorken limit 
is normal and the Callan-Gross relation is satisfied. 
However, a disturbing problem arises when we examine 
in detail exactly how the Bjorken limit is satisfied. Let 
us suppose that the regulator mass X is much larger than 


u This point of view has been advocated by C. R. Hagen, 
Phys. Rev. 188, 2416 (1969). 


18 C. G. Callan and D. J. Gross, Phys. Rev. Letters 22, 156 
(1969). 
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the fermion mass m and the gluon mass n, 

X 2 »< X 2 »/x 2 J (33) 

and let us consider, for fixed co, two ranges of values of 
“9 s * 

range 1: ju 2 , 1), 

range 2: ^/(2o J ' 1 -l)<- ? 2 , J=(X+») S -» S . 

(34) 

The dividing point between the two ranges is just the 
threshold for regulator particle production. For 
— we have (g+p) 2 < (X+m) 2 , and 

regulator particle production is forbidden. Thus, in 
range 1, the second term in Eq. (32) vanishes, 

L*Ux 2 =0, (35) 

while the first term has its asymptotic value 

Ln+g (q 2 , w)^(g r 2 /64?r 2 )a>, (36) 

and the Callan-Gross limit is not satisfied. In range 2, 
we have (q+p) 2 > (X+m) 2 , and regulator production 
is allowed; for — 5 3 ^$>X 2 /( 2 ar 1 —1), the second term in 
Eq. (32) attains the same asymptotic value as Eq. 
(36), and the Callan-Gross limit is satisfied. Thus, we 
see that in the regulator theory , the Callan-Gross limit is 
satisfied only in a region in which — q 2 is big on a scale 
determined by X 2 , and then only by virtue of the un¬ 
physical, negative contribution of regulator production to 
the total longitudinal cross section . We conclude that the 
regulator theory does not afford a satisfactory resolu¬ 
tion of the breakdown of the Bjorken limit in perturba¬ 
tion theory. 
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APPENDIX A: CALCULATION OF A 00 ”* 1 

In this appendix we outline the calculation leading 
to Eq, (7) in the text. We recall that T(D( 2 )* Compt is 
defined as the contribution to the current-fermion 
scattering amplitude of the diagrams shown in Fig. 1, 
consisting of the lowest-order current-fermion diagrams 
and the second-order diagrams obtained from the 
lowest-order ones by insertion of a single virtual gluon. 
We may write 

f( lH 2)* Compt (p , p', q) = T(y {1) ; p , p+q) §(p+q) 

xf(7( 2) ; p+q, p') + ?( 7(2); p, £-?')£(£-?') 

XT( 7 (d; p—q', p')+B(i)(2)(p, p r , q), (Al) 


where S and T are the renormalized propagator and 
vertex functions and where U ( d (2) denotes the sum of the 
two box diagrams on the fifth line of Fig. 1. We shall 
calculate in the limit and isolate the 

coefficient of the qf x term. This is to be compared with 
the matrix element of the naive commutator between 
fermion states, given by T(C; p , p '). 

The renormalized vertex function T for the current 
$7(i)^ is given, to second order in g, by 

f (7 <d; P, p') = z 2^(y(D)p, £')=-Z27(i)+A(7(n; p, p '), 

(A2) 

with Z 2 the fermion wave-function renormalization and 
with A( 7 (d; p , p') the usual unrenormalized second- 
order vertex part (arising from diagrams on the second 
and fourth lines of Fig. 1). Note that T is obtained by 
multiplying the unrenormalized vertex function by the 
wave-function renormalization, with no further sub¬ 
tractions or rescaling. The renormalized propagator is 
given, to second order in g r , by the usual expression 16 

Sipy^ZtSipy^Ztiy-p-mo)-^), (A3) 

with 2(/>) the unrenormalized proper fermion self- 
energy part (arising from the diagrams on the third line 
of Fig. 1) and with m 0 =m+8m the fermion bare mass. 
Denoting the lowest-order current-fermion amplitude 
by T(i)( 2 ) Born , we see that the first two lines on the 
right-hand side of Eq (Al) may be rewritten as 

ZfT (i)( 2) Born +CA(7(i); p , p+q) ( 7 * £+7"?-^)~ 1 7(2) 

+y(i)(yp+yq— m)~ 1 ($m+'Z(p+q)) 

X(yp+yq - m)'^) +7(d ( 7 * £+7*?- «0' ! 

XA( 7 ( 2) ; p+q, p') + (( l)<->(2), q*->— ?')]. (A4) 

According to Eq, (A2), the matrix element of the naive 
commutator is 

Z 2 C+A(C; p, p r ) . (A5) 

It is easy to see that, as the qf 1 term of 

ZfT{ i)( 2 ) Born is precisely Z 2 C, Our task is therefore re¬ 
duced to comparing the qo' 1 term of 

CA(7(i)j p, p+q) {yp+yq—m)~ l y( 2 )+- * * 

+ ((l)* -> (2), q<~*— q')l+B(D( 2 )(p, p r , q) (A6) 
with A(C; pj p'). 

The unrenormalized self-energy and vertex parts 2 
and A are calculated by the usual technique of in¬ 
troducing a meson regulator of mass X, giving 


s( , )= ZJ>L f 1 dx y( VY .p +m ) in + (1 -^ ) . ( - ^ . + " 2)+ ^ +(1 -^ )V l 
{P) lfrr* J 0 y{ ^ P+} Y U(l-^)(-? 2 +m 2 )+^+(l-^)wJ 


(A7) 


16 In this equation, S(p) denotes the unrenormalized propagator in the presence of all interactions. 
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and 


My (d; p, P') = 


~ gr 2 ^ 

16tt 2 


f* f X dy 
J n J n 


2YTr7(D7 


-vln P Xa ~ C( ^ P ’’ X,y) 1 
Lz;i J -C(/>, p',z, y)J 


l_c \p, p i y) zfi 


1 


C(P, p', x, y)-s\ 2 JJ 


(A8) 


where 


z— 1— x— y , 

C(p, p',x, y) =x(l-x) p 2 +y(l- y) p' 2 -2xyp- p'- (x+y)mK 


(A9) 


In order to obtain the renormalized propagator and vertex from Eqs. (A2) and (A3), we must calculate the 
X >oo limit of the unrenormalized self-energy and vertex parts, dropping all terms which vanish in this limit but 
retaining powers of lnX. fNote that the \-^oo limits of 2 and A are not the same as the renormalized self-energy and 
vertex parts 2 and A, which are defined, in the X-—>a> limit, by Z 2 ( 7 ‘ p— mo) — 2(£) — yp—m— 2(£), ZfYo.) 
+A( 7 (d; p, p f ) — 7 d)-|-A( 7 (dj p , p r ).~] Taking the X —>00 limit in Eqs. (A7)-(A9), we find 


-g 2 


limS(p) = 

\-*ao 107T 




xK 2 


p 2 +m 2 ) +xp*+(l—x)hn i 


\ > 


— gr 2 z* 1 ri—* n sX 2 1 

limA( 7a) ; p, p') = — r - dx dy • lY7r7(i)7 r Y m t A , -r 

x-*- 16 t r 2 / 0 J 0 { lzti*-C(p, p', x,y)j 

+ vL(l—x)y m p—yyp f +™]ya)[ m (l—y)yp , —xyp+m]y 

Finally, taking infinite-momentum limits of these expressions, we find 


1 


c(p, p', *, y)-zA* ! 


(A 10) 


lim limS(p) = (- g r yi6ir J ) Y 7 oY?o [5 ln(X*/| pa | J ) +|]+0(ln^o), 

J>0-**oo X-*oo 

lim limA(7 (d; p, p’) = (g r yi6ir J ) {-|Y7r7(i)7 r Y[Mxy| pa | J ) +l]+lY7o7(i)7oY+0(lnpo/?o)}, 

PQ-MW X~*00 

lim limA( 7 (i )5 p, p') = (g r yi6ir J ) {— i Y7r7a)7 r YD n (^VI Po' I 2 ) + 5 ]+? Y7o7(i)7oY+0(l n Po'/po')}. 

PQ f ~* too X-*eo 


(All) 


The box diagram is convergent even without regu¬ 
larization. In the regulator theory, U ( i) (2 ) is the dif¬ 
ference of two terms calculated with meson masses ju 
and X, respectively, but the term with mass X does not 
contribute in the limit, X-—><». £The situation is similar 
to the second term on the right-hand side of Eq. 
(A8) .3 A little care must be exercised in computing the 
Bjorken limit of U c d ( 2 ). The reason is that, because of 
infrared singularities, the limit qo—nco cannot be 
naively taken under the integrals over the Feynman 
parameters. 17 A detailed study yields 

lim limB a)( j)(p, p', q)=q*r l A(C; p, p') 

(ZO-Weo X-frOO 

+ (gr 2 / 16 tt 2 ) q 0 - 1 {\ 77r7(i)7o7(2)7 r v ln(X s /i ?o I 2 ) 

+5 YC7t 7 (I)7 r 7 (2)70 +707 (1)7t 7 (2)7 r 3 Y 
— 47707(1)707(2)707“[(1)^(2)]}+ O(ln^ 0 /?o 2 )- (A12) 

17 The problem which one encounters here can be illustrated by a 
simple example. Consider the integral J* o 1 dx {A x 2 q 2 -\-B)/(xq 2 -\- C) 2 , 
with A, B, and C constants. In the limit q 2 —>— «, both terms in 
the numerator behave as (g 2 )*"\ although at first glance one 
might expect the second term to behave like (g 2 ) -2 and to 
be negligible compared to the first. 


Note that, according to Eqs. (3) and (All), the 
lnX 2 dependence of A(C; p, p') precisely cancels the 
lnX 2 in the curly bracket in Eq. (A12), as required by 
the absence of lnX 2 dependence on the left-hand side. 
Substituting Eqs. (All) and (A12) into Eq. (A6), we 
obtain, finally, 

lim p', q ) = (1/qa) 

q<)r* *oo 

X[r(C; p, P r ) +A Co “P t ]+O( ?0 - 2 ln ? o), (A13) 

with A Compt as given by Eq. (7c) of the text. 

To conclude this appendix we remark that if, starting 
from the regulated quantities of Eqs. (A7) and (A8) 
and the regulated box-diagram part U C i) (2 ), one took the 
Bjorken limit q<r-*i<x> before letting the regulator mass 
X go to infinity, one would obtain 

lim 2(£) — 0(1/ po ), 

PQ-*ioO 

lim A( 7 (d; p, p')=O{l/p 0 ), 

(A 14) 

lim A( 7 «); p, p') = 0{\/pa'), 

po-*t‘oo 

lim 5(i)(ij(p, p r , q) = (1/qo) A(C; p , p f ) +O(l/q 0 2 ). 

QO-^ioo 
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Table III. Regions of phase space where denominators in 
Eq. (2.19) vanish as ju 2 —>0. w", ft", • • • denote the spatial compo¬ 
nents ($=>1,2,3) of n, ft, •... 

Phase-space region Denominators which vanish 

( 1 ) «"=0 ( w + ft ) 2 , ( w + ft ) 2 

(2) gi' = Q (»+&)*, {p-giY 

(3) ft'=0 («+ft) 2 , ( p-g 2 ) 2 

(4) ft-II* 1 (£-ft) 2 

( 5 ) (£- ft ) 2 

(6) ft 8 1| and ft" || (£-ft) 2 , (£-ft) 2 , (p—gi— ft) 2 

(7) ft"||«* («+ft) 2 

(8) ft 8 ||«‘ _ («+fe) 2 _ 

As a consequence, one finds 

lim r (1)(J) *°“»*= (l/ ?0 ) r(C; *') +O(l/? 0 J ); (A15) 

go-»ioo 

that is, the Bjorken limit in the case of finite regulator 
mass agrees with the naive commutator. This result is 
expected for the regulator theory since the anomalous 
term A Compt is independent of the gluon mass and is 
canceled by exactly the same term (with opposite sign) 
which must be present when the regulator mass is kept 
finite. 

APPENDIX B: FOURTH-ORDER CALCULATION 

In this appendix we consider an extension of our 
previous results to order g r 4 . Unfortunately, repeating 
the general calculation of Appendix A in the next order 
of perturbation theory would require a prohibitive 
amount of work, and therefore will not be attempted. 
Rather, we will content ourselves with the calculation of 
one special case, which is made tractable by a combin¬ 
ation of tricks. The special case is the 5£/ 3 -antisym- 


ND W.-K. T UNG 1 

metric piece of the vector-vector commutator, in the 
scalar- and pseudoscalar-gluon models. There are two 
further restrictions. We consider only the leading 
logarithmic behavior in the Bjorken limit, and we limit 
ourselves to the part of the commutator which, like 
A Compt , i s independent of the three-momenta q, q', p, 
and p' and of the fermion mass m. This second restric¬ 
tion means that we can set q=q'=p = p'=0 at the 
outset, so that we are dealing with the forward Compton 
scattering amplitude, and that we can take the limit 
m —>0 wherever lnm divergences do not appear. (We 
will verify that there are no lnm factors in the leading 
In 1 qo l 2 term.) The restrictions allow us to employ the 
following two tricks, which make the calculation 
tractable: (i) We exploit a connection, provided by 
unitarity, between the Bjorken limit of the forward 
Compton amplitude and current-fermion cross sections. 
This connection becomes especially simple in the tn—> 0 
limit, (ii) For dimensional reasons, In | q 0 | 2 terms in the 
current-fermion cross section (at m= 0) must be 
accompanied by — lnju 2 terms, where ju is the gluon 
mass, so we can study the large- qo | 2 behavior by 
studying the small-ju 2 singularities. The latter arise from 
readily identifiable regions of phase space, and are 
much more easily evaluated than the complete current- 
fermion cross section itself. 

We begin by reviewing the unitarity connection 6,9 
between the current-fermion cross sections and the 
forward Compton amplitude. Since we are only in¬ 
terested in the commutator of two vector currents, we 
set 7(D=7 hX“, 7p>=7A 6 , Jo) and J{%)~ Jy b ‘ It will 

further be convenient to restrict a and b to lie in the 
isospin SUz subspace of SU 3 (a, 6=1, 2); this has no 
effect on the part of the commutator antisymmetric 
in a and 6, and has the virtue of making the charge 
structure of our problem identical to the familiar case 
of pion-nucleon scattering. Denoting to= — we 

may write for the spin-averaged, forward-scattering 
current-“proton” amplitude, 18 



=polynomial+ 



#xe'r*<p\TCJS(x)J*(Q))\p) 



1 

yp-\-yq — m 


y,\ b +y y \ h 


1 

yp — yq—tn 


7,X“) 


+ 2 n i“ i (g 2 , oj)(— g„,+ ^-^+2 , ’j ol (g 2 ,a))(^- ^ • ( B1 ) 

On the third and fourth lines, we have explicitly separated off the Bom approximation and made use of the vector 
Ward identities for ToKt)*(pj p , q ), which imply that the non-Born part is divergenceless. The isospin structure 
of the non-Born amplitudes may be written in the form 

ri^(g 2 ,a J ) = r 1 y+)(g 2 ,a J )i{X%XM+^y-K9 2 ^)^X & ]. (B2) 


The standard forward dispersion relations analysis for pion-nucleon scattering 19 may now be taken over to show 


18 Here “proton” means the £-type quark, and similarly “neutron” means the w-type quark. The actual matrix element is 
obtained by sandwiching Eq. (Bl) between “proton” isospinors. 

» G. F. Chew, M. L. Goldberger, F. E. Low, and Y. Nambu, Fhys. Rev. 106, 1337 (1957). 
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that the amplitudes 7\,2 C±) satisfy the following dispersion relations: 

zy+>(g 2 ,co)-7V+>(g 2 , oo)— r(fto'cirr(g 2 ,«')+wv(g*,<o')][>'-^ 1 +(u'+u)- 1 ], 

•'o 

r,«->(5*, w) = - f da' ZWi ( 3 *, a/) - JPi+(5*, w'JKw'-w)- 1 - (w'+w)- 1 ], 

•'o 

7Y+V, w ) — co [ Wt{f, «') +Wi+{(f, w'JXt"'-")" 1 - (“ , +“) _1 3> ( B3 ) 

J 0 03 

2V->(g*, 03) — co f — [TFr (g 2 , a/) — IFY^g 2 , w')X(w'—to)"*+ (co'+co)" *], 

y 0 03 

with absorptive parts given by 

- (2 ir)*i Z Z <? I 2" v*CV=F*V) 1 7^ | 2-u 2 (/,i±i/, 2 ) | ?) 5 4 (?+g-iV) 

spin(p) iV 

= 2^(8*, «)(-&,+ ^)+2WV=(8* >w )(fc- • ( B4 ) 


In writing Eq. (B3), we have assumed one subtraction 
each for !ri c±) [The subtraction constant IV - Kg 2 * 00 ) 
vanishes by crossing symmetry] and no subtraction for 
T 2 (±) . To second order in g r 2 we have explicitly checked 
the validity of these assumptions. Since the asymptotic 
behavior as 6r-»0 of higher orders of perturbation theory 
will differ from second order only by powers of Into, 
and not by powers of to, we expect these assumptions to 
be true to arbitrary order, and in particular, to order 

S' 4 . 

Let us now set q~p=0, in Eq. (Bl) and 

take the limit g<r-H°°. Using Eqs. (B2) and (B3), 
we find that the right-hand side of Eq. (Bl) becomes 

?o _1 2[X°, X 1 ] 

r 2 

X I- lim 2m / do 1 '[Wr(q t ,o 3 ')-W 1 +(q i ,o 3 ')^ 

+ (term symmetric in a , 6) +0(gtr® lng 0 ). (B5) 

We know that the Bjorken limit of T ( i) ( 2 )*(£, p, q) 
must have the general form 

lim T (D( 2 )* — go" * b ] 

go-*ioo;q=p-=o 

X[7 m 7(JY»'+7^07m+ 2(gM*-“&oS^o)7o/(g 2 /M 2 , W 2 /ju 2 )] 

+ (term symmetric in a, b ) +O(go“ 2 lng 0 ), (B6) 

with/ the difference between the Bjorken limit and the 
naive commutator. Setting ju— v=l in Eq. (B6), 
substituting for the left-hand side of Eq. (Bl), and 
comparing with Eq. (B5), we get a sum rule for/, 

/(g 2 /ju 2 , m 2 /ju 2 ) = lim 2m f do/ 

a 2 -*— 00 J 0 

xcw r r(g* 1 «')-w r i + (g l ,«')]. (B7) 

Equation (B7) can be rewritten in a more useful form 
by recalling that the usual fixed-g 2 sum rule, following 


from the Gell-Mann time-component algebra, is 

0 = f {<?,<>') (B8) 

J 0 03 2 

and is valid to all orders in perturbation theory in our 
models. Multiplying Eq. (B8) by 2 mf and adding to 
Eq. (B7), we get the modified sum rule 

lim 2 I* da' 

q 2 -*~ 00 J 0 

X[L-(g 2 , C o')-^ + (g 2 ,a J ')l (B9) 

with 

L*(g 2 ,«) =2tn[Wi*{<?, 03 ) + (<f/o 3 *)W^(q\ «)1 (BIO) 

the total longitudinal cross section for current-fermion 
scattering. The great virtue of Eq. (B9) is that, in the 
limit m— >0, the longitudinal cross sections are given 
by the simple formula 20 

(2t) 3 

xi Z T,\(p\ri(Jr 1 ^Ur i )\N)\V(p+q-N) J 

Spin(p) N 

(Bll) 

as may be readily verified by comparison of Eqs. (Bll) 
and (B4). We will see that the factor p r in Eq. (Bll) 
enormously simplifies the subsequent calculation. 

We are now ready to proceed with the calculation of 
/ to order gA Before doing this, however, let us illustrate 
the procedure and check the arithmetic done so far by 
using Eqs. (B9) and (Bll) to recalculate the order 
g r 2 result contained in Eq. (11) of the text. To second 
order, the intermediate states which may contribute 
are the single “neutron,” 1S N=n, and the “neutron” 
plus gluon, N= n+g (Fig. 5). Neither of these con¬ 
tributes to L + , and the single “neutron” contribution 
to Lr vanishes to order g r 2 , because the zeroth-order 

2 ° We wish to thank D. J. Gross for pointing this out to us. 
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Table IV. Phase-space regions and pieces of | 2flt | 2 which actually make divergent contributions to Eq. (2.18). 

«•, ft*, • • • denote the spatial components ( 5 = 1, 2,3) of n, gi, • • •. 


Phase-space 

region 


Piece of I SII I 2 


(4) fi'llf 

(5) ft'II P a 

(7) gi*|l«' 

( 8 ) 


| gfu(n) \yp[l/(yp—ygi—yg2)3ll/(yp—ygi)']}u(p) | 2 
I gr*w(«) [ypD-/(y'p—ygi— y*ft)][l/(7*£— 1 v*ft)])«(£) I 2 
I gr^in) ([l/(7'»+7'&)]7'#Cl/(7*^-7*ft)])M(^) I 2 

| gr 2 &(n){[l/(yn+yg' i )']ypLl/(yp—ygi)']}u{P) I 2 


£) u(p), where A is the renormalized vertex part. 
Using the fact that £ 2 — 0, one sees from Eq. (A8) 
that A(yp\n,p) contains only a piece proportional to 
yp and a piece proportional to (yn) (yp) (yn), 
both of which vanish when sandwiched between the 
spinors. So the single- “neutron” contribution is zero. 

(ii) “Neutron” plus one gluon. The “neutron”- 
plus- one- gluon contribution, in fourth order, arises 
from the interference of the first-order diagrams in 
Fig. 5 with the third-order diagrams in Fig. 7. The 
third-order diagrams are clearly of the same structure 
as the diagrams in Fig. 1, which we have already 
evaluated in our general treatment of the order — g r 2 
case. We note first that, because of the factor yp, the 
contributions of Figs. 7(a) and 7(b) vanish. Thus, 
just as in the case of the first-order matrix element, the 
terms containing (yp-\-yq)~ la: (2—co) -1 vanish, and 


9 3 


ix. 


ft* 9 - X 


P + q $ 

(1)i(2) 




g 3 




Fig. 9. Diagrams of order g r 2 contributing to the trident inter¬ 
mediate states. 


as a result the integral over a/ in Eq. (B9) converges, 
even for vanishing gluon mass ju 2 . This means that any 
ln/ 1 2 singularities in/ must result from ln/ 1 2 singularities 
in Lr itself. 

To evaluate the contribution of Figs. 7(c)-7( f), we 
calculate the renormalized self-energy and vertex parts 
2 and A, by performing the usual mass and wave- 
function renormalizations on the unrenormalized quan¬ 
tities of Eqs. (A10). Note that the renormalized 
quantities contain no dependence on the cutoff \, 
guaranteeing the validity of our dimensional argu¬ 
ments. In the treatment of the gluon vertex correction 
in Fig. 7(f), a subtlety arises. Instead of subtracting 
the vertex part at g 2 =ju 2 , as required by the Watson- 
Lepore 22 convention, we subtract at g 2 = 0. The dif¬ 
ference between the two methods of subtraction makes 
a contribution to Lr which is proportional to ln(w 2 /ju 2 ), 
but which, for fixed co, is independent of g 2 and therefore 
can be dropped. This is the only place in the entire 
calculation where we encounter lnw 2 terms and where 
the presence of a lnju 2 term does not indicate the 
presence of a term — lng 2 . When the gluon vertex part 
is subtracted at g 2 ~ 0, the m^O limit is finite, and our 
usual dimensional argument applies. On substituting 
the expressions for 2 and A into the third-order matrix 
element, we find that the integration over the inter¬ 
mediate state («+g) variables is always convergent, 
so that lnju 2 terms in i L~ can only arise from the explicit 
lnju 2 dependence of 2 and A. We then find for the con¬ 
tributions of the various diagrams to Lr, 

limZ^( C )' = finite, 

M 2 -* 0 

Kml^(rf)“ = (gr 2 /47r) 2 (co/647r 2 ) lnju 2 +finite, 

(B15) 

limine) - — — (g r 2 /47r) 2 (co/647r 2 ) lnju 2 -|-finite, 

m 2 -*o 

lim Li(fj~ ~ — 2(gr 2 /47r) 2 (co/647r 2 ) lnju 2 -|- finite. 

m 2 -*o 

Next, we must examine the contribution of the box 
diagrams of Figs. 7(g) and 7(h). We deal with these 
diagrams by writing them in Feynman parametrized 
form and substituting into the expression for Lr . For 
example, the contribution of Fig. 7(g) to Lr is pro- 

22 K. M. Watson and J. V. Lepore, Phys. Rev. 76, 1157 (1949). 




R22 


309 


2858 


S. L . ADLER AND W.-K. TUNG 


1 


portional to 

r^max /*1 /*! /*1 

/ dP / ato / zds / dy 

■'O •'o *^0 •'o 

X {(1/Z> 0 2 ) 2^[(1 — z+ 3 » 2 )*/— ysiJ][(l—#) m 2 
+2^s(l — y)v]+(2/Z) 0 ) [v[l — 2x(l —s+ys)] 

~h 2xyzv ]}, (B16) 

Z> 0 = m 2 [>- l+afys(l-*)]+*(!“*) (l“ s ) (?+?) 2 

—^(1 —y)z[ 2 (l—z+ys)v— 23 BV— (l-s) (£+?) 2 ], 
v=p>q, V=p‘g. 

For general values of q*p and a singularity of Eq. 


(B16) at ju 2 —0 can only arise from the integration 
end points £=0, #=(), x—l, •••, y= l . 23 A careful 
analysis of the behavior of Eq. (B16) at these end 
points in all possible combinations shows that there is 
no lnju 2 term as ju 2 -—>0. A similar analysis yields the 
same result for Fig. 7(h), so we get, finally, 

limi 7 ( ff) - =!linLZ^( h) - = finite. (B17) 

/i 2 -*0 n 2 -*0 

This completes our analysis of the “neutron" plus 
one gluon intermediate state. 

(iii) “Neutron” plus two gluons. The “neutron” 
plus two gluon contribution arises from the square of 
the second-order matrix element corresponding to the 
diagrams in Fig. 8 . We have 


( 2 *)'i E E S^^\S^hj^b 5i{p+q ^ g ^ gi)W> (B18) 

o 2 spin(p) spin(n) •' {2iry n° 2 J (27r) 8 2gi°J (2T) 8 2 g 2 ° 


d z n m 1 f (Pgi 1 f d*g 2 1 


t 

with 

3K= g r *w(«) (- 7 -- 7* P 


1 


+ 


1 


y.p 


1 


+yp 


1 


1 


\yn+yg z yp—ygi yn+ygi 7*£“7*g2 yp-ygi—yg 2 yp-ygi 

+yp —--- -V(P)- 

y p — y gi—ygzy p — y gv 


(B19) 


Only four terms appear in 9TC because the contributions 
of the two diagrams on the first line of Fig, 8 are pro¬ 
portional to (yp J ryqT x ypu{p)> and therefore vanish. 
Just as before, this means that the integral over os' 
in Eq. (B9) converges, and any lnju 2 behavior in 
/ 2 giuon must originate in L 2 ~giuon itself. Possible diverg¬ 
ences in L %~giuon as fi^O arise from the eight regions 
of three-particle phase space listed in Table III, where 
denominators in the matrix element of Eq. (B19) 
vanish. To extract the divergent part, we make a 
careful study of the behavior of the integral of Eq. 
(B18) in each of the eight regions. In this connection, 
the following simple inequality is very useful: Let p be 
a null vector and let Q(—gi, g 2 , gi+g 2 ) be timelike 
with p°> 0 and Q°>0. Then we may write 

(yp) (yQ)= z P*Q+hy«yfiT afi > 

T a $= p a QP—p 0 Q a , (B20) 

with the following simple bounds on T a $: 

| T AB I <[4p D G°p*Q] 1,2 ) A, B, = 1, 2, 3 

I T A0 \<l2(p-Qy+^p 0 Q 0 p-QJ 12 - ( B21 ) 

In other words, for small p-Q, the 7 -matrix product 
(yp)(yQ) is always bounded by (p'Q) 112 - Applica¬ 
tion of this inequality shows that many of the po¬ 
tentially divergent phase-space regions actually make 
a finite contribution to Eq. (B18), and that the only 
divergent contributions come from the phase-space 
regions and pieces of | 31Z | 2 shown in Table IV. Evalu¬ 
ation of the spin sums and phase-space integrals show 


that regions (4) and (5) each make a contribution to 
Lr of 

“ i(g 2 / 4tt) 2 (os/ 647T 2 )[ln( 4 w) + (2/co)— 1] In ju 2 +finite, 

(B22) 

while regions (7) and ( 8 ) each make a contribution of 

“i( gr 2 / 4f) 2 (co/647r 2 ) ln M 2 , (B23) 

giving a total of 

limLaTgiuon— - (gr 2 /47r) 2 (co/647r 2 )[ln(^)) + (2/as) — 

p 2 -*0 

Xln/i 2 +finite. (B24) 

(iv) Trident. The three trident contributions arise 
from the squares of the second-order matrix elements 
corresponding to the diagrams of Fig. 9. In Table V 
we list the momentum labeling for each of the three 
states and indicate to which L it contributes. The 

Table V. Four-momentum labeling for trident production. 


Four-momentum label 

Trident state gi gz gz Contributes to 


(1) 

n 

P 

P 

L~ 

(2) 

n 

n 

n 

It 

(3) 

P 

P 

n 

L + 


23 T. Kinoshita, J. Math. Phys. 3, 650 (1952). 
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Table VI. Phase-space regions and pieces of | | 2 which actually make divergent contributions to 

Eq. (3.25). gi*, gi', and denote the spatial components (s- 1, 2,3) of g h g 2j andgz. 


Phase-space 

region 


Piece of | 3H 


Occurs in 


g 2 * II g&* I gr^igdy'PO-/ yg*) ]«(/ , )[i/((g2-b&) 2 '-/* 2 )Jw(g 2 )i'(g8) I 2 I sw fl) j 2 , | srw j 2 

gi'llgi* \gM{gi)ypO-/(yp~y'&~ 7'g3)]«(^)[l/((ft+^) 2 —/* 2 )]#(giM&) I 2 |2m (2) l 2 


matrix element for state ( j ) (/= 1, 2, 3) receives contributions from only those diagrams in Fig, 9 which 
are labeled below with ( j ). We find [the factors of \ in Eq. (B26) are statistical] 


Z* 


trident — 



E 

spm(p) 


E 

epin({7i,{72.{73) 


f d 3 gi m r 

J (2rr) 3 gi° J 


d 3 gt m r 

(2tr YgfJ 


d 3 g 3 m 

(2x)V 


S 4 (p+q~gi-g 2 -g 3 ) | OTf | 2 


(B25) 


with 

| 9E- | 2 = | 9Tl (1 > | 2 +i | 9Tl (2 > I 2 , | 9R+ | 2 =i | 9Tl (3> | 2 , (B26) 

9R (1> =gr 2 {^(g07-?(7 , ?-7*^2-7 , g3)" lw (^)[(g2+g3) 2 -M 2 ]” 1 ^(g2)f(g3) 

+u(g2)w(/')[(^-g2) 2 -M 2 ]“ J w(gi)[-l/(7'?-7 , gi-7*g2)]7*^(g3) 

+w(g 2 )w(^)[(^-g2) 2 -M 2 ]“ 1 ^(gi)7 , ?(7*^-7 , g2-7*g3)" 1 ^(g3) }, 
9R (2) = gr 2 {'U(gl)7 , ?(7*/'-7 , g2-7 , g3)“ 1 ^(?)[(g2 + g3) 2 -M 2 ]" 1 ^(g2)v(g3) 

+w(g 2 )7'?(7 , ?-7'gi-7'g3)" 1 «(?)[(gi+g3) 2 -M 2 Ii" 1 w(gi)v(g3)}, (B27) 

9R (3) =gr 2 {'U(g 1 )«(?)[(^-gl) 2 -M 2 ]' 1 w(g2)[- 1/ (y* p 7'gl 7'g2)]7* pv{&) 

+u(gz)u(p)[(p-gi) 2 - (yp-y gi-y g 2 )yr pv(gz) 

+u(gi)u(p)l(p-gi) 2 -^ l u(gz)y- p(y- p-y gi-y 

+u(g2) i u(p)[_(p~g 2 ) 2 ~fi 2 2~ 1 u(gi)yp(yp~yg2~ygz)~ l v(g i )} r 


The two diagrams on the first line of Fig. 9 make no 
contribution to the matrix elements since they contain 
the factor {yp+yq)~'ypu{p) = 0, and as before, 
this means that divergences in /trident must originate in 
Z^trident themselves. Potential divergences in L T trident 
are associated with special regions of three-body phase 
space where denominators in Eq. (B27) vanish. In 
studying the actual behavior of Eq. (B25) in these 
regions, we use the inequality of Eq. (B21) and the 
estimates 

u(gi, 2 )u(p) I cc (gi,rp) 112 asgj i2 *£“»0, 

I w(gi, 2 )v(g 3 ) | 0X1 (gi. 2 *gs) 1/2 asg 1(2 ’gir-»0. (B28) 

We find that most of the dangerous phase-space regions 
actually give finite results in the ju 2 —>0 limit, with 
logarithmic divergences coming from the regions of 
phase space and pieces of | 31Z (1,2> | 2 shown in Table VI. 
Evaluation of the spin sums and phase-space integrals 
gives the result 

limL+trident^ finite, 

M 2 -*0 

limZr 

trident — 4(gr 2 /47r) 2 (oj/647r 2 ) ln M 2 +finite, (B29) 

with | of this result coming from the phase-space region 
g 2 ‘ || gz a and \ from the region gP || g 3 a . 


This completes our analysis of intermediate states 
which contribute in order g r 4 . Adding up the contribu¬ 
tions from Eqs. (B15), (B24), and (B29), we find, for 
the total fourth-order contribution, 

limZ, + (g® co) = finite, 

lim L~ (if, = g r 2 /47r) 2 (a >/ 64tt 2 ) (B 30) 

M 2 -*-0 

X[ln(£co) + (2/co) +^] ln/i 2 +finite, 
which, by our dimensional argument, implies that 

lim L+{f ) co) = finite, 

^ »- oo 

lim L- (f, a,) = (grV47T) 2 (a ) /64 1 r 2 ) (B31) 

q%-+- oo 

X [ln(Jco) + (2/co) +^] ln^/ju 2 ) +finite. 

Substituting this result into Eqs. (B6) and (B9) 
yields the fourth-order Bjorken limit quoted in Eq. 
(12) of the text. 24 


24 A fourth-order calculation of the longitudinal cross section 
in the inequivalent limit in which | q 2 1 and w _1 simultaneously 
become large has been given recently by H. Cheng and T. T. Wu, 
Phys- Rev. Letters 22, 1409 (1969). 
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Excerpt from S. L. Adler, Anomalies in Ward Identities and Current Commutation Relations, 
in Local Currents and Their Applications, Proceedings of an informal Conference, D. H. Sharp 
and A. S. Wightman, eds. (North-Holland, Amsterdam and American Elsevier, New York, 
1974). Reprinted with permission from Elsevier. 


2.4. Questions raised by the breakdown of the BJL limit 

Experimentally, Bjorken scaling works very well and o s /o T = 0.1810.05, 
i.e. the longitudinal cross section is small. So renormalized perturbation theory 
seems to be a bad guide here. This state of affairs raises several questions. 

(i) Is it only the perturbation expansion that is at fault, or does the trouble 
lie in local field theory itself? Bitar and Khuri[4] have studied the BJL limit 
using only analyticity and positivity. They find that class I intermediate state 
(fig. 10) violate the BJL limit for space-space commutators, but cannot rule out 
a cancelling contribution from class II intermediate states (fig. 11). 






Fig. 11 
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(ii) Can one make a consistent calculation^ scheme in which Bjorken 
limits, the Callan-Gross relation and scaling are all valid? This is a real challenge 
to theorists. The Lee-Wick theory, for example, doesn’t do the job: The BJL 
limits are satisfied but complex singularities change dispersion relations in such a 
way that the Callan-Gross relation is still violated. Perhaps a successful approach 
would involve summation of perturbation theory graphs plus use of the 
Gell-Mann-Low eigenvalue condition (see sect. 3). 

(iii) The same questions apply to light-cone algebra, which is basically the 
BJL limit in the p 0 -► 00 frame, as in the Callan-Gross derivation of their sum 
rules. 


3. Anomalous scaling 

Consider a field theory with a dimensionless coupling constant. When all 
energies become much greater than particle masses, one naively expects the 
«-point functions to scale — i.e. to become mass-independent apart from an 
overall factor. In this section we discuss the formal theory of scaling[5] and its 
breakdown in field theory. 

3.1. Formal theory of scaling 

The infinitesimal generator of dilations, 5 D , transforms coordinates as 
follows: 


6 d * 4 = ( 49 ) 

Under the transformations (49), a field \p transforms as 

~ + d)y > ( 50 ) 


Where d is the “scale dimension” of \p. Scale invariance for renormalizable field 
theories results if we take 


d = 1 


d = 


for bosons. 


for fermions, 


(51) 


In simple canonical field theories it is possible to find a conserved, symmetric 
energy-momentum tensor 6 which can be used to define a “dilation current” 
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D 

4 




The energy-momentum tensor 0^ p is constructed so that its trace is proportional 
to the mass terms in the Lagrangian. Thus . 




4 



Z 

/' 



dfip. - 4£ 
ri m 



where L m = mass term in the Lagrangian (the only term which breaks scale 
invariance). Therefore, the dilation current D is conserved when the theory is 
scale-invariant. Even when D is not conserved*/ the “charge” 


D(t) = /d 3 x D 0 (.x, t) 



acts as a generator of dilations: 

<pj(x, /)] = (x x 9 X + t). (55) 

The relationship 

(56) 

is the scale-invariance analog of PCAC. Like PCAC, it can be used to derive 
low-energy theorems; in the present case for the emission of gravitons in an 
arbitrary process. 

Now consider a single scalar-meson field y with a y self-interaction. Let G(p) 
be the renormalized propagator and r(p,q) the 6^ vertex function. From the 
usual definitions of these quantities we have 

G(P) r (p, q)G(p + q)\^ Q 


= /d 4 x d 4 y e'P-y < 0 |T*(v<yV(O)0j (x)) |0> |^. 0 (57) 


= /d 4 xd (y(y)y(P)DJ - 5 (x 0 -y 0 )U>o(*),*OOMO) 

-vO)«(*o)[^o(*).v(P)n|^o • ( 58 ) 

Integration by parts shows that the first term on the r.h.s. is zero when q = 0, 
so eq. (58) becomes 
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G{p)T{p,Q)G{p) = -/d 4 yz'P'y {5(x 0 ->> o )<0| [D(x 0 )^{y)] ^(0)|0 > 

+ 8(x o )<01 <p(y) [D(x o ),i^(0) ] | 0 > } . (59) 

The quantity in curly brackets on the r.h.s. can be related back to G(p) using 
the dilation generator commutator [eq. (55)], and in this way one gets 

-iT(p,0) = p\G~\p) + ( 2d-4)G~\p ). (60) 

Recall that T(p, 0) is the three-point function shown in fig. 12, Weinberg’s 
theorem says that fy?, 0)~ polynomial in log p as p -► -°°. Thus, neglecting 



YXp , 0) as p 00 in eq. (60), we find 

P\ G~J (P) = (4 - 2d) G^ 1 (p) . (61) 

This implies that G^ satisfies the scaling law 
G~J = A(p 2 ) 2 ~ d 

2 (62) 
- A p when d = 1. 

One sees from this that the neglect of T(p, 0) in eq. (60) was justified. 

Unfortunately, we know that the scaling behavior for Gj, predicted by the 
above chain of formal arguments is not correct in renormalized perturbation 
theory, where we find that 

Gn = p 2 X [power series in log p 2 ] . (63) 


Thfe reason for the breakdown of the formal theory of scaling is not hard to find. 
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To make the formal manipulations valid, we need to put in regulators. But the 
regulators involve large masses , which necessarily break scale invariance. The 
trouble arises because (0 JU ) iregl does not go to zero as the regulator masses go to 
infinity. This behavior is of course analogous to what one finds when one looks 
at the WA Ward identity from the regulator viewpoint. 

3.2. Correct scaling relations: The Callan-Symanzik equations 

Straightforward dimensional arguments require that G 1 (p ) depend on the 
particle mass as follows 

G~‘(p) = SG~ l [pV ] , (64) 

so that we have the identity 

P v G ~ 1 ( p ) = 2G ~ 1 (p) ~ P ~— G ~ 1 (p). (65) 

0 Pp d jj. 

The naive scaling relation, eq. (60), can thus be written 

-iT(p, 0) = (- m -^- + 2 t)G' 1 (p), G~ l (p), (66) 

where y = d- 1 (=0 when the “scale dimension” d equals the naive canonical 
dimension = 1). 

Callan and Symanzik[6] have shown that a correct version of the scaling law 
(66) is given by 

- iT(p,0) = (-p™ + 27(X) + (3(X)-^-)G' , (p) (67) 

04 0 A 

where X is the renormalized coulping constant, y(X) = d(X) - 1 and d(K) is the 
“anomalous” scaling dimension of the theory. If the J3 term were zero, one 
would find 


= A(p 2 ) 2 ' d = Aip 2 ) 1 , (68) 

i.e., scaling with anomalous dimension d(X). 

A simple scaling law like eq. (68) does not result from eq. (67) with J3 # 0. 
The limit of eq. (67) then becomes 
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°~ ( -^ + W) + PW~)G-'(p), (69) 

which can be integrated and gives only the much less restrictive asymptotic 
predictions of renormalization group theory. 

3.3. Remarks 

We conclude with several comments on the foregoing results. 

(i) The statement J3(X) = 0 is closely related to the Gell-Mann-Low 
eigenvalue condition*. 

(ii) If /3(X) = 0, with a simple zero A 0 , then 

(p) = A( P 2 y-y^o) (70) 

In other words, the asymptotic behavior is determined by the bare coupling 
constant A 0 , independent of the value of the renormalized coupling constant 
X[7], 

(iii) Several models with J3 = 0 show scaling properties. For example, this is 
the case for both the Johnson-Baker-Willey model [8] of quantum 
electrodynamics and for the Thirring model[9] (with massless or massive 
fermions). Note that neither of these models has a vacuum-polarization 
structure, which for a fermion theory implies /3 = 0. 

(iv) The Callan-Symanzik equation (67) can be used to prove the 
momentum space version of Wilson’s operator product expansion (in pertur¬ 
bation theory), and this can be used to study the anomalous BJL limit [10]. 

(v) An interesting question is whether the Callan-Symanzik relations can 
be used to do graph summations for objects more complicated than propagators, 
for example, for fermion inelastic structure functions [11 ]. 


References 

[1] S.L. Adler, Perturbation theory anomalies, in Lectures on elementary 
particles and quantum field theory, ed. S. Deser, M. Grisaru and H. 
Pendleton (M.I.T. Press, Cambridge, Mass., 1970). 

[2] A Zee, Phys. Rev. Letters 29 (1972) 1198; B. Schroer and J. Lowenstein, 
Phys. Rev. D7 (1972) 1929. 

[3] J. Wess and B. Zumino, Phys. Letters 37B (1971) 95. 

* For a discussion see ref. [7] and references cited therein. 


R23 


317 


168 Stephen L . A dler 

[4] K, Bitar and N.N. Khuri, Phys. Rev.D3 (1971) 462. 

[5] S. Coleman and R. Jackiw, Ann. ofPhys. 67 (1971) 522. 

[6] C.G. Callan, Phys. Rev. D2 (1970) 1541; K. Symanzik, Comm. Math. 
Phys. 18(1970)227. 

[7] S.L. Adler, Phys. Rev. D5 (1972) 3021. 

[8] S.L. Adler and W.A. Bardeen,Phys. Rev. D4 (1971) 3045. 

[9] A.H. Mueller and T.L. Trueman, Phys. Rev. D4 (1971) 1635. 

[10] C.G. Callan, Phys. Rev. 12(1972)3202. 

[11] N. Christ, B. Hasslacher and A.H. Mueller, Phys. Rev. D6 (1972) 3543. 


In reference [10], Phys. Rev. 12 
should read Phys. Rev. D5. 


318 


Adventures in Theoretical Physics 


PHYSICAL REVIEW D VOLUME 15, NUMBER 6 15 MARCH 1977 
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We relate the energy-momentum-tensor trace anomaly in spin-1/2 quantum electrodynamics to the functions 
/3(a), 6(a) defined through the Callan-Symanzik equations, and prove finiteness of when the anomaly is 
taken into account. 


I. INTRODUCTION 

Spin-£ quantum electrodynamics, characterized 
by the Lagrangian density 1 

£i„,(x) = i P(x)(iy ■ 3 -m 0 )ij](x) - i F uv (x)F uv (x) 

- e a 4><jc)Y u 4){.x)A “(x) , (1.1) 

is one of the simplest field theory models in which 
to study anomalies. The axial-vector divergence 
anomaly in this theory has been extensively 
analyzed 2 ; we wish in this note to discuss some 
properties of the energy-momentum-tensor trace 
anomaly. 3 Taking for the energy-momentum ten¬ 
sor 6 UV the symmetric form 

6 =6** + 6 r 
UU v /i|; 1 fiU I 

uv f X 'F'*-f xu f\, (i.2) 

fl" = * i[’i>Y l i(& v +ie 0 A v )4i+ lj%,(3 U + ie 0 A u )4> 

e 0 A v )y u ip-^(a il -ie ll A ll )yj] , 

a simple application of the equations of motion 
gives the so-called “naive” trace formula 

= . (1.3) 

As has been shown by the authors of Ref. 3, Eq. 
(1.3) is not correct as it stands, but instead must 
be modified by the addition of an anomalous term 4 
proportional to Z 3 ~ 1 F Xa F > *. Our aim in this paper 
is to derive an explicit formula for the trace 
anomaly, valid to all orders in perturbation theory, 
expressed in terms of the functions /3(a) and 6(a) 
of the fine-structure constant defined through the 
Callan-Symanzik equations. 

In Sec. n we give a simple heuristic derivation 
of our result, which, as we shall see, is most 
naturally written in terms of a subtracted operator 
N[F Xa F Xor ]. There, we will be thinking in terms 
of using massive regulator fields. Some related 


details are given in the appendixes. 

Then in Sec. IH we will give a more careful 
derivation using normal-product methods 5 and 
dimensional regularization. 0 in n space-time 
dimensions, we have 

6 U U = -(n-4)£ ln ,-3(|i^i|)-w; 0 #)+m 0 W . 

(1.4) 

The anomaly is the term - (w - 4)£ lav , which would 
vanish if <C lav were finite. We wish to express the 
anomaly in terms of renormalized operators. 

Our derivation will give as a byproduct a proof 
that 0 UV as defined by Eq. (1.2) is finite to all 
orders of perturbation theory even when the trace 
anomaly is taken into account. The earlier proof 
by Callan, Coleman, and Jackiw 7 is incomplete, 
while the one by Freedman, Muzinich, and Wein¬ 
berg 8 is not directly applicable to our case. 

II. HEURISTIC DERIVATION 

The heuristic derivation is obtained by writing 
down an operator formula for the trace equation 
and then determining the unknown coefficients ap¬ 
pearing in this formula by studying its electron- 
to-electron and vacuum - to - tw o -p hoton matrix 
elements. As our initial operator ansatz let us 
write the most general linear combination of 
gauge-invariant scalar C-even operators with the 
correct dimensionality, 

*u u = C 1 m 0 W + C 2 Z 3 -'F >a F*' 

+ C 3 %i[ipy(d + ie 0 A)ip-ipY'(d-ie 0 A)ip 

-2w 0 #] - (2.1) 

The coefficient of C 3 is formally zero by use of the 
equations of motion; it represents a discontinuous 
contribution which is present at zero momentum 
transfer, but which vanishes for nonzero mo- 
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mentum transfers, and hence does not contribute 
to physical matrix elements. The precise structure 
of this term will be determined in sec. m, but we 
will ignore it in the heuristic discussion which 
follows. Focussing on the first two terms, it is 
easy to see that either or C 2 is infinite, or Eq. 
(2.1) cannot be correct as it stands. The reason 
is that both 6 U U and w 0 # are finite operators 9 
(that is, their matrix elements are made finite 
by the usual electron and photon wave-function 
renormalizations), whereas a simple calculation 
shows that the lowest-order diagrams (illustrated 
in Fig. 1) contributing to the electron-to-electron 
and vacuum-to-two-photon matrix elements of 
Z^'F^F** are logarithmically divergent, and 
hence cannot be made finite by wave-function re¬ 
normalizations alone. This problem is analyzed 
in more detail in Appendix A, where it is shown 
that if a photon regulator is introduced to make 
the diagrams of Fig. 1 finite, then energy-mo¬ 
mentum-tensor conservation requires the intro¬ 
duction of extra contributions, proportional to the 
mass squared of the regulator field, in the 0 UU - 
regulator photon vertex. These terms may be 
thought of as arising from the energy-momentum 
tensor of the regulator field. In the limit of in¬ 
finite photon regulator mass these contributions 
survive and, in lowest relevant order, give a 
second logarithmic divergence, which just cancels 
the logarithmic divergence of the diagrams in 
Fig. 1. Thus, C 1 and C z remain finite, and the 
correct form of Eq. (2.1) is actually 

e u “=c lWo # + c 2 w 0 [F XjJ F x< ’J 

+ discontinuous terms , (2.2) 

with N 0 [F Xa F*°] a subtracted form of the operator 
Z 3 ~ 1 F Xa F Xar . Once it is apparent that a subtracted 
operator appears in Eq. (2.2), it is convenient to 
reexpress this operator in terms of another sub¬ 
tracted operator N[i i \ (r F Xff ] defined by 

< e (/>)|jV[F Xc F^]| e (/>')) 

(2.3) 

, <01 z*’ 1 F X .F*' I r(p, <1 )y(-P, «2)> 

through a relation of the form 
N 0 [ x '] = aN[ F» ]+ ton,# 

+ discontinuous term . (2.4) 

— t 



(o) 



permutotions 


FIG. 1. (a), (b) Logarithmically divergent electron 
and photon vertex parts, respectively, of the operator 
Z 3 ~ 1 F Xo F X o1 , the coupling of which is denoted by <8>. 
wavy lines indicate photon propagators, and solid lines 
indicate electron propagators. 


This leads to the final operator form for the trace 
equation 

+ discontinuous term , (2.5) 

with the subtracted operator NlF^F* 1 ] uniquely 
specified by the conditions of Eq. (2.3). 

We proceed now to determine the coefficients 
K x and K z in Eq. (2.5) by taking matrix elements 
of Eq. (2.5) between appropriate sets of states. 
Taking first the matrix element between electron 
states in the limit of zero momentum transfer, and 
using 10 

<e(/>) 10/|e (/>')) n u ‘’( £ JtP ‘ , 2n Up 

( 2 . 6 ) 

and Eq. (2.3) we find 

K 1 (e{p)\m 0 ipi!)\e{p))=y)i . (2.7) 

However, as shown by Sato 11 and as explained in 
Appendix B, it is easy to see from the Callan- 
Symanzik equation for the electron propagator that 



(e(p) |w 0 # | e(p)) = 1+ , (2.8) 

with 6(a) the function of the fine-structure con¬ 
stant a defined by 12 


1 + 6 (a) = —- 
w? 0 


Bm 


(2.9) 


Combining Eqs. (2.7) and (2.8), we conclude that 13 


tf l = l+6(a) = l + 


3a 

2tt 


+ • 


( 2 . 10 ) 


Next we take the matrix element of Eq. (2.2) be¬ 
tween the vacuum and the two-photon state, again 
in the limit of zero momentum transfer. Now as 
Iwasaki 14 has shown, the general form of the 
vertex (O|0 1J „ \y(p u q)) is 


(ok, \vtpu eMPz, € 2 )>=[s(n^p+n'n,)kTuf lx ^J'K<7 2 ) 


<i=Pi+Pz, />i 2 =/>z 2 =o, 


+ f lla n,(/>i-/> 2 )»(ft ~Pz)oB{q 2 ) + \ (n« n s + f 1. F 2 uS )q VC ( q 2 ) , 

^.0 = — (£<)#( € i)o> i = l)2 . 


( 2 . 11 ) 
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As Iwasaki notes, Eq. (2.11) implies that the 
vacuum-to-two-photon matrix element of 6 U U is 

«*)> 

= Ui-€ 2 /> 1 */> 2 -€, •/> 2 € 2 */> 1 ) 

x^[-2B(^) + lC(^)] , (2.12) 

which vanishes at ^ = 0. Hence, from the vacuum- 
to-two-photon matrix element of Eq. (2.5) we get, 
using Eq. (2.6), 

0= [l + 6(a)](0 |w 0 #|y(/>, e 1 )y(-/>, e 2 )) 

+K 2 (0 \Z 3 ~ 1 F Xa F* a \y(p, *M-P, * 2 )>tr« • 

(2.13) 

Now as shown by Adler et a/. 15 and again as ex¬ 
plained in Appendix B, from the Callan-Symanzik 
equation for the photon propagator one sees that 

<0|»i 0 #|y(/>, € 1 h'(-/>, e 2 )> 

. 1 g(«) 

4 1 + 6(a) 

X (0 \Z 3 ~ 1 F >ff F Xa \y(Pi €i)r(-P, € 2 )) tree * (2.14) 


renormalized in higher orders of perturbation 
theory, and in fact would vanish, leaving only the 
"soft” operator [l + &(a)]m 0 # as the trace, if 
/3(a) satisfied the eigenvalue condition 12,19 

/3(a) = 0 . (2.19) 

Second, whereas the axial anomaly involves the 
divergent operator Z 3 ~ 1 F ta F TP € (aTP , with the con¬ 
sequence that matrix elements of j 5 u are not re¬ 
normalized by wave-function renormalization 
factors alone, the trace anomaly involves the 
convergent (once-subtracted) operator N[ *\ CT .F Xa ], 
consistent with the finiteness of matrix elements 
of the energy-momentum tensor. The appearance 
of a subtracted operator in Eq. (2.17), as well as 
closely analogous results of Lowenstein and 
Schroer in <f > 4 scalar field theory, 19 suggests that 
it should be natural to derive Eq. (2.17) within 
the framework of the normal-product formalism. 5 
This is the subject to which we now turn. 

III. NORMAL-PRODUCT DERIVATION 


with /3(a) defined by 12 * 16 


Q/ v 1 da 

0(a) = - m -— 
w a dm 


= — [ 1+ 6(a)] 


a 


rth 


da 

dm, 


2a a 2 

— 4. >■ > 4. • • * 

3tt 2tt 2 


(2-15) 


Comparing Eq. (2.13) with Eq. (2.14), we learn 
that 


*. = 4/H«), (2-16) 

and thus our final result for the trace equation is 

0/ = [ 1 + 6(a) ]»»„#+ i/3 (a)N[ F Xa F'*] 

+ discontinuous term . (2.17) 

The first two terms in the power-series expansion 
of the coefficient of the F Xa F Xa term in Eq. (2.17) 
agree with the fourth-order calculation of Chano- 
witz and Ellis. 17 

The above derivation is evidently closely anal¬ 
ogous to the derivation, 18 by use of the Callan- 
Symanzik equations, of the nonrenormalization 
theorem for the axial-vector divergence anomaly 

+ gF"W^W€ t ,„. (2.18) 

However, there are two important ways in which 
the trace anomaly differs from the axial-vector 
divergence anomaly. First, the trace anomaly is 


In all subsequent discussion we assume that the 
vacuum expectation value of any operator we con¬ 
sider has been implicitly subtracted off. 

In this section we will express 9 U U as a linear 
combination of normal-product operators. Under¬ 
lying this derivation are the following two observa¬ 
tions: 

(1) The expression for 6 U U in terms of normal 
products is determined entirely by its insertions 
at zero momentum into Green’s functions: The 
only operators that can occur are gauge invariant 
and of dimension at most 4; but the only such 
operator which vanishes at zero momentum is 

and t * lls operator has the wrong charge- 
conjugation properties. 

(2) The Callan-Symanzik equation is the Ward 
identity which expresses the nonconservation of 
the dilatation current 20 and the divergence of the 
dilation current is essentially Q u u . So, if we ex¬ 
press this Ward identity in terms of an insertion 
of Jo u u d 4 x J then comparison with the Callan- 
Symanzik equation in its standard form will give 
9 U U (at zero momentum) in terms of renormalized 
operators. 

We will use dimensional renormalization 21 to 
define both the normal products and the re¬ 
normalized Green’s functions. This is by no 
means essential: All that is required is that the 
subtractions performed implicitly by the normal 
products agree with those obtained by an explicit 
redefinition of the fields and parameters of the 
bare theory. 

We will frequently consider insertions at zero 
momentum of operators in Green’s functions. In 
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Lowenstein’s 22 terminology these are differential 
vertex operations (DVO’s). 

First we must define the theory by adding a 
gauge-fixing term 

JSrf = - (3.1) 

to the Lagrangian so the theory is given by 

«C = «C lnT +«C gf . (3.2) 

As usual ^ is renormalized by writing 

£o = Z 3 £ r . (3.3) 

We are now ready to start the proof. 

Consider the equation of dimensional analysis 
for an unrenormalized (but dimensionally regu¬ 
larized) Green*s function G 0 : 

o= ( K l^-^ +w °i- 0 + < 2 -^ )e ° a^) G °- 

(3.4) 

Here D c is the mass dimension of G 0 . 

By the action principle we can express d/de 0 
and d/dnt 0 in terms of operator insertions. Thus, 


3)77 0 = - (0) ’ 

(3.5) 

= -ieJAf(0) , 
oe 0 

(3.6) 


where the superscript tilde means that the opera¬ 
tor has been Fourier transformed into momentum 
space. Then 


the number of external photon lines. Also, m is the 
unit of mass, 21 which is used by dimensional regu¬ 
larization to make explicit the dimension of e 0 , 
while keeping dimensionless the renormalized 
charge e; thus we have e 0 = ;x 2 “ n/2 eZ 3 (e, w) -1/2 . 

These identities are for operations applied to 
Green’s functions, i.e., for DVO’s. 

We can now write 

0/(0) = (2 - ln)iN r + i(l - n)W e 

+ {(2 - i n)(3 • A )7£ 0 + w 0 # 

+ (2 - in)e 0 ipA<p}~ (0) . (3.10) 

Notice that the right-hand side of (3.10) contains 
(a) the operators occurring in Eq. (3.7), (b) N e 
and iV y , which have been expressed in terms of 
renormalized operators, and (c) (n-4)(9 • A) 2 /£o* 
The only operator in an inconvenient form is the 
last one. 

However, 23 an application of the gauge Ward 
identities to each 9 • A in turn proves that (9 6 A) 2 / 

£ 0 has only a single-loop divergence, and that 

±-{z-AY= (a •/!)*] 

Vg 21 _. — _ 

_ T6P^34)(^+ 2 *™o#) • 

(3.11) 

Hence, 

[* i - D ° - I - G * 


0 = - D c — i[w 0 #+ (2 — i n)e 0 4i-A4i]'(O)^jG R , 

(3.7) 

where we have multiplied the equation by Z z ~ 1/z 
for each external fermion line of G 0 , and by Z 3 -1 /z 
for each external photon. Thus, Eq. (3.7) is an 
equation for the renormalized Green’s function 

g r- 

To rewrite (3.7) in terms of 9 U U we will need 
the counting identities. 22 These are simple con¬ 
sequences of the equations of motion, and can be 
written in terms of either bare fields or normal 
products. In QED these identities are 

N e = (i4>3>l>+im M (°) 

= (4 *»JV[#]H0), (3.8) 

JV r = [i **•„,*+ <(B • A^/^+iejA $]' (0) 

“UiNtlVl + iM (B - A)“]€* 

+ ieti*' n/z N[$A (0) . (3.9) 

Here N e and N r denote respectively the number of 
external electron lines of a Green’s function and 


+ 0(n - 4) . (3.12) 

We have not yet proved 9 U U to be finite, so we 
cannot set n- 4 here. 

Next, we recall the Callan-Symanzik equation 24 * 25 
for G r : 


0= (' c i- z>G - ^3 i + (1+> ' >, 


9 

Bm 


+ Y a t s -^-iYji'-iY a N r ) G K ■ ( 3 - 13 > 

Comparison of the last two equations shows that 
5 U W (0) is finite at n- 4, and that 




2 - y 3 N r 

= *] + (1 + yjm 0 # - ir,M(8 •*)*]/€„ 

(3.14) 
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Here the renormalized action principle has been 
used to express derivatives with respect to e etc. 
in terms of normal products. Also, we have used 
the result 9 that 

Finally, we use (a) the identities (3.8) and (3.9) 
to express N e and N Y in terms of normal products, 
(b) the result 0=ey 3 /2, and (c) the observation 
made earlier that the zero-momentum expression 
for 9 U U determines the expression at all momenta. 
We get 26 

*»“ = h,N[V]+ ( 1 +vJ w, o# 

- [y 2 + 3 -e 2 $„/(8?r 2 )] 
x . (3.15) 

Use of the fermion equations of motion gives 27 

+ > C 3 * 1 ®) 

the same operator formula as was found in Eq. 
(2.17) above. 

Nate added in proof. After this work was com¬ 
pleted, we learned that essentially identical re¬ 
sults have been obtained by N. K. Nielsen (unpub¬ 
lished). 
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APPENDIX A 

We analyze here the consequences of including 
a photon regulator to make finite the divergent 
diagrams of Figs. 1(a) and 1(b). It proves con¬ 
venient to use a regulator scheme similar to that 
used 28 in studying the axial-vector divergence 
anomaly, and specified as follows; 

(i) The smallest closed fermion loops illustrated 
in Fig. 2(a) are given their usual gauge-invariant, 
renormalized values. 

(ii) The larger fermion loops, such as illustrated 
in Fig. 2(b), are calculated to be photon gauge-in¬ 
variant and hence finite. 

(iii) All photon propagators are regularized: 
Photon propagators emerging singly from vertices, 
as in Fig. 2(c), are regularized by the replace¬ 
ment 

1 1 1 - * i \ 

p z * p 5 ~ p z -M 2 ~ p z {p z — M 2 ) ’ KAL} 


COLLINS, AND ANTHONY DUNCAN 15 




c- 

(a) 


+■ photon interchange 


+ photon permutations 


9p V + photon permutations 

(b) 


4P 


4 P 


(c) 



9 Ov + photon permutations 

(e) 


FIG. 2. (a) Smallest closed fermion loops which are 
given their gauge-invariant fully renormalized values. 

(b) Larger fermion loops which are evaluated to be 
gauge Invariant, (c) Photons emerging from single - 
photon vertices, which are regulated according to Eq. 
(Al). (d) Photon pair emerging from 6^ vt which is reg¬ 
ulated according to Eq. (A2). (e) Fermion-loop diagrams 
with photon radiative corrections. 


with M the regulator niass. Pairs of photon prop¬ 
agators emerging from the energy-momentum 
tensor 9 uin as in Fig. 2(d), are regularized by 
the replacement 


— v 

pZ v uvafi 



pT. (Pl> Pz ) p 3 


~~ p* _M 2 p*-M 2 ’ ^ 

with the regulator vertex V% va$ chosen so that 
(apart from photon gauge terms, which do not con¬ 
tribute to on-shell matrix elements) the algebraic 
structure of the gravitational Ward identities im¬ 
plied by conservation of 9 UV is preserved. Specif¬ 
ically, the Feynman rules for vertices of 9 ulf give 
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Vuvat (Pi,Pz)= “2 VuuiPl ~Pl»Pza^ + 2 (Pi + PluPz^a»~ PluPztJIvfi “Pzi/Plrfua 

■*" Pi */*2^?L>a^u0 “PlitPza^uH ~Pzu Plfl W va ) , (A3) 

which when contracted with (/>i+/> 2 ) w gives 

(.Pi+P^v v . ua ^p l ,p 2 )=gW3%e terms + kp z (.p 2v f) ae , -P 2a vJ+iPi Z (Pii,il a e-PiB 1 lvJ ■ ( A4 ) 

We wish to construct V^ afi (p lt p i ) so that 

(.Pi + Pz) Vt Vlt>atkPi>Pz) = % aa & e terms + Hp! Z -M z ){p Zv ri a6 -p Za ri oli ) + ^(P 2 z -M z )(p l0 ri aS - P is i} 0a ) , (A5) 

which gives for the divergence of Eq. (A2) 


iPl + Pz^^p^Z^ Uua&^Pl’Pz) p 2 p^—M 2 ^2t»afl(/ > lj/ > 2) pZ ^ 

= gauge terms + i(Pz^ aB ~P 2a k(PiJlc*-Pi» 0(^5 ~p^~w) ’ ^ 


which has the same structure as the divergence of 
(l/pi Z )V UVa fi(l/p z z ), apart from the replacement of 
the photon propagators by regularized propagators. 
One easily finds that the lowest-order polynomial 
in momenta satisfying Eq. (A5) is 

Vuvafi(pl>Pz) = V iu*ct&(Pl>Pz) 

(A7) 

Thus, the requirement that the regularization 
scheme respect gravitational Ward identities in¬ 
troduces an explicit M z dependence into the 0 a „- 
photon vertex. This is, of course, just the con¬ 
tribution to & uu expected from the mass term in 
the regulator field Lagrangian. 

(iv) The regularization prescription adopted 
above makes radiative correction diagrams such 
as illustrated in Fig. 2(e) finite for finite M, but 
divergent as M— °°, with the divergences canceled 
by appropriate counterterms appearing in the re¬ 
normalization constant Z 3 (M). We note, however, 
that since explicitly renormalized values for the 
single-loop diagrams of Fig. 2(a) are always used, 
Z 3 contains no counterterms referring to these 
diagrams. In effect, we have adopted a type of 
intermediate renormalization procedure, in which 
Z 3 contains counterterms only for those vacuum 
polarization graphs which involve internal virtual 
photons. 

Having specified the regularization procedure, 
we can now turn to a study of the lowest-order 
divergent e u u insertions of Fig. 1. It suffices to 
consider these insertions at zero four-momentum 
transfer, since the difference between zero and 
nonzero four-momentum transfer will converge. 
Focussing on the trace-to-two-photon vertex on 
the left-hand side of the dashed line in Fig, 3(a), 
we find in one-fermion-loop order that there are 
two classes of 6 UV couplings which contribute, as 


illustrated in Fig. 2(b) and Fig. 2(c). [We note in 
passing that an explicit check of 6 UU conservation 
for the diagrams of Figs. 2(b) and 2(c) shows that 
the structure of the Ward identities is guaranteed 
by the regularization scheme sketched above, with 
no need for any additional vertex modifications 
beyond that given by Eq. (A7).] Taking the trace 
on py of Fig. 3(b), using the trace anomaly formula 
of Eq. (2,17) to leading order, and dropping gauge 
terms gives 

3 ( 6 ) ] = - 3 ^- *? a8 pz( p z _ M zf ■ ( A8 > 

In the absence of regulators, the trace of Fig. 3(c) 
would vanish, but when regulators are included it 
is nonvanishing, on account of the term propor- 


P,a L 



-P,*' 



+ photon permutations 




FIG. 3. (a) The divergent diagrams of Fig. 1, at zero 
four-momentum transfer. We focus on the trace-to-two- 
photon vertex on the left-hand side of the dashed line. 

(b), (c) Classes of one-fermion-loop diagrams which 
contribute to the left-hand side of the dashed line in (a). 
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tional to M z in Eq. (A7), and one finds 
n u,r 3(r }i_ -4M 4 7j Qfl n l2) (/>7mSa!) 

V LSleJJ- 


(A 9) 


Setting -p z - x, and using the fact that 

n iz) (p z /m z , a) In* - c , (A10) 

with c a constant, the sum of Eqs. (A8) and (A9) 
becomes 


*r[3(b)]+7r[3( C )] 

_ 2a 4q uB M A (a/ZiTlnx + c) 

= 3tr V** x(x+M 2 ) 2 ~ (x+M 2 Y 


= 2tj o8 M 4 


dx 


a/Zirlnx+c 

U+M 2 ) 2 


(All) 


Now the leading single logarithmic divergence of 
either of the diagrams in Fig. 3(a) comes from an 
integral of the form 


/" ^{7j“‘'[3(b)]+7j“‘'[3(c)]} i |.U) , 


(A12) 


where ip(x)~c l /x+ • * • represents the right-hand 
side of the dashed line. But substituting Eq. (All) 
and the leading term of i/j(x) into Eq. (A12), we get 
a result proportional to 


*>*/ 


d / a/Zvlnx + c 
\ (x+M z ) z 


= M 4 


a/37rlnjr + c 
(x+M z ) z 


X s » 

) 

x finite 


(A13) 


which approaches a finite limit as the regulator 
mass M approaches infinity. In other words, the 
logarithmically divergent contributions to the 
trace coming from Figs. 3(b) and Figs. 3(c) pre¬ 
cisely cancel: in effect, the extraM 2 term in 
the ^-regulator photon vertex of Eq. (A7) gen¬ 
erates, in the limit as A/—°°, a subtraction counter¬ 
term for the divergent operator Z 3 ~ 1 F Xa F Ka . The 
mechanism operating here is evidently a photon 
analog of the fermion regulator behavior 3 which 
can be thought of as producing the trace anomaly 
in the first place. 


APPENDIX B 

We give here the derivation of Eqs. (2.8) and 
(2.14), and also illustrate Iwasaki’s theorem on 
the vanishing of (o|f? u w |yy) in a special case. To 
derive Eq. (2.8) we follow the method of Sato- u 
Introducing the scalar vertex part r 3 (p lf p z ), we 
have 

(e(.p)\wM\e(.p))=T s (p,p)\„ n . (Bl) 

Writing 


^(AjA) iZ z w 0 [5{r(/>)] 1 


~ s (/>)] > ( B2 ) 

with S^the unrenormalized electron propagator 
and 2 the renormalized electron proper self¬ 
energy, and substituting into Eq. (Bl), we get 




= m 0 


9w 

—— + 

&m 0 





(B3) 


Now by the chain rule we have 

(”•1^)1.. 



dnt 9E 9a 9E \ 
9 m 0 9m +m ° dw 0 da 



(B4) 


while from the fact that E is homogeneous of 
degree 1 in^ and m we get 


(B5) 

Combining Eqs. (B4) and (B5), we see that the 
renormalization conditions on E, 


E 


-15 

dp 


= 0 

p m m 


imply that 


(B6) 



(B7) 


[in Eq. (B6) we have assumed the Yennie gauge, 
in which 5^) has a true pole at jf = m\ this re¬ 
striction is immaterial, since the final result of 
Eq. (B8) is manifestly gauge invariant.] Thus, 
the second term in Eq. (B3) vanishes, giving the 
desired result 


(B8) 

To derive Eq. (2.14) we follow a similar pro¬ 
cedure. Introducing the zero-momentum-trans¬ 
fer scalar to two-photon vertex T rrs (p z /m z , a), we 
have 


(o|w 0 #|y(/>, e 2 )) 

= iaf„ s (0,a) 

x (0|Z 3 - l F Xo F x<r |y(/>, eM~P, • (B9) 

However, r rYS (p z /nt z , a) is related to the photon 
renormalized proper self-energy n (p z /m z , a) by 
the Callan-Symanzik equation 12 
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T+T(a) ( W i + £) 5 1 1 + a) ] 

= a) . (BIO) 

On settingp z = 0 in Eq. (BIO) and using the re¬ 
normalization condition 

n(o, a)=0 , Oil) 

we get 

a ' r ^°’ a) = -JTm > < B12 > 

which when substituted into Eq. (B9) gives the 
desired relation 

(O|w 0 #|r(P, e 2 )) 

. 1 0(g) 

4 1+ 6(a) 


x |y(p, c,M-P> • 013) 

It is also instructive to rearrange Eq. (BIO) into a 
slightly different form by writing 

n“•’(p, -P) = ( p*p v -p z n‘ iv )B(p' l /m t , a) , 

tf’Kp,-P ) = W -/> V-JafV^Vw*, a) , 

(B14) 


giving 

■ -f>" IT5!S)[( 2 -*' *)*«“> (°h -■)] 

x ,«-(/>, -p) - JWq (/>-/>" -prr) . 

(B15a) 

An equivalent form, suggested by Eq. (2.17), is 
[ 1 + 6(a) ] -p) + fi{a)(p»pr - phf v ) 


2 —p * 



1 


n -p). 


(B15b) 


Equation (B15) is an exact expression, at zero 
momentum transfer, for the vacuum-to-two-photon 
matrix element of the "naive” or canonical trace 
w 0 #. 29 Substituting the second-order perturba¬ 
tion formula 


n {z) (p z /m z , a) 

s= r-f 0 l dxx(l-x)ln(l- p * x( lr x) ) (B16) 

into Eqs. (BIO) or (BIO), we recover the usual 
second-order perturbation theory formula 

arJKOVm*, a) 

= ~tI 0 1 dxx{1 ~ x) ^-pk iT -x l ■ (B17) 

As a simple, explicit check on Iwasaki’s theorem 
we have calculated the second-order vacuum-to- 
two-photon matrix element of at zero momen¬ 
tum transfer. (This can either be done directly by 
diagrammatic techniques, or more simply by 
using the Ward identities 90 following from con¬ 
servation of 9 UU .) Denoting this matrix element 
by with p, -p the (virtual) photon four- 

momenta and with a and fi the photon polarization 
indices, we find that 

T'EJLP) = a) 


d . 


+ (P a Pe-P 2 n a s)PuPv 2 jp ii l2) (p 2 /w 2 , a), 

(B18) 


^nvat&~~P 

&PuPv~ WuvP aP&^~ 

+ 1 1vaP t iPfi +1 h>aPuPa + 1 1ufiPvPa * 

Taking the trace we obtain 


x^n'^Sa), 


(B19) 


which evidently vanishes for on-shell photons 
(/> 2 = 0) as asserted by Iwasaki’s general argu¬ 
ment. To exhibit the splitting of Eq. (B19) into 
"naive" and anomalous trace terms, we substitute 
Eq. (B16) and rearrange by comparison with Eq. 
(B17), giving 

r"Tl\' ae (p) = - (PePt-P^J 


*(“?£(/> 2 /™ 2 > “)+!?) (320) 

as expected from Eqs, (2.17) and (B9) in second 
order. 
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PHOTON SPLITTING IN A STRONG MAGNETIC FIELD 

S. L. Adler, J. N. Bahcall,* C. G. Callan, and M* N* Rosenbluth 
The Institute for Advanced Study, Princeton, New Jersey 08540 

(Received 6 August 1970) 


We determine the absorption coefficient and polarization selection rules for photon 
splitting in a strong magnetic field, and describe the possible application of our results 
to pulsars. 


Recent work on pulsars suggests the presence 
of trapped magnetic fields within an order of 
magnitude in either direction of the electrody- 
namic critical field 23 cr -nfife =4.41 xiO 13 G. 1 
(Here m and e are, respectively, the electronic 
mass and charge.) In such intense fields, elec¬ 
trodynamic processes which are unobservable in 


the laboratory can become important. One such 
process, for photons with energy (*) > 2m, is 
photopair production, for which both the photon 
absorption coefficient and the corresponding vac¬ 
uum dispersion have been calculated by Toll. 2 
For o) < 2m the photopair process is kinematical¬ 
ly forbidden, and the only 3 photon absorption 
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mechanism which does not require the presence 
of matter is photon splitting, i.e., 

y(#) + external magnetic field 

-jikJ+rfaz). (l) 

We present in this note the results of calculations 
of the absorption coefficient and the polarization 
selection rules for this reaction, in the case of 
a constant and spatially uniform external magnet¬ 
ic field B . 4 

To begin, let us consider photon splitting when 
dispersive effects caused by the external field 
are neglected, so that the photon four-momenta 
satisfy the vacuum dispersion relation 

( 2 ) 

Because the external field B is constant and spa¬ 
tially uniform, it cannot transfer four-momentum 
to the photons, and so the four-vectors #, k lt k 2 
must satisfy four-momentum conservation by 
themselves, 

k=u)(l,&)=k 1 +k i =u l (l,k 1 ) + u) 2 (.l,k 2 ). (3) 

It is easily seen that Eqs. (2) and (3) can be sat- 

A 

isfied only if the three propagation directions #, 
% l9 and # 2 are identical, which implies that the 
photon four-vectors are proportional, 

# x =( 0 ^/ 0 ;)#, # 2 = (o> 2 /o>)#. (4) 

We will use Eq. (4) to simplify considerably the 
matrix elements for photon splitting. To leading 
order in e, the matrix element involving 2n + 1 
interactions with the external field comes from 
the ring diagrams with 2w + 4 vertices which are 
illustrated in Fig. 1. When all permutations of 
the vertices are summed over, the matrix ele¬ 
ment is gauge-invariant, and therefore must 
couple the three photons and the external field 
only through their respective field-strength ten¬ 
sors F MI/f F mi/ \ F m „ 2 , and F Ml ,. Because Eq. (4) 
tells us that only one four-momentum is present 
in the problem, the matrix element for Fig. 1 is 


a sum of terms of the form 

FF l F 2 x F-vF x (5) 

2n + 1 factors 21 factors 

with the Lorentz indices contracted to form a 
Lorentz scalar (which is why the number of 
factors# must be even). Sinee # p F py =# ^F^ 1 

tlv k v -Q t a nonvanishing contribu¬ 
tion is obtained only if each factor k is contracted 
with a different F, which means that we must 
have We will now show further that when 

Z=w, the contribution to the matrix element still 
vanishes. Writing v il a 'F liv k v f a term with 2n 
factors k has the form 

FF 1 F z Fx v»v t ( 6 ) 

2 n factors 

again with Lorentz indices contracted to form a 
Lorentz scalar. Because of the antisymmetry of 
the field strength, the number of factors v which 
can be contracted with field strengths can only 
be 0, 2, or 4. An enumeration of these contrac¬ 
tions 5 shows that they must always contain at 
least one factor of the following five types: 
F aR l F 2aa , v a F laS F By *vy, F a& F 1& yF ^F 6a , 
v a F aS F Ry l F 2 y s F 6e v € , or v a F aS F 0 /i? y s F &e 2 v € , 
or factors obtained from these by permuting the 
photon field strengths F, F 1 , and F 2 . A simple 
direct calculation shows that for free photons 
propagating along the same direction, the five 
factors always vanish, irrespective of the orien¬ 
tations of the photon polarizations. We conclude, 
then, that the term with 2 n factors k vanishes, 
so that at most 2*z-2 factors k can be present 
(for 1) in the term in the photon-splitting 
matrix element involving 2n +1 external field 
factors F. 

Let us now apply this result to the two small¬ 
est ring diagrams: the box diagram (w=0) and 
the hexagon diagram (w = 1). We immediately 
learn that the box contribution to photon splitting 
vanishes identically, 6 * 7 so that the leading dia¬ 
gram which contributes is the hexagon. Further- 


2n + I 

interactions 
with external 
field 


n k) 



+ 

2 ‘ 


all permutations 
of vertices 


FIG. 1. Ring diagram for photon splitting involving 2n +1 interactions with the external field. 
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more, the hexagon contains at most 2x 1-2= 0 factors of k in addition to those contained in the field 
strengths, which means that the hexagon diagram is given exactly by its constant-field-strength limit, 
which can in turn be easily calculated from the Heisenberg-Euler 8 effective Lagrangian. Larger ring 
diagrams will, of course, also be present, but for purposes of rough order-of-magnitude estimates 
the leading dependence on B/B CT and co/m given by the hexagon should be sufficient. Carrying out the 
effective-Lagrangian calculation for the hexagon gives the following formulas for the photon-spliting 
absorption coefficients in the various photon polarization states: 




'u'V /Bsine 'V 
m) \ B ct ) 

' V/ 5 3infl \ 6 

?*}\ ) 


m = 0.39 

m =0.121 


M 6 /S sinfl r 

yn) \B ct / 

f w\*( B sin(? \ 6 

?*) \ *cr / 


cm" 1 , 

cm” 1 , 


- (Jl )x + W 2 ] + k [(±) - + (||),] = 2k[(|| ) - Wx + (-L)J. 



Here a=e 2 « 1/137 is the fine structure constant, 
0 is the angle between the photon propagation di¬ 
rection £ and the direction $ of the external mag¬ 
netic field, and the linear polarization eigen- 
modes are labeled || or ± according to whether 
the B vector of the eigenmode lies in, or is nor¬ 
mal to, the£-£ plane. Only formulas for pro¬ 
cesses involving an even number of J- photons 
have been given; the absorption coefficients for 
processes involving an odd number of J- photons 
vanish by a simple CP argument. To see this, 
we note that for each || photon, the matrix ele¬ 
ment will contain a CP-even factor B photon -5 
[the only other possible scalar product, B photon 
r £, vanishes by transversality], while for each 
J- photon it will contain a CP -odd factor B photon 
•5. Since the only scalar not involving the photon 
fields is £•£, which is CP even, the matrix ele¬ 
ments for the odd--L-photon processes are CP 
odd, and hence vanish. 

So far we have assumed that the photons satisfy 
the vacuum dispersion relation of Eq. (2). Actu¬ 
ally, because of the absorptive processes taking 
place in the external field, there will be disper¬ 
sive effects which modify Eq. (2). A simple CP 
argument shows that the photon eigenmodes re¬ 
main linearly polarized, with the parallel and 
perpendicular characters described above, but 
with the ratio of wave number to frequency 
changed from unity to 

*/w=w|| >± . (8) 

The indices of refraction n n , can be calculated 

II t ^ 

from the total absorption coefficients K\\ t ± by 
Kramers-Kronig (dispersion) relations, with the 
dominant contribution coming from photopair 
production [the contribution from photon splitting 
is smaller by a factor ~(af/7r) 2 (B sin0/B cr ) 4 , and 
can be neglected]. For small B/B CT the calcula¬ 


tion has been carried out numerically by Toll, 2 
who gives curves for ± as a function of fre¬ 
quency When we have w < 2m, so that the pa¬ 
rameter x = (o>/2fn)(B sinO/B CI ) is also small, 
Toll’s results can be approximated analytically 
by 



AT||U)~0.18+0.24x 2 , 

N ± (x) ~ 0.31 + 0.44x 2 . (9) 

When dispersive effects are taken into account, 
the equations of conservation of four-momentum 
become 

w = w 1 + w 2 

n((i))d)& =n(o) 1 )o) 1 ^ 1 +n(w 2 )wjt 2 , (10) 

with each n the refractive index appropriate to 
the respective photon polarization state. These 
conditions can be simultaneously satisfied only if 

0^ A=w(w 1 )w 1 +n((j0 2 )v 2 

-(o^ + u> 2 h(wi + <*> 2 )y (11) 

in which case the photon propagation directions 
are not precisely parallel, but rather diverge 
from one another by small angles "(A/w) 1 ^ 2 . As 
a result of this nonparallelism, the box diagram 
is no longer precisely zero, but a careful esti¬ 
mate shows that it is still much smaller than the 
hexagon. When A is negative, the photon-split¬ 
ting reaction is forbidden. Substituting the in¬ 
dices of refraction of Eq. (9) into Eq. (11) shows, 
indeed, that for small x the only reaction in 
Eq. (7) which is kinematically allowed is (||) — (±\ 
+ (J-) 2 , and that this reaction occurs without re¬ 
striction on the photon frequencies w 1 and o> 2 . 
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Table I. Selection rules for photon splitting. CP-forbidden reactions are suppressed 
by a factor ~(a/n) 2 (B sin 0/B ct ) a relative to CP-allowed cases. 




Reaction 

CP 

selection rule 

Smalls kinematic 
selection rule 

( 

)-( 

)i + (ID 2 

Allowed 

Forbidden 

( 

)-( 

) 1+ (±) 2 ,(±) 1+ (ii) 2 

Forbidden 

Allowed 

( 

|)-W, + W, 

Allowed 

Allowed 


>1 + (IDs 

Forbidden 

Forbidden 

W-( 

)i + W 2 » Mi + (ID 2 

Allowed 

Forbidden 

W-Wi + M, 

Forbidden 

Forbidden 


The various polarization selection rules for pho¬ 
ton splitting are summarized in Table I. Be¬ 
cause of the small nonparallelism of the photons 
in the kinematically allowed regions, the “CP- 
forbidden” reactions are not precisely forbidden, 
but are down by a factor ~(af/tr) 2 (2 sin0/23 cr ) 4 re¬ 
lative to the “CP-allowed” cases. We see that 
for small x , all reactions by which perpendicu¬ 
larly polarized photons might split are kinemati¬ 
cally forbidden, while parallel-polarized photons 
split predominantly into perpendicularly polar¬ 
ized photons . Hence photon splitting provides a 
mechanism for the production of linearly polar¬ 
ized y rays . 9 

To conclude, let us briefly discuss the possible 
application of our results to pulsars. We assume 
that the hexagon-diagram absorption coefficients 
in Eq. (7) can be used for order-of-magnitude 
estimates even when the parameters B/23 cr and 
c c/m are of Order unity. 10 Taking, for illustra¬ 
tion, £/23 cr ~a>/tw~sin0~l, we find k[(||)-(J-)i 
+ (j.) 2 ]~0,l cm” 1 . This gives 10 5 absorption 
lengths in the characteristic distance P pu isar 
~10 6 cm over which the trapped magnetic field 
has its maximum strength, indicating that photon 
splitting can be an important absorption mecha¬ 
nism for y rays emitted near the pulsar surface. 
Before we can apply the kinematic polarization 
selection rules to the pulsar problem, two ques¬ 
tions must be dealt with. First, since Toll's 
curves for the indices of refraction were obtained 
assuming B/23 cr small, an extrapolation is in¬ 
volved in extending the selection rules forbidding 
perpendicular-photon decay and parallel-photon 
decay into parallel photons to the region where 
S/23 cr is of order unity. However, Toll's photo¬ 
pair production curves show that #c ± > /cy when 
B/B ct is unity. By combining this fact with the 
Kramers-Kronig relations, one easily sees that 
the selection rules in Table I hold as long as 
00 < 2m, and hence the extrapolation is justified. 
Second, one expects a plasma to be present near 


the pulsar surface which will contribute addition¬ 
al dispersive terms to the inequality of Eq. (11). 
For a plasma-electron density of 10 17 -10“ 19 cm" 3 
(in rough accord with current pulsar models 10 ), 
a detailed estimate shows that plasma-induced 
splitting of perpendicularly polarized photons 
occurs with an absorption coefficient of at most 
10“ 9 -10“ 7 times the absorption coefficient for 
the allowed reaction (II) —-■(-L) 1 h- (J-) 2 , and there¬ 
fore will be completely negligible. 11 We conclude 
that if y rays in the range 0,5-1 MeV are emitted 
near the pulsar surface, and if the surface mag¬ 
netic field is as large as 23 cr , only those gammas 
with perpendicular polarization will escape. A 
distant observer would see linearly polarized 
gammas, with their B vector perpendicular to 
the plane containing the line of sight and the 
traversed pulsar magnetic field. 12 

A detailed account of the calculations summa¬ 
rized here will be presented elsewhere. 10 The 
authors wish to thank P. Goldreich (who first 
brought this problem to our attention), E. P. 

Lee, and M. Rassbach for informative conversa¬ 
tions. After this manuscript was completed, we 
learned that some of our results have been ob¬ 
tained independently by Z. Bialynicka-Birula 
and I. Bialynicki-Birula. 13 


^Present address: California Institute of Technology, 
Pasadena, Calif. 

1 For a recent view, see F. Pacini, “Nentron Stars, 
Pulsar Radiation and Supernova Remnants,” to be pub¬ 
lished. We use unrationalized Gaussian units, with 
H=c =1. 

2 J. Toll, dissertation, Princeton University, 1952 
(unpublished). 

3 The phase space for a photon to split into three or 
more photons vanishes. 

4 In a pulsar, the field B varies over a characteristic 
distance of f^puisar ~10 8 cm, but this can be shown to 
have a negligible effect on our results. 

5 We need not consider contractions involving the 
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antisymmetric tensor € a gy£, because by parity the 
number of such factors must be even, and they can be 
eliminated pairwise in terms of Kronecker deltas by 
means of the identity 

€ 0tey5 € a'6'y'5' = Z) R&gyy'g 66'* 

Perrr(ot'0'y'6') 

6 This result for the box diagram has been obtained 
independently by M. Rassbach. 

7 This disagrees with the conclusion of V, G. Skobov, 
Zh. Eksp. Teor. Fiz. 35, 1315 (1958) [Sov. Phys. JETP 
8, 919 (1959)], who failed to make a properly gauge- 
invariant calculation of the box. Skobov 1 s result is 
quoted in the review article of T. Erber, Rev. Mod. 
Phys. 38, 626 (1966). 

8 W. Heisenberg and H. Euler, Z. Phys. 38, 714 
(1936). 

‘‘Toll (Ref. 2) points out that in a frequency interval 
Aw ~eB/m just above the photopair threshold at w =2 m, 


the photopair process acts as a linear polarizer of the 
opposite sense, absorbing photons of perpendicular 
polarization, but not those of parallel polarization. 

10 For further discussion, see S. L. Adler, to be pub¬ 
lished. 

u In a plasma, the propagation eigenmodes become 
elliptically polarized, but are still “almost plane || ” 
and “almost plane in nature. Faraday rotation, 
which arises from interference between two unattenu¬ 
ated propagation eigenmodes of different phase veloci¬ 
ties, cannot occur in our case since only the “almost 
plane J-” eigenmode propagates without attenuation. 

As a result of photon splitting the “almost plane ||” 
eigenmode is rapidly absorbed. 

12 Solid-state linear-polarization analyzers for gammas 
in this energy range have been described by G. T. Ewan 
et ed, f Phys. Lett. 29B. 352 (1969). 

13 Z. Bialynicka-Birula and I. Bialynicki-Birula, to 
be published. 
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We determine the refractive indices for photon propagation, and the absorption 
coefficient and polarization selection rules for photon splitting, in a strong constant 
magnetic field. Results are presented both in the effective Lagrangian (low frequency) 
approximation and in a more accurate approximation which exactly sums the vacuum 
polarization ring diagrams, neglecting only internal virtual photon radiative corrections. 
Our principal conclusion is that photon splitting can provide a mechanism for the 
production of linearly polarized gamma rays. 


1. Introduction and Summary 

Recent work on pulsars has suggested the presence of trapped magnetic fields 
within an order of magnitude (in either direction) of the electrodynamic critical 
field B cr = rrpje = 4.41 • 10 13 gauss 1 (with m and e, respectively, the electronic 
mass and charge)[l]. In such intense fields, electrodynamic processes which are 
unobservable in the laboratory can become important. One such process, for 
photons with energy co > 2m, is photo-pair production, for which both the photon 
absorption coefficient (inverse absorption length) and the corresponding vacuum 
dispersion have been calculated by Toll [2]. For o* < 2m, the photo-pair process is 
kinematically forbidden, and the only photon absorption mechanism in the absence 
of matter is photon splitting, i.e., 

y(k) + external magnetic field -> y(k 1 ) + y(k 2 ). (1) 

We give in this paper detailed calculations of the absorption coefficient and 
polarization selection rules for this reaction, in the case of a constant and spatially 
uniform external magnetic field B. A summary of our results, and a brief discussion 
of their possible application to pulsars, have already been given in [3]. 

In Section 2, we consider photon splitting in the absence of dispersion, so that 
the three photons are strictly collinear. We find that the box diagram matrix 

1 We use unrationalized Gaussian units, with h = c — 1. 
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element vanishes, while the hexagon matrix element is nonvanishing and is given 
exactly by its constant-field-strength (small a>/m) limit. Using the Heisenberg- 
Euler effective Lagrangian [4], we calculate the sum of the constant field strength 
limits of all ring diagrams for photon splitting involving arbitrary numbers of 
interactions with the external field. Then, we use proper-time techniques to exactly 
calculate the photon splitting matrix element to all orders in the external field, 
without the restriction to constant photon field strengths. (The details of this latter 
calculation, which neglects only internal virtual photon radiative corrections, are 
given in Appendix 1). The magnitude of the resulting photon splitting 
absorption coefficient, for photon propagation normal to the external field, is 
k ~ 0.](B/B CR y (co/m ) 5 cm -1 . Thus, if pulsar fields are as large as B CR , for photon 
frequencies a> of order m, there are many photon splitting absorption lengths in a 
characteristic pulsar distance of 10 6 cm. A comparison of the photon splitting and 
photo-pair absorption coefficients shows that when the photo-pair process is 
kinematically allowed it dominates over photon splitting as a photon absorption 
mechanism. To complete our discussion of the no-dispersion case, we estimate the 
leading corrections arising from the fact that the magnetic field B is not strictly 
uniform, but varies over characteristic distances of order 10 6 cm. A nonuniform 
magnetic field can transfer momentum to the photons, with the result that the final 
photons emerge at small but finite angles with respect to the initial photon direction. 
This means that the argument for the vanishing of the box diagram no longer holds, 
but an estimate shows that the resulting box diagram contribution is negligibly 
small compared with the hexagon diagram absorption coefficient. 

In Section 3 we discuss dispersion effects and polarization selection rules. 
Because of photon absorption processes which take place in the presence of the 
external magnetic field B, the vacuum in the presence of the field B acquires an 
index of refraction n, and the photon dispersion relation is modified from k/oo 1 
to kjoo = n. There are actually two different indices of refraction n corresponding 
to the two photon propagation eigenmodes. A general argument based on the CP 
invariance of electrodynamics shows that the eigenmodes are linearly polarized, 
with the B-vector of the eigenmode either parallel to (|| mode) or perpendicular to 
(_L mode) the plane containing the external field and the direction of propagation. 
The indices of refraction n\\ i± can be calculated in the constant field strength 
(small co/m) limit from the Heisenberg-Euler effective lagrangian, and can be 
calculated without the constant field strength restriction, by the proper time 
methods of Appendix 1. They can also be obtained from the absorption coefficients 
*:il i± by Kramers-Kronig relations, with the dominant contribution coming 
from photo-pair production. When dispersive effects are taken into account, we 
find that energy-momentum conservation in the photon splitting process can be 
satisfied only if the inequality 0 < A = n(co 1 ) + n(co 2 ) a> 2 - n(oo x + co 2 )(co r + co 2 ), 
holds, with co = co r + co 2 and with each n the refractive index appropriate to the 
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respective photon polarization state. The photon polarization directions are no 
longer precisely parallel, but rather diverge from one another by small angles of 
order (zd/o/) 1 / 2 . Again, as a result of this nonparallelism, the box diagram is no 
longer precisely zero, but a careful estimate shows that it is only an order 
a(a = e 2 = fine structure constant) correction to the hexagon. When A < 0, the 
photon splitting reaction is forbidden. Using our expressions for the refractive 
indices, we analyze the sign of A for the various photon polarization cases. We 
find that when o* is below the pair production threshold at 2m, only the photon 
splitting reactions ([[) -* (±)i + (±) 2 and (|]) -* (NX + (±) 2 , (±)i + (li ) 2 are 
kinematically allowed. Furthermore, when the small angles between the photon 
propagation directions are neglected, a simple C/Mnvariance argument shows that 
the photon splitting reactions involving an odd number of (J_) photons are for¬ 
bidden. Hence the only allowed polarization case is (||) —► (_L)i + (J_) 2 , indicating 
that photon splitting provides a mechanism for the production of polarized 
photons: perpendicular-polarized photons do not split, and parallel-polarized 
photons split predominantly into perpendicular photons. 

Finally, in Section 4 we discuss corrections to our results arising when a plasma 
with electron density n e is present in the region containing the strong magnetic 
field. We show that for n e of order 10 17 — 10 19 cm -3 (in rough accord with current 
pulsar models) the plasma-induced splitting of perpendicular-polarized photons 
occurs with an absorption coefficient of at most 10~ 9 — 10~ 7 times the absorption 
coefficient for the allowed reaction (]|)—► (_L)i + (JJ 2 > and hence will be com¬ 
pletely negligible. 


2. No-Dispersion Case 2 


A. Kinematics 

We consider photon splitting in the presence of a time-independent, spatially 
uniform external magnetic field B. Because of the constancy of the external field it 
can absorb no four-momentum, and as a result the four-vectors k, k x , k 2 of the 
initial and final photons must satisfy energy and momentum conservation by 
themselves, 

k = a>(l 9 k) = k-L + k 2 = oj 1 (1 9 ^) + co 2 (], k 2 ). (2) 

It is easily seen that this condition can be satisfied only if the propagation direc¬ 
tions of the three photons are identical, 

k k x = k 2 , (3) 

2 We follow the metric and other notational conventions of J. D. Bjorken and S. D. Drell, 
“Relativistic Quantum Fields,” McGraw-Hill, New York, 1965, pp. 377-390. 
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external field direction. The vanishing of all matrix elements involving an odd 
number of perpendicular photons results from the CP invariance of quantum 
electrodynamics, as will be explained in detail below. 

Finally, doing the phase space integrals gives us the following expressions for 
the photon splitting absorption coefficients: 
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(24) 


Eqs. (15), (17), (23) and (2 4 ) constitute our results for photon splitting in the small 
co/m limit when dispersive effects are neglected [3, 8]. We will see below that when 
dispersive effects are taken into account, the reactions (||) —► (||)i + (||) 2 , 
(_L) —► (|IX + (J_) 2 and (J_) —>(J_) 1 + (]]) 2 are kinematically forbidden, while 
the reaction (||)—>(J_X + (J_) 2 still occurs with the absorption coefficient given 
by Eq. (23). 

D. Exact Calculation 

As we have noted, the effective Lagrangian formulas of Eq. (23) give the sum 
of ring diagrams shown in Fig. 4, in the limit of constant photon field strength 
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(small co/m). It is possible, by using the proper-time techniques developed by 
Schwinger [7], to exactly calculate this sum of ring diagrams, without the small 
cjjm restriction. (Virtual photon radiative corrections to the ring diagrams, such 
as shown in Fig. 6, are still neglected, but these are expected to be strictly an order 



Fig. 6. Typical virtual photon radiative corrections to the hexagon diagram for photon 
splitting (cf. Fig. 3). 

ol correction to our results.) Leaving calculational details to Appendix 1, we present 
here only the results for the kinematically allowed reaction (] ]) —► (A_)i + (J_) 2 . We 
find that the matrix element J?[(\\) —► (J_) 1 + (J_) 2 ] appearing in Eq. (23) is now 
given by the rather complicated expression 
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D = {exp[o > 2 2 R(s, t) + w 1 2 R(s 9 u)] — exp[<Vi?(j, t) + a> 2 2 R(s, u )]} 
X expt^oja A(s 9 t, u)] C_(j, t 9 u) coth(e.Bj), 

E = {exp[oj 2 jR(j, /)] — 1} 

X [smh(<*) + - »] - cshW 


with 


C±(s 9 t 9 u) 


R(s, t) 


1 

2 

1 _ 

2 



- lj] ± cosh [(eBs) (-y- - lj] 

cosh[eBs(2t/s — 1)] — cosh (eBs) ■ 
eB sinh(e&) 


A (s 9 t, u) = 2u (1-— 

V s 


sinh(2 eBu) sinh[(e5j)(2r/j — 1)] 

2 eB L sinh (eBs) 


+ 


[1 


cosh(2e5w)] r cosh[(e Bs)(2tfs — 1)] — cosh (eBs) 
2 eB L sinh(d§j) 


(25b) 


We make some remarks on various features of this formula: 

(1) Bose symmetry, which states that ^[(||)—► (_L)i + (J_) 2 ] is symmetric 
under the interchange co 1 co 2 , is explicitly evident. 

(2) The small a>/m limit of Eq. (25) is obtained by replacing A by the zeroth 

«s; is; is; 

order term and C, D and E by the (leading) second order terms in their respective 
expansions in powers of photon frequency. (The term B does not contribute to the 
constant field-strength limit.) Doing the u and t integrations then shows that, in 
the small cj/m limit, C 2 [co, a> l9 a> z , B ] reduces to the effective Lagrangian expression 
C 2 (B/B cr ), in just the form given in Eq. (15). 

(3) From Eq. (25b), we easily see that R(s , /) and A(s 9 /, u) are even functions 

of the external field B, both of which vanish like B 2 when B is small. Therefore, the 
small B-limit of Eq. (25) is obtained by expanding out the curly-bracketed exponen¬ 
tial terms in A,...,E, just as in the small arfm case. The leading small B contribution 
to C 2 [ a*, , oj 2 , B] is of order 5 3 , as expected for the hexagon diagram, and is 

found to be frequency-independent, confirming our above-stated result that the 
hexagon diagram is given exactly by its constant field strength limit. When higher 
order terms in the expansions of the curly-bracketed exponentials are kept, we see 
that the general term in C 2 with 2 n powers of photon frequency contains at least 
2/7 + 3 powers of B, in agreement with the theorem of Subsection 2 B. 
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(4) Because Eq. (25) contains infinite multiple integrals, it is necessary to 
look carefully at the question of convergence. By making the rescalings 


t - 5 (1 + v), 
u = ^ (1 + z), 




the t and u integrals are transformed into integrals over finite intervals, with only 
the s integral still extending to infinity. To examine the large-.? behavior of the 
rescaled integrand, we use the fact that when s —► oo with t/s and u/s fixed, the 
quantities R(s , /) and A(s, t , u) are simply approximated by 


R(s , /) p& t(l — t/s ) + finite, 
A(s, t, u) pv 2u{\ — t/s ) + finite. 



On substituting these expressions into Eq. (25), and making similar large-j 
approximations in the factors multiplying the curly brackets, we obtain the follow¬ 
ing result for the region of convergence: When co is below the pair production 
threshold at co = 2m , Eq. (25) converges at least as fast as 


J ds e- 8 (m 2 -™ 2 /4> x (power of s) (28) 

for all values of the secondary photon frequencies u> x and co 2 . When co is above the 
pair production threshold, there are values of and co 2 for which the integral 
diverges. Thus, Eq. (25) gives a valid expression for the photon splitting matrix 
element only in the photon energy region where this matrix element is real, but 
fails when the photon splitting matrix element becomes complex, as a result of 
absorptive contributions such as the one pictured in Fig. 7. This failure is no real 
problem, since we will see below that when co > 2m , the photopair production 
process is a far more important photon absorption mechanism than is photon 



Fig. 7. Lowest-lying absorptive contribution to photon splitting, corresponding to an electron- 
positron intermediate state. The doubled lines indicate the presence of interactions to all orders 
with the external field F. (Such interactions were individually denoted by an x in Figs. 2, 3, 4, 
and 6.) 
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splitting, and so an exact formula for the photon splitting matrix element in this 
region is not really of interest. 

(5) As we noted above, in the limit of small B the matrix element 
C 2 [ a*, u > x , oj 2 , B ] vanishes as B 8 . However, a glance at Eq. (25) shows that there are 
individual terms which vanish with lower powers of B; the B 3 behavior is a result 
of cancellations. It is easy to see, though, that these cancellations are entirely 
contained in the integrands A ,..., E , and do not involve the j, t , u integrations. As a 
result, reliable numerical results for the photon splitting matrix element and absorp¬ 
tion coefficient can be obtained with rather coarse integration meshes. Typical 
numerical results are shown in Fig. 8, which gives the ratio of the exact photon 



Fig. 8. Ratio of the exact photon splitting absorption coefficient *[(ID ( _L)i + (_L) 2 ] to the 

hexagon diagram value for the same quantity. 

splitting absorption coefficient calculated from Eq. (25) to the absorption coefficient 
obtained from the hexagon diagram alone (Eq. (23) with C 2 (B/5 ci? ) replaced by 
C 2 (0) = 6 • 13/945). The upper curve is the result for oj/m = 1, while the lower 
curve, giving the low frequency limit, is identical to a plot of C 2 (B/B CR ) 2 (cf. Fig. 5 
which gives a plot of C 2 (B/B cr )). We see that in the range 0 < B/B CR < 1, 
0 < tojm < 1 the leading power dependence given by the hexagon diagram alone 
suffices for rough order of magnitude estimates. Furthermore, the effective 
Lagrangian calculation of C 2 (B/B CR ) gives the bulk of the corrections coming 
from higher ring diagrams, with the co-dependent effects contained in the compli¬ 
cated formulas of Eq. (25) (i.e., the spread between the two curves in Fig. 8) being 
fairly small. 
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K [ \z) = — 8z 2 /45 + 0{z% 
KHz) = - 14z 2 /45 + 0(z% 


621 



into Eq. (49) gives back the box diagram result of Eq. (46b) in the limit of small 
B/B cr . The calculation can be further improved by using the proper-time tech¬ 
niques discussed in Appendix 1 to exactly sum the ring diagrams involving arbitrary 
numbers of interactions with the external field B, without the restriction to small 
tojm. This gives the following formulas for the refractive indices, 
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and with R(s, t ) given by Eq. (25b). When to is set equal to zero, the integral over t 
(or equivalently, over v) is readily done, giving J 0 dv v) = K^^eBs), and 

so Eq. (51) reduces directly to Eq. (49). To examine the region of convergence of 
Eq. (51), we substitute Eq. (27) for R(s , /) and replace J I l*- L (j, v) by their dominant 
large-.? behavior, giving 
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Maximizing W 1 - and WW with respect to we find that the integral for n 1 - con- 


342 


Adventures in Theoretical Physics 


622 


ADLER 


verges for co < 2m, while that for converges in the somewhat larger region 
co < m[l + (1 + 2B/B CR ) 1 ^ 2 ]. As Toll [2] has shown, these are just the thresholds 
for photopair production by perpendicular-polarized and by parallel-polarized 
photons, respectively. Thus, Eq. (51) is valid when the refractive indices are real, 
but fails when the refractive indices become complex, as a result of the presence of 
the absorptive processes pictured in Fig. 12. 



Fig. 12. Lowest-lying absorptive contributions to the refractive indices for parallel ([[)- and 
perpendicular (_L)-poIarized photons. The doubled lines indicate the presence of interactions to all 
orders in the external field F, 


An alternative method for calculating the indices of refraction uses the fact that 
for each eigenmode the refractive index n is related to the corresponding absorption 
coefficient k by the Kramers-Kronig (dispersion) relation 
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The dominant contribution to Eq. (54) comes from photo-pair production, which 
gives 



a /B sin 9\ 2 __ a 2 B 2 sin 2 9 
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as found in Eq. (46b); in comparison to this, the photon-splitting contribution, 
which can be estimated to be of order 
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can be neglected. The contribution of other absorptive processes to the index of 
refraction will be even smaller. Thus, substituting into Eq. (54) the actual thresholds 
for photopair production by parallel and perpendicular polarized photons, we get 
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We will not actually evaluate these formulas numerically, but will make use below 
of the computational results of Toll [2], which show that for B/B cr <, 1 the perpen- 
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Region 4. 


In this region | n ror 

forbidden decays 
— ' ■—■ 

allowed decay 


— 1 | can become as big as unity, so we have 
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Region 5. According to Eq. (74),/7 T0T —1 can become arbitrarily large in 
region 5 because w 1 can come arbitrarily close to Q c e . But thermal effects, which 
have so far been neglected, prevent the effective value of ] — Q c e ] in Eq. (74) 

from becoming greater than the thermal spread AQ c e in the electron cyclotron 
frequency, which is 
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Even for T as low as 10 _1 °K we have AQ c e > 2 • 10~ 12 m, which limits ] n ror — 1 
to 10~ 2 at most and makes Eq. (86) smaller than one. Hence we get 


forbidden decay 
allowed decay 


a >\Q c e f 10~ 7 
oj 5 /30 


3 • 10- 8 < 10“ 7 . 


(87d) 


We conclude, then, that the reactions of Eq. (83) have absorption coefficients 
which are at most 10" 7 of the absorption coefficient of the allowed reaction 
(| |) —^ (_L)i + (_L) 2 . A more detailed analysis shows that this upper band of 
10~ 7 applies particularly to the decay (||) —► (HX + (||) 2 ; in the case of all of the 
decays of an initially perpendicular photon, the upper bound can be reduced at 
least another two orders of magnitude, to 10~ 9 . Although all of our estimates 
have assumed B/B CR ^ 0.1, as B is further increased the electrodynamic terms in 
Eq. (81) rapidly increase in size relative to the plasma terms, causing our upper 
bound to get even smaller. On the other hand, if the plasma density n e is increased 
by a moderate factor the upper bound only increases proportionally. For example 
a factor of 100 increase in density to n e ~ 10 19 cm -3 leads to an upper bound on 
perpendicular photon splitting of 10~ 7 relative to the allowed case. We conclude , 
that plasma induced violations of our selection rule against perpendicular photon 
decay should be negligibly small . 


APPENDIX I: Proper-Time Calculation of the Refractive Indices and 

Photon Splitting Matrix Element 

We outline here the calculations leading to Eq. (51) for the refractive indices 
and Eq. (25) for the photon splitting matrix element. As noted in the text, we work 
to all orders in the strong external field B, without restrictions on wjm (other than 
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those needed to assure convergence of the final formulas), but we neglect virtual 
photon radiative corrections. This means that we are dealing with the problem of 
vacuum polarization effects produced by a c-number (unquantized) electro¬ 
magnetic field, which is a superposition of the strong, constant field B and of the 
plane wave fields of the photons. Diagrammatically, we are discussing the problem 
of a single virtual electron loop, with arbitrary numbers of interactions with the 
external fields and either with two photon vertices (refractive index calculation, 
Figs. 9 and 11) or with three photon vertices (photon splitting calculation, Fig. 4). 

Very powerful techniques for dealing with the vacuum polarization of c-number 
fields were developed some time ago by Schwinger [7], and we will follow his 
methods quite closely. We begin by finding an expression for the electromagnetic 
current density OwW> induced by vacuum polarization effects at point x when 
an external c-number electromagnetic field A u is applied to the vacuum. Denoting 
the electron field by ifs, and the electron-positron vacuum expectation by < > 0 , we 
have 

j u {x) = \e[x }{x), y u 4>(x)], 

<Ju(x)> = ie<Wx), y u ip ( x )]> 0 . 


(Al.l) 


In order to evaluate Eq. (Al.l), we introduce the electron Green’s function <7(x, x'). 


G(x, x) = i <T(>Kx) </<0'))>o 


(A 1.2) 


which satisfies the differential equation 


\ m - Y u (i ~ eA u (x))] G(x, x) = § 4 (x - x). 


(A 1.3) 


It is easy to see that <j w (x)> is just the limit of the Green’s function in which x 
symmetrically approaches x', 

ie Tr[y M G(x, x)] = \ie Tr [y u G(x, x')\ Xo=x<) ’^ + y u G(x, x') 

= ¥e Tr[yJ<>Kx) >J>(x)y 0 - yJ<Mx) ac index transposej 

= ¥<X4>{x), y u Hx)l >0 = <Ju(x )'/■ (A 1.4) 

Thus, to calculate the induced current it suffices to calculate the Green’s function 
G(x, x'). 

As Schwinger shows, this calculation is facilitated if we introduce a condensed 
notation in which G(x, x') is regarded as the (x | **• | x') matrix element of an 
operator G, 

G(x, x') = (x | G | x'), 

§(x — xO = (x | xO* 


(A1.5) 
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Introducing the additional operator tt u , 


It* = * - eA u(x), 


(A1.6) 


the differential equation (A1.3) for the Green’s function can be rewritten as the 
algebraic operator equation 


(m — y • 7r) G = 1. 

Inverting Eq. (A1.7) and substituting into Eq. (A 1.4), we get 
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where we have used the fact that the trace of an odd number of y matrices is zero. 
The next step is to exponentiate the operator [ m 2 — (y • tt ) 2 ] -1 using the identity 
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which on substitution into Eq. (A1.8) gives 


(jjx)/ = — \e f ds e im?s Tr [y u y v (x \ tt v U(s)\ x) + y v y u (x \ U(s) n v \ x)]. 

J 0 


(A1.10) 


Let us now exploit the fact that U(s) is just the “proper-time evolution” operator 
U(s) = exp [—iJfs] for the quantum mechanics problem with Hamiltonian 


34? — —(y * 77 ) 2 = —7r 2 — |eo- * F, o • F = a uv F uv , 
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and with “proper time” s. Introducing the definitions 
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(A1.12) 
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we can rewrite Eq. (A1.10) as 


<j u 0)> = - ie f ds e- im ' ls Tr[y u y v (x(s) \ tt^s) x(0)) 

J 0 


+ YvY u (x{s) \ TT v (0) j x(0))], 


(A1.13) 


or equivalently, as 


r 

= . l e ds e~ imh Tr[(x(^) | tt u (s) + 77 u (0)| x(0)) 

J 0 

- io u \x( S ) \ 7T v (s) - 77,(0)! X(0))], (A1.14) 

which is our final expression for the induced current. Using the commutation 
relations 


[x u , 77 ,] i§uv 5 

K. , 77 ,] = ieF uv (x), 


(A1.15) 


we deduce from Eq. (A 1.12) the following equations of motion satisfied by x u {s) 
and tt u {s). 


dx u {s) 

ds 

dn u {s) 

ds 


= U-\s)[iJF, x u ] U(s) = 2tt u (s), 


= U~\s)[iJf, 77J U(s) 


(A 1.16) 


- -eW{s)F uv {x{s)) + F uv {x{s)) tt\s) + iFJs) F^(x(s))l 


with 


F^x) = dF a %x)/8x\ 
Z-ds) = U-\s) o aB U(s). 


(A1.17) 


So far we have made no assumptions about the nature of the external field 
F uv {x). Now, let us specialize to the case in which F uv is the superposition of a 
strong constant magnetic field F and a plane wave of amplitude a and field strength 
/, 

F uv {x) =F UV +f u ££), £=n-x, (A1.18) 


with n = (1, n) a null vector defining the direction of propagation of the wave. 
Taking our constant magnetic field to point along the 3 axis, we have 

F 21 = —F 12 = B; all other components — 0 . (A 1.19) 
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To calculate the refractive indices, we will take 

= € U e M 9 

f»v(£) = i<*>(n v c» — n u € v ) e tco \ 


(A 1.20) 


and compute the term in OwW) linear in €, while to calculate the photon splitting 
matrix element, we will take 


aM) = 

f»XQ = ^i<A e iu — e‘" lf + ia> 2 {n v € 2u — n u e 2v ) e ta)2e . 


(A1.21) 


and compute the term in </ w O)> bilinear in and e 2 . 

The procedure now is as follows. First, we substitute Eqs. (A1.18) and (A1.19) 
into the equations of motion, Eq. (A1.16). We then perform two integrations 
which give us expressions for tt u (s) ± tt w (0 ) as linear combinations of a quantity 
0 U , which is entirely of first order in the plane wave field f uv , and of x u (s) — x w (0): 

®u(s) = -eW{s)f uv (m + fji(s)) n(s) + iZJs)r B M{s))l 
TT u (s) - = C ( ~^[xXs) - xXO)] + f dt 0 u (t), 

J 


7T u (s) + 7T U (0) = cl' )v [xXs) - xXO)] + f s dt T(s, t); &Xt), 

J 0 


C (T) = c-no. matrices, 


(A 1.22) 


T(s, t)j = 


C(s, t ) = 


~v 0 
0 C(s, t) 
0 S(s, t) 
0 0 

2 t/s — 1, 

sin(eB^t') 

sin(eB^) 


0 

-S(s, t ) 
C(s, t ) 
0 


T o T 1 
1 o 1 0 

7\° 

T a ° ••• 


r , 8 


S(s, t ) = 


cos(eBsv) — cos(eBs) 
sin(eBs) 


Substituting Eq. (A1.22) into Eq. (A1.14), we find that the terms proportional to 
x v (s) — x v (0) vanish, since 00) | x v 0) — x,(0)| x(0)) = x v — = 0, giving 


aw> = - ie f ds e~ im2s f * Tr{[T(j, t)J - ia u v ](x(s) I *(0))}. (A1.23) 

J o J o 


Our next step is to systematically develop Eq. (A 1.23) in a power series in the 
plane wave amplitude. This is done by going over to an interaction picture in 
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which the zeroth order approximation describes the constant magnetic field alone, 
with no plane waves present. Thus, we write 

U w \s) = e ~ iX>m \ 

^(0) = _ ;r (0)2 _ l e(y .p, 

[4 0) , n?] = -ig uv , (A 1.24) 

77 (0 >(0) = 77 (0) , x (0) (0) = x (0) , 

77 (°)(s) = U w ~ l {s) 77 <°)( 0 ) U w \s), x m (s) = x ( 0 , (0) U m (s). 

As Schwinger has shown, the proper time evolution problem defined by Eq. (A1.24) 
can be simply and exactly integrated, giving 


= 

A) = 
ms = 


R(s); n? ( 0 ), 
xi o) (0) + I(s)J IT 1 '? (0), 


1 


0 


0 


0 


0 cos(2e5,y) —sin(2e5,y) 0 
0 sin (2 eBs) cos{2eBs) 0 




0 

~2s 
0 

0 

L0 


0 

0 _ 

sin(2e5 l s') 

eB 

1 — cos(2eBs ) 

7b 

0 


0 1 
0 

cos(2eBs) — 1 
eB 

sin(2 eBs) 
eB 
0 


(x {0 \s) | x ( 0 ) (0)) = -/(4tt) 


esB 


sin (esB) 


_ g—2g%ie$o‘F 


0 

0 


„ (A1.25) 


0 
2^J 


We now develop the problem in the presence of the plane wave field in a pertur¬ 
bation expansion around the zeroth order solution of Eq. (A1.25). At zero proper 
time, the exact and zeroth order coordinate are the same, while the exact and 
zeroth order canonical momenta differ just by the plane wave amplitude, 

XM = x?{Q), 77 U ( 0 ) = 77 ^( 0 ) - eaMm = ^ o) (0) - ea u (^ (o) (0)). (A 1.26) 

To find the relation between the exact and zeroth order time evolution operators, 
we use Eq. (A 1.26) to write 

# = jw) + e [a u (£M) tt + 7r w »a u (^) 

- eaM {Q) ) <r(£ i0) ) ~ (A 1.27) 
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Defining A = — iJf [Q) s 9 B = —i(Jf — 3f {Q) ) s, we apply the identity 

6 a\b _ 6 a j eX p j^J df e~ At Be At J, (A1.28) 

where T is the time ordering operation. Making a change of variable u = st, this 
gives 


U(s) = U {0) (s) Uj(s) 9 

Uj(s) = T exp \ — ie f du [tf w (£ ( 0 ) (w)) 7r [0)u (u) + 7r {0)ti (u) a M (^ ( 0 ) (w)) 

( J o 

- ea u (Z ( %)) a u (i (n \uj) - *2$(«) P B (f a) (u))} j, (A 1.29) 

where £$(w) is defined by 

2$(u) = U (a) -\u) o ae U (0 \u) = (A 1.30) 

As expected, the time evolution operator in the interaction picture is constructed 
from dynamical variables £ { 0 ) (w), 7t [0) (u) and ^^(m) which have the proper-time 
dependence of the unperturbed problem. 

We now use Eq. (A1.29) to rewrite our expression for <y w (x)> in final form. 
Referring back to Eq. (A1.23), we write 

(x(s) | 0 v (t)\ x(0)) = (x(0) \ U(s) U~\t) 0 V (O) U(t)\ *(0)) 

= (x(0) I U (0 \s) U,(s) Uj\t) U (0) -\t ) u ( %) U,(t)\ x(0)). 

(A 1.31) 

Making use of the explicit expression for 0 V in Eq. (A 1.22), recalling Eq. (A1.26) 
and substituting Eq. (A1.31) back into Eq. (A1.23), we get 

r s 

ds f dt Tr{[T(s, t)J - ioj) 

0 J 0 

x (x ( %) \ U,(s) U7\t)[^\t) +f u M ( °\ 0) w (o), (0 

- 2 ea\^\t))U,{t)\ * (o) (0))}. (A1.32) 

By expanding the operators Uj in this equation to the requisite order in the plane 
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wave amplitude, we can obtain expressions for both the refractive indices and the 
photon splitting matrix element, as follows: 

(i) Refractive indices. We take the plane wave amplitude as in Eq. (A1.20) 
and keep only first order terms in Eq. (A1.32), giving 

<Ux)> = |e 2 f ds e~^ I" dt Tr {[T(s, t); - io u l(x ( %) | 

j 0 J 0 

+ fM%)) * <0) '(0 + «(£%)){ x (o) (0))}. (A1.33) 

To evaluate the matrix element in Eq. (A 1.33), we use Eqs. (A1.25) to express 
7r (0) (/) and exp[zo>£ ( 0 ) (r)] in terms of x { 0 ) (.y) and x { 0 ) (0), 

?r« 0 >(/) = R(t) • I-^s) • [x {0) (s) - jc'°>(Q)], 
exp[zo>£ { 0 ) (r)] = exp[zWz * x { 0 ) (0] (A1.34) 

= Qxp{ico[n * (1 — I(t) * • x ( 0 ) ( 0 ) 

+ n • I(t) • [(s)- 1 * x ( 0 ) (.y)]}. 


Since the commutator 


[4°V), *, (0) (o)] = u(s) uv 

is a c-number, we can then use the identities 

e a b = be a + [a, b] e a 


(A 1.35) 


(A1.36) 


(valid when [ a , [ a , Z?]] = [b, [a, Z?]] = 0) to bring all factors x { 0 ) (.y) to the left and 
all factors x ( 0 ) (0) to the right in Eq. (A1.33), where they act on the left- and right- 
hand states to give c-numbers, 


(x { 0 ) (.y) I x i0) (s) = (x { 0 ) (.s) | x, 
x { 0 ) ( 0 ) | x ( 0 ) ( 0 )) = x | x ( 0 ) ( 0 )). 


(A1.37) 


This leaves a completely c-number expression multiplied by the transformation 
function (x (0) (j)I x { 0 ) (0)), which is given by Eq. (A1.25). Note that the matrix 
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element (x (0) 0)i X$(/)| x (0) (0)) is equal to (x (0) 0)| x (0) (0)) £$(/) but not to 
Xi$(0(* (0) (' s ')l x (0) (0)), since the state | x {0) (0)) has no y-matrix dependence, while 
the state (x (0) (j)I has the y-matrix dependence of U {0) (s). Evaluating the y-matrix 
traces, contracting tensor indices and making the change of integration variable 
(contour rotation) s —► — is, t-+ —it gives the final result. A particularly simple 
answer is obtained for the cases in which the plane wave is linearly polarized in 
the ([[) or (J_) senses defined in the text. Taking, for simplicity, sin 8 — 1, we get 


[I case: 0* u > = —a > 2 A ll [a>, B ] a u , 

_L case: <_//> = — t o 2 A ± [a> > B] a u , 


(A 1.38) 


with A^' 1 - expressions given in Eq. (51) of the text. 

To relate the refractive indices to AW'\ we note that in the self-consistent field 
approximation, the propagation eigenmodes satisfy the equation 


12JI.-L 


□ a < ,x = = -tu 2 (l - wfi.x) a'J 

= <_/i. U > ~ -^ Iu k B] a'J-\ 


,± 


(A 1.39) 


This gives n* u± ^ 1 — A B], or taking the square root, 

«ll,x ^ 1 — iA Ut± [a) 9 B], 


(A 1,40) 


as stated in the text. Note that in deriving Eq. (A 1.40), we have in two places 
assumed that W|| iX are not much different from unity. The first place is in Eq. 
(A1.39), where we have used the coefficients AW^[co, B] computed for plane waves 
satisfying the usual vacuum dispersion relation, rather than satisfying k/oj = rt\\ tl _. 
(Recall that in Eq. (A 1.18) we took £=«■*, with n a null-vector.) The second 
place is, of course, in taking the square root to get Eq. (AMO). Referring back to 
Eq. (55) of the text, we see that n\ j i± will be close to unity provided that 


a / B 

7T \ B cr 


<h 


(A 1.41) 


a condition which is still well satisfied even when E/B CR is of order unity. Our 
final formulas for the refractive indices are nearly identical with those obtained 
previously by Minguzzi [2], whose procedure we have followed rather closely. The 
only difference is that for the first term in JW^(s,v)(sqq Eq. (51)) Minguzzi has 
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=F 1 instead of ^feBs cosh(eBsv)/sinh(eBs ), an error which results 5 from his 
incorrectly replacing in Eq. (A1.33) by its average s~ r Jo dt X$(0 - 6 

(ii) Photon splitting matrix element. To calculate the photon splitting matrix 
element, we take the plane wave amplitude as in Eq. (A1.21) and compute the 
second order terms in Eq. (A1.32), using the expression in Eq. (A1.29) for t/j. 
This gives 

a«> = f * f dt Tr{[T(s, t)J - io u v ] 

Jo Jo 

X (x ( %) [ -2 

- i f S du [a o (| (0) (u)) t r (o) » + 77 (a) °(u) a a (f'\u)) - \E^{u)f\^\u))] 

J t 

x + fui (o) (t)) 7r (a) \t) + m 

- fjf'Xt)) + u^\t)) 

x f* du [a o (| ( 0 ) (u)) rr^iu) + a 0 (t'\u)) 

J 0 

- /*(£%))] I x (o) (0)). (A 1.42) 


5 The error first appears in Minguzzi’s analysis when, in his version of Eq. (Al. 16), he writes 
\o a $ F^{s) instead of J E a ${s) F^{s). This makes the final term of the matrix element in Eq. 
(Al. 33) read (x i 0 ) (j) I |a a ^/^(^ { 0 ) (V)) | x i 0 ) (0)). Then, when evaluating (x ( 0 ) (j) | o al 3 1 x ( 0 ) (0)), instead 
of simply equating this to (x l0i (s) \ x l0) (Q))o a p , Munguzzi notes that (x t0) (j) I x i0) (0)) oc expCjfeftr • F) 
x (y-matrix independent factors) and then regards (x l0) (s) \ o a $ \ x i 0 ) ( 0 )) as the variational derivat¬ 
ive of ( x i0 \s ) | x < 0 ) ( 0 )) with respect to a small change in the constant field F. Thus, he writes. 


(* (0, Cr) I o a& | *<°>(0)) “=” —3. (*“”(*) I *<°>(0)) 


h€ a& 
B 




^iesF-* 

exp (iiesa • F + e • <7) x (y-matrix independent factors) 


= (x m (s) i x l0l (0)) jL- rexp [J dt e-l i “ ta - F e ■ <7 J 

= (x'«Hs) | x'»>(0)) j -1 f dt Z™(0, 

J 0 


where use has been made of Eqs. (Al. 28) and (Al. 30) and where, in the final line, the change of 
variable st -> t has been made. So we see that Munguzzi’s two errors result in his replacing 
(x ( 0 , (j) | jc 10 ) (0)) £<$(;) by the r-average of this quantity. As a result of this error, in Munguzzi’s 
version of Eq. (51), the function Wl< governing the convergence of the representation is — 
co 2 /(l — tls)-m 2 s , rather than the expression given in Eq. (53). This leads Munguzzi to the in¬ 
correct conclusion that absorptive contributions to rt\\ begin at o> = 2m, rather than at the larger 
value a) = m[ 1 + (1 + 2BjB C id 1/2 ] obtained from Eq. (53). As we have noted, the larger value 
is the one which agrees with the parallel-photon photopair production threshold found by Toll. 

e As we have implied in the text, the refractive index calculation is simplest when sin 6 = 1. To 
obtain the answer for general sin 0, we note that the only Lorentz scalars on which the refractive 
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Letting 0*£ Ilinear > denote the part of Eq. (A1.42) which is bilinear in € x and e 2 , the 
matrix element for photon splitting is 

Ji = -i(47t)»/® € M<jMUnear >> (A1.43) 

with € the initial photon polarization. The evaluation of Eq. (A1.42) can be carried 
out by the same methods used to obtain the refractive indices, and leads to the 
result quoted in Eq. (25) of the text for the physically interesting case (II)—►(_L) 1 -f. 

U ) 2 * 6 


APPENDIX II: Small Opening Angle Corrections to the Box Diagram 


We estimate here the nonvanishing box diagram contribution to photon splitting 
which arises when spatial variation of the external magnetic field causes the three 
photon momenta to be nonparallel. With trivial modifications, as explained 
below, the calculation also applies to the case in which the external magnetic field 
is strictly constant and the nonparallelism results from vacuum dispersion effects. 
Our aim is to show that, in both of these cases, all terms in the box diagram matrix 
element are at least quadratic in the small angles fa , fa , fa 2 between the photon 
wave vectors. 

We proceed by considering the most general momentum-depen dent term 
appearing in the part of the photon splitting matrix element which has one external 
field factor F, whenF carries nonvanishing four-momentum p. This is 


FF^F^F k • • * k k r * • • k x k 2 * * * k 2 p • • • p , 
i factors m factors n factors r factors 


(A2.1) 


with f + m + n + r even and with all Lorentz indices contracted to form a 
Lorentz scalar. We consider various cases in turn: 

(i) r > 0. Since p = (0, p) and, according to Eq. (36) in the text, | p | ~ fa, 
Eq. (A2.I) is of order fa at least. 

(ii) r = 0. We distinguish three principal subcases. 

(a) Two photon four-momenta are contracted with F. Since the photon 
four-momenta are proportional, apart from terms of order fa and since F is an 


indices can depend are — 2B 2 and k^P^Fpk^ = <d 2 B z sin 2 0, indicating that the recipe is 

simply to replace a> by a> sin 6. A similar argument in the photon splitting case indicates that the 
matrix element for general 6 is obtained from that for 6 = ttJ2 by making the replacements 
o>i, a> 2 to sin 0, o>! sin 0, o> 2 sin 0. 
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Photon splitting in a strong magnetic field is an inter¬ 
esting process, both from a theoretical viewpoint because 
of the relatively sophisticated methods needed to do the 
calculation, and because of its potential astrophysical ap¬ 
plications. The first calculation to exactly include the cor¬ 
rections arising from nonzero photon frequency co was 
given by Adler [1], who obtained the amplitude as a triple 
integral that is strongly convergent below the pair produc¬ 
tion threshold at co = 2m , and who included a numerical 
evaluation for the special case co = m. Subsequently, the 
calculation was repeated by Stoneham [2] using a differ¬ 
ent method, leading to a different expression as a triple 
integral, that has never been compared to the formula of 
Ref. [1] either analytically or numerically. Recently, a 
new calculation has been published by Mentzel, Berg, and 
Wunner [3] in the form of a triple infinite sum, and numeri¬ 
cal evaluation of their formula by Wunner, Sang, and Berg 
[4] claims photon splitting rates roughly 4 orders of magni¬ 


tude larger than those found in Ref. [1], Since this result, if 
correct, would have important astrophysical implications, 
a recalculation by an independent method seems in order. 
We report the results of such a recalculation here, together 
with a numerical comparison of the resulting amplitude 
with those of Adler and of Stoneham, as well as with a re¬ 
cent recalculation independently carried out by Baier, Mil¬ 
stein, and Shaisultanov [5]. The comparison shows that 
these four independent calculations give precisely the same 
amplitude, showing no evidence of the dramatic energy de¬ 
pendent effects claimed in Refs. [3] and [4]. 

Our recalculation of the photon splitting amplitude uses 
a variant of the world line path integral approach to the 
Bern-Kosower formalism [6-9]. As is well known, the 
one loop QED effective action induced for the photon field 
by a spinor loop can be represented by the following double 
path integral: 


r [a] = 



ye'" 2 * f Dx DiAexp - £* dr(j i 2 


1 • 

+ — \fj\fj + ieA 



ie^F^ v 



Here s is the usual Schwinger proper-time parameter, 
the x^(tY s are the periodic functions from the circle 
with circumference s into spacetime, and the i/^(t)’s are 
antiperiodic and Grassmann valued. 

Photon scattering amplitudes are obtained by specializ¬ 
ing the background to a sum of plane waves with defi¬ 
nite polarizations. Both path integrals are then evaluated 


by one-dimensional perturbation theory; i.e., one obtains 
an integral representation for the N-photon amplitude by 
Wick-contracting N “photon vertex operators” 



2i*j/ fl ijf v ki l £ l ,]exp[ikx(T )]. ( 2 ) 


The appropriate one-dimensional propagators are 


{y lx {ri)y v {T 2 )) = -g^Gsir i,r 2 ) = -g 




2-i 



(<A m (ti)^"(t2)) = jg^G F (T i,t 2 ) = jg M "sign(Ti - r 2 ). 



The bosonic Wick contraction is actually carried out in the relative coordinatey(r) = x(t) — xo of the closed loop, while 
the (ordinary) integration over the average position xo = ~ f s 0 dTx(r) yields energy-momentum conservation. 


Reprinted with permission. 
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To take the additional constant magnetic background 
field B into account, one chooses Fock-Schwinger gauge, 
where its contribution to the world line Lagrangian 
becomes 


AZ = ^iey fJ 'F fJbV y v - iexjf^F ^ v \fj v . (4) 


Being bilinear, those terms can be simply absorbed into 
the kinetic part of the Lagrangian [9,10]. This leads to 
generalized world line propagators defined by 

“ 2z'eF t 2 ) = <5(n - r 2 ) - —, 

2 \dr z drJ s 



\ 

2 


dr 


- 2ieFjQ F (T U T 2 ) = S(r\ - r 2 ). 



The solutions to these equations can be written in the form 

[ 11 ] 


Qb(t i, t 2 ) — 


eF 


2 (eF ) 2 Vsin(ejF) 


-iesFG B |2 


1 


+ ieFGsn “ — L 

s 


u t 2 ) — Gf 12 


g ~/ejfG B |2 

cos(ejF) 


(7) 

( 8 ) 


(we have abbreviated Gsij — G b (t (? t;), and a dot always 
denotes a derivative with respect to the first variable). 


Those expressions should be understood as power series 
in the field strength matrix. To obtain the photon splitting 
amplitude, we will use them for the Wick contraction 
of three vertex operators Vo and Vi 2 , representing the 
incoming and the two outgoing photons. 

The calculation is greatly simplified by the special kine¬ 
matics of this process. Energy-momentum conservation, 
&o + k\ + &2 = 0 , forces collinearity of all three four- 
momenta, so that, writing ~ko = k = con, 

k\ = —k, k 2 = —k\ k 2 = k\ = k\ 

CO CO 

= k • k\ = k • &2 = k\ • &2 = 0 . ( 9 ) 

Moreover, a simple CP invariance argument together 
with an analysis of dispersive effects [ 1 ] shows that 
there is only one allowed polarization case. This is the 
one where the incoming photon is polarized parallel to 
the plane containing the external field and the direction 
of propagation, and both outgoing ones are polarized 
perpendicular to this plane. This choice of polarizations 
leads to the further vanishing relations 

£ 1,2 * £0 = £ 1,2 * k = £ 1,2 * F = 0 . ( 10 ) 

In particular, we cannot Lorentz contract s\ with anything 
but £ 2 - This leaves us with only a small number of 
nonvanishing Wick contractions. 


(VoViv 2 > 


- n/o 


dr\ i exp 


4- X Mitej n( 3Bijn 


2 * 
z iJ-0 


[£l&12£2 + £ lQ F l2 £ 2 co \ M 2^Q F \2 n ] 


- ^ 

X - I <*>iS0§B0in + coQSo(g F oon 


/- 0 


- coocoico2£]<g F i2S2[n<g F ioeon<g F 2on — (1 «-► 2 )] 


( 11 ) 


For compact notation we have defined coo = <*>,&\,2 = _ <^i, 2 - This result has still to be multiplied by an overall 
factor of (esB)cosh(esB)/(47rs) 2 sinh(esB), which by itself would just produce the Euler-Heisenberg Lagrangian, and 
here appears as the product of the two free Gaussian path integrals [ 8 ]. 

It is then a matter of simple algebra to obtain the following representation for the matrix element C 2 [<w, co\, C 02 , B] 
appearing in Eq. (25) of [1]: 

,8 


C 2 [<w, CO], C 02 , B] — 


m 


4 co 


co 


-I 

1 CO2 Jo 


■nrs 


0 S (esB) 2 sinh(ejfi) 
2 


/■*./ 

Jo Jo 


dr\ I dr2 
0 


X exp* 




2 *- 

L ij= 0 


G B ij + 


1 cosh(esBG B ij) 


2eB sinh (esB) 


X 


[— cosh(ejS)(/Bi 2 + coico 2 (cosh(esB) - coshiesBGsn))] 


X 


co(coth(esB) — tanh(ejfi)) — co\ 


cosh(esBGsoi ) 

sinh (esB) 


CO 2 


cosh(esBGso 2 ) 
sinh (esB) 


+ co co\co 2 


G f 12 


cosh(esB) 


[sinh(esBG B o\)(cosh(esB) — cosh(esBG B 02 )) — (1 «-► 2)] 


( 12 ) 


Here translation invariance in r has been used to set the position to of the incoming photon equal to s. Coincidence 
limits have to be treated according to the rules G B (r, r) = 0, G B (r, t) = 1. 
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Alternatively, one may remove Gb \2 by partial integration on the circle. This leads to the equivalent formula 

m % r°° _ cosh(esB) 


C2[(o , CO\, B ] = 


ds s e 


r 

JO 

r i 2 r 

X expj —— ^ wjtoj G B ij + 
l L 


(esB) 2 sinh (esB) fo ^ Tl fo ^ T2 

1 cosh(esBGsij) 


X 


X 


Gsni Gb 12 ~~ 


2eB smh(esB) 


+ Gb 12 


sinh(ejB(/£i2) 

sinh(ejB) 

— coth(ejZ?) + tanh(ejfl) + 

/ cosh(ejBGfi02) 1 


- 1 - 


coshjesBGsn) 

cosh(esB) 


• • 

o)\ cosh(esBGsoi ) + <^2 cosh(ej£(/502) 

co sinh(ejB) co sinh(ejB) 


+ y G B \2 


+ Gf\2 


L\ sinh(ejfl) esB 

OJ ( ■ sinh(esBG B oi) \ 
L W 2\ B01 sinh(ejB) / 
sinh(ejBGeoi) 
cosh(ejB) 


_ sinh(e.rBG fl oi) \ _ ^ 


sinh(ejB) / 


2 ) 


- (1 ~ 2 ) 


— coth(ejB) H-+ tanh(ejB) 


esB 


( _ cosh(e-rBG fl o 2 ) \ _ , ^ 
V cosh(ejfl) / 


(13) 


This form of the amplitude is less compact, but the 
integrand (apart from the exponential) is homogeneous in 
the coi. 

Finally, let us remark that the analogous expression 
for scalar QED would be obtained by deleting all terms 
in Eq. (11) containing a (gp , as well as the cosh(ejB) 
appearing in the overall factor and the global factor of 
—2 in Eq. (1). 

In order to compare the amplitudes of Eqs. (12) and 
(13) to those of Refs. [ 1 ], [2], and [5], we observe that 
both Eqs. (12) and (13) can be written in the form 

m 8 

Cz[c0,C0\,C02,B] — 


4 B 2 


CO CO \ CO 2 


Once all amplitudes are put in the form of Eq. (14), 
we can compare them by comparing the proper time in¬ 
tegrand J 2 {s,co,co\,co 2 ,B), which in each case involves 
only a double integral over a bounded domain. The only 
remaining subtlety is that we must remember that J 2 van¬ 
ishes as coco\co 2 for small photon energy; this is manifest 
in Eq. (13) above, but in Eq. (12) and the corresponding 
equations obtained from Refs. [1], [2], and [5], there is 
an apparent linear term in the frequencies which vanishes 
when the double integral is done exactly. In order to get 
robust results for small photon frequency when the double 
integral is done numerically, this linear term must first be 
subtracted away, by replacing expressions of the form 


X 


r- ds_ 

Jo S 


trrs 


J 2 (s, co, C0\, 0)2, B), 

(14) 


// 


e Q (L + C), 


(17a) 


in which J 2 is independent of the electron mass m. 
Inspection shows that the amplitude expressions of Adler 
[1] and Baier, Milstein, and Shaisultanov [5] are already 
in the form of Eq. (14), while that of Stoneham [2] can 
be put in this form by doing an integration by parts in the 
proper time parameter j, using the identity 

d 

(15) 


with L, 2, and C, respectively, linear, quadratic, and cu¬ 
bic in the photon frequencies, by the subtracted expression 


f f [(e Q - 1 )L + e Q C]. 


(17b) 


m 2 e 


■m^s _ 




■m~s 


ds 


to eliminate a term proportional to m 2 in the amplitude. 
In rewriting Stoneham’s formulas in this form, we note 
that his M\(B) is what we are calling C^co, co\, C 02 , B], 
and that there is an error of an overall minus sign 
in either his Eq. (37) or the first line of his Eq. (40). 
Similarly, in rewriting the formulas of Baier, Milstein, and 
Shaisultanov in this form, we note that their amplitude T 
is related to C 2 by 


This subtraction is already present in the expression 
of Eq. (25) of Ref. [1], and is discussed in the form 
of Eqs. (17a) and (17b) in Ref. [5], and it also must 
be applied to Eqs. (37) and (39) of Ref. [2] after the 
integration by parts of Eq. (15) has been carried out. 
While in principle this subtraction should be applied 
to Eq. (12) above, it turns out not to be needed there, 
because the linear term in the frequencies involves only 
integrals of the general form 


f drif(s,Ti) f dr 2 [S(r l - r 2 ) - 1 /j], 
Jo Jo 


(18) 


C^_co , co\, a>2, B] — 


1 /2^8 


7T ' m 


4a^ B^coco\co2 


T . 


(16) 


which is exactly zero using a discrete center-of-bin 
integration method when the S function is discretized as a 
Kronecker delta. Thus Eq. (12) is robust for small photon 
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frequencies as it stands, when used in conjunction with 
center-of-bin integration. 

With these preliminaries out of the way, it is then 
completely straightforward to program the functions 
72 (j» <wi, for the five cases represented by the 

formulas of Adler [2], Stoneham [3], Eq. (12) of this 
paper, Eq. (13) of this paper, and Baier, Milstein, and 
Shaisultanov [5], with the result that they are all seen to 
be precisely the same; the residual errors approach zero 
quadratically as the integration mesh spacing approaches 
zero, as expected for trapezoidal integration. We have 
not carried out the s and o>\ integrals needed to get the 
photon splitting absorption coefficient, since this was 
done in Ref. [1], with results confirmed by the more 
extensive numerical analysis given in Ref. [5]. However, 
anyone wishing to do this further computation can obtain 
our programs for calculating the proper time integrand 
J 2 by accessing S.L. A.’s home page at the Institute for 
Advanced Study [12]. 
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In a series of recent papers, Johnson, Baker, and Willey study quantum electrodynamics 
with internal photon self-energy parts omitted. They find that in this model the asymptotic 
electron and photon propagators have remarkable, simple properties. In the present note we 
show that these properties can be derived in a very economical fashion by using the Callan- 
Symanzik scaling equations. 


In a series of recent papers, Johnson, Baker, 
and Willey 1 (JBW) have examined the question of 
whether quantum electrodynamics can be a self- 
consistent, finite theory. They start from the as¬ 
sumption that the Gell-Mann-Low eigenvalue con¬ 
dition 2 has a finite root e 0 , giving the renormalized 
photon propagator the asymptotic behavior 3 

e 2 b' F (q)p, a2 ~ 2 ~ g ^i — + gauge terms , (1) 

m = electron mass, 

e 0 = finite bare charge . 

A simple application of Weinberg’s theorem 4 then 
shows that the asymptotic behavior of the renor¬ 
malized electron propagator Sp(p) is correctly ob¬ 
tained 5 by replacing all internal photon propagators 
by their asymptotic form, Eq. (1). Thus, one is 
led to consider quantum electrodynamics without 
internal photon self-energy parts. In this model, 
Baker and Johnson 1 find, using renormalization- 
group methods, that the asymptotic electron prop¬ 
agator has the remarkably simple form 

Sf(P)' 1 ^^ 2 C[r 'P + ~P 2 Y] • (2) 

Here € is a power series in a 0 = e 0 2 /(4ir). 



a is a constant, and (in the Landau gauge where the 
electron wave-function renormalization 2^ is finite) 
C is another constant. According to Eq. (2), if 
€ >0 [as suggested by the leading terms in the ex¬ 
pansion of Eq. (3)], the asymptotic electron propa¬ 
gator is identical to the propagator of a free, 
massless fermion. This means that the electron 
bare mass is zero in the limit of infinite cutoff, 


and not divergent, as would be indicated by expand¬ 
ing (n?/-p 2 ) € in a perturbation expansion in at 0 and 
truncating at a finite order. 

Johnson, Baker, and Willey 1 have also studied 
the photon propagator in the model with no internal 
photon self-energy insertions. Introducing a cutoff 
A 2 to define the unrenormalized photon propagator 
and photon proper self-energy D^iq)^ and tt(< 7 2 ), 5 

e,*D' r (q)^= ^ + ^a"( a)] + gauge terms , (4) 

they find that, asymptotically, 

*(q a ) , f'-' k(a 0 )+f(ct 0 )ln(-q 2 /A 2 ) . (5) 

j2»m* A £ »m* 

For Eq. (5) to be consistent with the ansatz of 
Eq. (1) the logarithmically divergent term in 
Eq. (5) must vanish. This gives the simplified 
eigenvalue condition 

/(*„)= 0 , ( 6 ) 

which involves only vacuum-polarization graphs 
without internal photon self-energy parts. Equation 
(6) has been shown 1 to be equivalent to the Gell- 
Mann-Low eigenvalue condition (which involves all 
vacuum-polarization graphs), so the discussion 
starting from Eq. (1) is self-consistent. 

The purpose of the present paper is to consider 
quantum electrodynamics without internal photon 
self-energy parts from the viewpoint of the Callan- 
Symanzik 6 scaling equations. The basic idea which 
we exploit is that when photon self-energy parts 
are omitted, the troublesome coupling-constant- 
derivative terms, which would destroy scaling be¬ 
havior, do not appear in the Callan-Symanzik scal¬ 
ing equations for quantum electrodynamics. 7 As a 
result, application of the scaling equations in 
asymptotic situations leads to simple scaling be- 
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havior with an “anomalous” dimension. But this is 
just the type of behavior which Baker and Johnson 
find for the mass term in Eq. (2), so it is not sur¬ 
prising that the Callan-Symanzik equations lead to 
an economical derivation of Eq. (2). The same 
methods, we find, lead to a simple derivation of 
Eq. (5) as well. 

In deriving the Callan-Symanzik equations, we 
follow closely a method due to Coleman. 8 We first 
make the unrenormalized quantities n Iq, Z 2r and 
ir(q 2 ) finite by introducing an ultraviolet cutoff A 2 
and an infrared cutoff \x 2 in the following manner: 

(i) We take the propagator for internal photons 
to be 




1 

q 2 - n 2 + zt 


-A 2 

<7 2 -A a + 1€ 



This means that we are working in massive elec¬ 
trodynamics with photon mass /x 2 . Since internal 
photon self-energy parts are omitted in our model, 
there is no distinction between bare and physical 
photon mass. 

(ii) We calculate the lowest-order vacuum- 
polarization contribution to n(q 2 ) [see Fig. 1(a)] in 
the following manner. First we impose gauge in¬ 
variance to remove the quadratic divergence, and 
then we regulate the fermion loop, with fermion 
regulator mass A, to remove the logarithmic di¬ 
vergence. 

(iii) All vacuum-polarization loops with four or 
more vertices [see Fig. 1(b)] are calculated by im¬ 
posing gauge invariance, which makes them finite. 
The requirement of gauge-invariant calculation of 
loops, together with the photon-propagator cutoff 
specified in (i), renders convergent the vacuum- 
polarization contributions to n(q 2 ) of the type illus¬ 
trated in Fig. 1(c). As a result of this cutoff 
scheme, the quantities Z 2t and ir(q z ) become 

A-dependent. On the other hand, because we omit 
internal photon self-energy parts, the photon cou¬ 


j 



•p-m o 


On the right-hand side of Eq. (9) the operator 
m(d/dm) + n(d/dfx) is understood to act only on the 
quantity enclosed with it in square brackets; in de¬ 
riving this equation, we have used the fact that 
Z(p) can depend on the physical mass m only 
through the bare mass m^. The second and third 
terms on the right-hand side can be simply inter¬ 
preted as follows: The quantity 1 + 82 (p)/dm^ ap- 




(b) 



+ + 



FIG. 1. (a) Lowest-order vacuum-polarization contri¬ 
bution to n(q 2 ). (b) Vacuum-polarization loops with four 
or more vertices, (c) Vacuum-polarization contributions 
to n(q 2 ) which involve the loops with four or more ver¬ 
tices illustrated in (b). 


pling constant e 0 is a fixed number, independent of 
A and of the physical electron and photon masses 
m and . 

Having precisely specified our model, we are 
ready to discuss the scaling behavior of the elec¬ 
tron propagator. The renormalized and unrenor¬ 
malized electron propagators are related by the 
equation 

S^pr^ZtS'Apy^Zjiyp-mv-lKp)], ( 8 ) 

with 2(p) the unrenormalized electron proper self¬ 
energy. Let us consider the change in Eq. (8) when 
the physical electron and photon masses m and n 
are varied, with the ratio p/m, with A, and with 
e 0 all held fixed. This is described by acting on 
Eq. (8) with the differential operator m(8/8m) 

+ m(9/8m), giving 



8 8 
m — +n — 
dm 8 fi 



1 + 


as(/>) ~ 

awo _ 


~ M 


9 Up) 

Bfx 



i---—— ----- 

pearing in the second term is just the zero- 
momentum-transfer vertex of the scalar electron 
current = 


x , 8S(p) 

9Wo 


= r s (/>, p ). 


( 10 ) 


Typical diagrams contributing to T s (p t p) are illus¬ 
trated in Fig. 2(a). Because internal photon self- 
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energy parts are omitted from S f F (p)~\ they are 
omitted from T s (p t p) as well; absent in addition 
are diagrams of the type shown in Fig. 2(b), which 
would arise from electron-mass differentiation of 
an internal photon self-energy part. The quantity 
fxdZ(p)/dn appearing in the third term is a second 
type of scalar vertex at zero momentum transfer, 

/>,/>). (ii) 


Diagramatically, it is the sum of contributions ob¬ 
tained by replacing successively each internal pho¬ 
ton propagator (of four-momentum, say, q) by 


photon propagator ( q)x 


U 


q 2 - n 2 + ie * 



as illustrated in Fig. 3. 

The next step is to reexpress the right-hand side 
of Eq. (9) in terms of renormalized quantities. 

Since the skeleton graphs for T S '(p t p) are all con¬ 
vergent, this vertex is made finite by multiplica¬ 
tion by Z 2t 

P ) s ^S'( Pi P) 

^cutoff-independent as A-». (13) 

By contrast, the vertex T s (p t p) has divergent skel¬ 
eton graphs [see Fig. 2(a)], and so needs a vertex- 
renormalization factor in addition to the wave- 
function renormalization Z 2 . In Appendix A we 
show that this factor is just the bare mass m 0 , 

2 r s( Pi P) = mf s ( p , p) 

~ cutoff-independent as A - °° . (14) 
Hence Eq. (9) takes the final form 



= -m(l + a)f s (p, p) - n 2 r S ’(p, p ), 


(15a) 

with 



(15b) 


Equation (15) is a typical Callan-Symanzik scaling 
equation, as simplified by the neglect of internal 
photon self-energy parts. 9 

Let us now consider the behavior of Eq. (15a) as 
p becomes infinite in a spacelike direction. We 
will keep all terms which are constant or which 
grow as powers of In/) 2 , but will drop terms which 
vanish as (/>~\/>~ 2 , .. .)x(powers of In p 2 ). By a 
simple application of Weinberg’s theorem to the 



+ • • ■ 


(a) 



FIG. 2. (a) Typical diagrams contributing to r s (/>,/>). 
(b) Type of diagrams which are omitted from T s (p ,/>), 
because they could arise only from electron-mass differ¬ 
entiation of an internal photon self-energy part. 

graphs which contribute to f s and t s > (see Figs. 2 
and 3) we find that, to any finite order of perturba¬ 
tion theory, 


f s (/>,/>) ry (powers of In/) 2 ), 
f Ap,p) p 1 x (powers of In/) 2 ). 

p-+<n 


(16) 


Thus, in the asymptotic limit, f s must be retained 
in Eq. (15) but f s *(/),/)) may be dropped. Further¬ 
more, introducing the general functional forms 


S F (p) 1 = y • pF(p 2 /n #, jLt 2 /m 2 , e 0 ) 
+mG(p 2 /m 2 , p 2 /nP, e 0 ), 

rnT s (Py p)=y pH(p 2 /m 2 t p 2 /m 2 , e 0 ) 
+mJ(p 2 /m 2 i mVw 2 , e 0 ), 


(17) 


and applying Weinberg’s theorem again, we find 
that F , G, H t J have, to any finite order of per¬ 
turbation theory, the asymptotic behavior 


F t G t J (powers of In/) 2 ), 

p-'~ 


H p'~ 1 '* (powers of In/) 2 ). 


(18) 


Substituting Eq. (17) into Eq. (15), equating sepa¬ 
rately the coefficients of y *p and m, and dropping 
terms which vanish asymptotically, we get 


m 


JL +fl A. + ^ M 7m 2 , e B ) ~ 0 , 


(19a) 


m 


am + M 3 m" + v) mG (P 2/,m2 > mYw 2 , e 0 ) 

~nt( 1 + a)J(p 2 /m 2 t \i 2 /r*F, e 0 ). 

p-+«> 

(19b) 

From Eq. (19a) we learn a number of things. 

^jr 


FIG. 3. A typical diagram contributing to IV(/> ,/>). 
The doubled photon propagator denotes 

S^i/AV (? 2 ~M 2 + ier 2 (fl 2 ~ A 2 + ie) -1 . 
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First, since everything in this equation except y 
is cutoff-independent, y must be cutoff-indepen- 
dent, 10 i.e., 


y=y(.l* 2 /m 2 ,e 0 ). 


( 20 ) 


Comparing with Eq. (15a), we then learn that a is 
cutoff-independent also, 10 i.e., 


a=a(n 2 /m 2 9 e 0 ). 


( 21 ) 


[We will see shortly that there is actually no de¬ 
pendence on iL 2 /n? in Eqs. (20) and (21).] Integrat¬ 
ing Eq. (15b), we find that the A dependence of Z 2 
and is given by 

Z i =C l ^ 2 /rr?,e^S 2 /n?V' 2 , 

( 22 ) 

w!o=C 2 (mV»J 2 , e 0 )m(A 2 /m 2 )- a/2 , 

with C x and C 2 dimensionless functions. Finally, 
integrating Eq. (19a) we find that F has a power 
dependence on p 3 for large spacelike p , 

F(p 2 /n< r 2 , mVw 2 , e 0 ) rw e 0 )(~p 2 /m 2 ) y/2 . 

P" ►«© 

(23) 

To obtain the equation satisfied by G which is 
analogous to Eq. (23), we must study the asymp¬ 
totic behavior of the quantity J appearing on the 
right-hand side of Eq. (19b). We do this by calcu¬ 
lating the Callan-Symanzik scaling equation satis¬ 
fied by f s (/>, />). Starting from Eq. (14), and pro¬ 
ceeding in analogy with the calculation of Eqs. (9)~ 
(15), we find 




=m(l + a)T w (/>, p) + v 2 f ss ,(p i p ), 

(24) 

with the electron-scalar four-point functions f& 
and Tgs. defined by 


m 2 fss( />, p) = m*Z 2 — T s ( p, p ), 


8W2 q 


a 


(25) 


Mii 2 T S s'(p i p)=m 0 Z 2 ii-~r s (p, p ). 


Again applying Weinberg's theorem, we find that 
the entire right-hand side of Eq. (25) vanishes 
asymptotically as p~ l x (powers of In/) 2 ), and so 
substituting Eqs. (17) and (18) we get 


p —.OO 


J(p 2 M » 2 , n 2 /m 2 ,e 0 ) 

/ 3 (M 2 /»« 2 .e„)(-/,Vm 2 )”-“ )/2 . 

(27) 

Substituting into Eq. (19b) and doing a final integra- 
tion, we get 

G ( p 2 /rr?, n 2 /m 2 ,e 0 ) 

~K(/>7m 2 ) (r+1)/2 -/ 3 (M 2 An 2 , e„)(-/> 2 A» 2 )<>-“ ,/2 , 

(28) 

with the first term a solution of the homogeneous 
equation 

( m iw + M+Y ) mG( 'P 2/m2 ’ mVw 2 , e a ) ~ 0 . 

(29) 

To determine A, we note that y cce 0 2 [see Eq. (15b)]. 
Hence when expanded in powers of e 0 2 the first 
term in Eq. (28) has, to any finite order in e 0 2 , the 
form 


K (j^ x (powers of ln/> 2 ), 


(30) 


which violates the Weinberg-theorem asymptotic 
behavior of G stated in Eq. (18). So we conclude 
that K = 0, giving 

-/ 3 ( M Vw 2 ,e„)(-/.Vw 2 ) ( ^“ ,/2 . 

(31) 


G(p 2 /m 2 , mVw 2 , e 0 ) ~ 


Defining 

f ( u 2 /m 2 e ) = £^V- 2 /—> g °) 

we may combine our results for the asymptotic be¬ 
havior of SJU/))’ 1 into the form 

SAP )- 1 ^i/.UVm 2 , e 0 ){-p*/mV /2 

x[y • P~mf 2 (ii 2 /m 2 , e 0 )(-p 2 /m 2 )- a/2 ]. 

(32) 

Our final step is to show that y and a are inde¬ 
pendent of iL 2 /m 2 . We do this by writing down the 
analogs of Eq. (15a) and Eq. (24) obtained by dif¬ 
ferentiating with respect to the photon mass p. only, 
with the electron mass m held fixed. These are 

• +r M )5>(/»)- 1 = -wa M f s (/», p) - n 2 f s .(p, p). 


(m^ t + ti-^ + y- a y(p 2 /m‘,^/m 2 ,e 0 )p L 0. a <) T ^P’P^- ma ^ T s^P>P) + ^ T ss-(P, p), 


(26) 




a„=Wb“V^->no- (33) 


Equation (26) may be immediately integrated to 
give 


8m 


Evaluating Eq. (33) asymptotically, substituting the 
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results of Eqs. (27) and (32), and separating the 
terms proportional to yp and m, we find the two 
equations 11 

( M ^ + ^ A+/i ln (S) M ( * y) ~ 0 ■ 

(34) 

( M + - a,)/,/, + fj 2 In (J^j h ^\(y - a ) ~ 0 . 


These equations can be satisfied only if the loga¬ 
rithmic terms vanish separately, which implies 


d 8 

— y ~ t— a = o, 

Bfi Bfi 


(35a) 


The vanishing of the constant terms in Eq. (34) then 
implies 





On substituting Eq. (22) for m 0 and Z 2 into Eq. (33) 
for and a^, and then inserting the resulting ex¬ 
pressions into Eq. (36), we find the equations 


fi/C i = F x (e ,)» / 2 /C 2 = F 2 (e 0 ). 


(37) 


giving the desired result 
7 = 7(6,,), a=a(e 0 ). 


(35b) 


Hence our final result for the asymptotic behavior 
of the electron propagator is 


Sf(P)' 1 Pi F i(e 0 JCjOiVw 2 , e„)(-/» 2 /m 2 ) rU o ,/2 [y-p - wF 2 (e„)C 2 (M 2 /w 2 , e 0 )(-p 2 /m 2 ) a(e ° )/2 ], 


(38) 


with C x andC 2 the same functions as appear in 
Eq. (22) for Z 2 and wa*,* 

The result of Eq. (38) is valid for arbitrary val¬ 
ues of the gauge parameter £ in Eq. (7). 12 Under 
the change of gauge 

ZQnQv _ \ 1 _ -a 2 

q 2 + 9 !-A ! + if 

i'quQv _ \ 1 _ -a 2 

q 2 Slw ) q 2 - n 2 + te q 2 -\ 2 + ie 


fi i-jy' r(i + |y)r(i -j y ') 

F l “ l-iy r(l ~{y)r(l + {y') 

xexp[(-a ll /4ir)(2yg - !)(£' - £)1, 

(41b) 

F£ F, r(l-4y-ia)r(l + iy' + ^a') 

F 2 F[ r(l+^y + ^a)r(l-Ay'-ia') 

xexp[(a 0 /4u)(2y E - !)(£' - £)1, 


(39) 

it is easily shown 13 that the unrenormalized posi¬ 
tion-space electron propagator transforms in the 
simple fashion 


Sjrlx) - exp{(r- Z)[\(x) - A(0)]}S>(x), 


X(x )-lh L_ [111 

M ; (2tt) 4 J q 2 


-A ; 


(40) 




q 2 - m q 2 -A 


By studying the large-* and small-* behavior of 
Eq. (40), we can find the behavior under gauge 
transformation of the quantities F 19 C lt F 2t C 2 , a, 
and y appearing in Eqs. (22) and (38). Suppressing 
the dependence on n 2 /m 2 and e 0i and letting primed 
quantities denote those computed with gauge pa¬ 
rameter and unprimed quantities those computed 
with gauge parameter £, we find 


7'-7-(a./ftr)U'-«). 

a '- a = 0 , 

Cl (nP Y a ° /4Tr)(r-£) C£_ 

Ci = \M 2 / ’ C 2 =1, 

Zi J Afyoo/^W-C) ^ 
Z 2 ~\n 2 ) ’ m 0 = 1; 


(41a) 


with y E = 0.577 21... = Euler’s constant and with T 
the usual T function. The derivation leading to 
Eq. (41) is given in Appendix B. 

According to Eqs. (22) and (41a), if we choose 
to satisfy (a 0 /27r)(£' - £) + y(e 0 , £) = 0, then we have 
y'Hy(e 0 , £') = 0 and the wave-function renormaliza¬ 
tion Z 2 becomes finite as A-«. This choice of 
gauge (the Landau gauge) is the one used by Baker 
and Johnson in their work. In the Landau gauge, 

Eq. (38) becomes 

ndau gauge C[y • p + am(»n 2 /-/> 2 ) £ ] , 

(42) 

C = F'f \, a= -F'fi 2 > «=5d(e,), 

in agreement with the Baker-Johnson result stated 
in Eq. (2)_ We note also that the gauge-independent 
quantity w*b = m~E| r .p =m is finite as - 0 [only Z 2 
= 1 - 32/8(7 • p) | y . p=m is infrared-divergent]; hence 
the function C 2 (m 2 /^ 2 , e 0 ) appearing in Eqs. (22) and 
(38) has a finite limit as ^-*0. Let us give two 
simple second-order calculations which illustrate 
Eqs. (42) and (38). First, we calculate € in Eq. (42) 
by noting that to second order 
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1 + 


. v mid M 9 \ 
a ~ m 0 \dm + m dp j*” 0 

**-(=**&>«• 


(43) 


* 1 + . 

m \ dm 

£(?n) =E I y - p=m Q =m ■ 

Explicit calculation in an arbitrary covariant gauge 
shows that 

Ztyn) =>n[(3o!o/47r)l n (A 2 /m 2 )+fun c ti o n of (p 2 /nF )\, 

(44) 


which on substitution into Eq. (43) gives 

a(e 0 ) = 3o! 0 /27r, € = |a 0 /2ir, (45) 

in agreement with the second-order term in Eq. (3). 
As our second illustration we show that, to second 
order, Z 2j and the full renormalized electron 
propagator S f F {p)~ l in the Feynman gauge do satisfy 
Eqs. (22) and (38). A straightforward calculation 
gives 


5;( py 1 = z 2 (y • P - O - U J 0 ‘ <*4(1 - z)y ■ p - 2m] In ( ^ - +(1 


^2 






dz 


z(\-z 2 )2n? 
z 2 m 2 + ( 1 - z)p 




a 0 Tj.,, _n, / zm 2 +(l - z)p 2 - 

p - m \} ~ 2 # J 0 d2(1+2)ln l— + (f- 


- 2(1 - 2)/> 2 


from which we can identify the quantities appearing in Eqs. (22) and (38), 

y(e 0 ) = ~a 0 /2ir , a(e 0 ) = 3a 0 /27r, F^e,,) = 1 + 3a 0 /87r, 

_ , ,, , \ „ a B f 1 , [~/, , / 2 W + (1 - z)p 2 \ 2 (l- 2 2 ) 2 m 3 

C,(m /m,e„)-l + 2jr J o d2^(l z)ln^ j + (1 _ Z ) M * 

F 2 (e 0 ) = 1 + 5a 0 /87r, 


)]}■ 


]• 


(46) 


(47) 


As p - 0, we see that C 2 (p 2 /m 2 t e 0 ) approaches a 
finite limit, as expected. 

This completes our treatment of the electron 
propagator. Let us next turn briefly to the asymp¬ 
totic behavior of the photon propagator. 14 Because 
renormalization of the photon propagator is sub¬ 
tractive, i.e., 

Hq 2 ) = *(q 2 )-n(0), (48) 


m Jm 9 2 /V. q 2 /™ 2 , e 0 ) 

= m(l + a- a^stoVA. 2 , q 2 /V, q 2 /m 2 , e 0 ), 

a (49) 

^ niq2/ A 2 .9 2/ M 2 .9 2/ <«. ) 

=ma^ s (q 2 /\ 2 , q 2 /y. 2 , q 2 /m 2 , e 0 ) 

+ mW/A 2 , <?7m 2 , 9 2 /^ 2 , e 0 ). 


the renormalized photon self-energy jr involves, 
even for asymptotically large q 2 t the nonasymp- 
totic piece tt(0). As a result, the asymptotic be¬ 
havior of the renormalized photon propagator can¬ 
not be calculated by replacing all internal photon 
propagators by their asymptotic form, Eq. (1). On 
the other hand, the asymptotic unrenormalized 
photon self-energy ir(q 2 ) does involve only the 
asymptotic behavior of the internal photon propa¬ 
gator, and thus can be calculated in our model in 
which internal photon lines are replaced by Eq. (1). 
To proceed, we write down the two Callan-Syman- 
zik scaling equations obtained by differentiating 7 r 
with respect to m and with respect to p. These are 


with a and a„ given by Eqs. (15b) and (33) above 
and with the photon-photon-scalar three-point func¬ 
tions tt s and ir s , given by 


2 9 


(50) 


Let us now consider the asymptotic limit in which 
q and A both become large. Application of Wein¬ 
berg's theorem 15 shows that 7 r s and 7 r s , vanish as 
(powers of ln<? 2 ) and q~ 2 x (powers of ln<? 2 ), re¬ 
spectively, so that the right-hand sides of the scal¬ 
ing equations can be neglected, giving 
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m dm ff toVA a f q 2 /» 2 , q 2 /™ 2 , e 0 ) 0 , 

(51 > 

^(9 2 /A 2 , q 2 /v 2 , q 2 /m 2 , 0 . 

This tells us that the asymptotic unrenormalized 
photon self-energy has no dependence on m and n, 
that is, 

n(q 2 /\ 2 , q 2 /n 2 , q 2 /m 2 , e 0 )^~ m *(q 2 /h. 2 , e 0 ). (52) 

Furthermore, since to any finite order of pertur¬ 
bation theory the dependence of it on A 2 can only 
be through powers of InA 2 , then n(q 2 /A 2 f e 0 ) must 
have the form 

ir(q 2 /\ 2 , e 0 ) =£ B n ( eo )[ln(-<? 2 /A 2 )l" - (53) 

n =0 

We now invoke the fact that since it is gauge-inde¬ 
pendent we are free to choose the gauge which 
makes Z 2 finite, and since it contains no internal 
photon self-energy parts, the subintegrations of it 
which do not involve all lines in the graph are fi¬ 
nite. But the single subintegration which does in¬ 
volve all lines is made finite by a single differen¬ 
cing of it, 

ir(q 2 /A 2 , e Q ) - it(q 2 /A 2 t e Q ) = cutoff-independent, 

(54) 

which tells us that only the n = 0 and n = 1 terms can 
be present in Eq. (53). Thus, 

*(q 2 /\ 2 , q 2 /n 2 , q 2 /n,?, e 0 ) 

In (~q 2 /\ 2 ), 

(55) 

in agreement with the result of JBW stated in 
Eq. (5), with f(a 0 ) =B 1 (e 0 ) the function which deter¬ 
mines the eigenvalue condition. 

We wish to acknowledge the hospitality of the 
Aspen Center for Physics, where this work was 
done. 


APPENDIX A 

We give here a proof that mt s (p, p)=m Q Z 2 T s (p, p) 
is finite (cutoff-independent as A - «>) to all orders 
of perturbation theory. Let us define T 5 (p, p) to be 
the zero-momentum-transfer vertex of the pseudo¬ 
scalar electron current j 5 = ipy 5 ip. We have previ¬ 
ously shown, 16 by using the axial-vector-vertex 
Ward identity, that mf 5 (p , p) =m Q Z 2 T 5 (p > p) is finite 
to all orders of perturbation theory. 17 Let us de¬ 
fine 

Mp,p)=f s (p,p)y 2 -fHp,P). (Al) 

To zeroth order in perturbation theory, A(p, p) = 0 
is finite. Let us now make the inductive hypothesis 
that, to order n ~ 2 in perturbation theory, (i) 


A( /), p) is finite and (ii) as /)-«>, A(/), p)~ p~ x 
x(powers of ln/> 2 ). To prove that these hypotheses 
are satisfied in order n as well, we follow very 
closely the procedure used in Chap. 19 of Ref. 3 to 
prove that the usual renormalizations of electro¬ 
dynamics make the vector vertex finite. We begin 
by observing that mf s and mf 5 satisfy the integral 
equations (see Fig. 4) 

mf s = m 0 Z 2 - J mf s S' F S' F K , 

J ,_ (A2) 

mr 5 =m 0 Z 2 y 5 - I mT 5 S F S F K , 

with R the connected, renormalized electron-posi¬ 
tron scattering kernel, obtained by excluding the 
class of graphs shown in Fig. 5. Substituting Eq. 
(A2) into Eq. (Al), we find that the inhomogeneous 
terms cancel, giving the following expression for 
a: 

A = Jf s 5;S>K-Jf s s;s;x 7 5 . (A3) 

Since the perturbation expansion of R begins in sec¬ 
ond order, to calculate A to order n we need only 
insert f 5 and f s to order n - 2 on the right-hand 
side of Eq. (A3). But these are known to be finite 
by the inductive hypothesis, so the individual fac¬ 
tors appearing on the right-hand side of Eq. (A3) 
are finite. According to Weinberg's theorem, to 
see whether A is finite to order w, and to deter¬ 
mine its large-/) asymptotic behavior, we must de¬ 
termine* the naive degree of divergence D of each 
subintegration contributing to the right-hand side 
of Eq. (A3). As is shown on pp. 330-334 of Ref. 3, 
all subintegrations have D ^ -1, except possibly 
those involving both electron propagators S F and 
all lines in the kernel it. These are of two basic 
types, according to whether the electron-positron 
lines emerging from f s and f 5 do [Fig. 6(a)l or do 
not [Fig. 6(b)] connect directly with the external 
electron-positron lines entering the kernel it. 
Clearly, the diagrams shown in Fig. 6(b) involve 
a closed electron loop with a single scalar or pseu¬ 
doscalar vertex. Charge-conjugation invariance 
implies that such a loop can have only an even num¬ 
ber of photon vertices and an odd number of elec¬ 
tron propagators; the fact that the trace of an odd 
number of y matrices vanishes then implies that 
such loops are proportional to the electron mass 





FIG. 4. Integral equations satisfied by the scalar 
and pseudoscalar vertex parts. 
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FIG. 5. Class of graphs excluded from the kernel K. 

m. So the diagrams shown in Fig. 6(b) are all pro¬ 
portional to m t which improves the convergence by 
one power of momentum and gives them D £ -1. 

The contribution to Eq. (A3) of the diagrams shown 
in Fig. 6(a) can be written symbolically as 

A«<“ ) = J(f s -f s 7 s )S>Si^ 

- J fjs 'pS'rKy' - y 5 s;§;ic] 

= J AS;s;iC - Jfs[S ’ F S’ F Ky* - y s s;s>if ]. 

(A 4) 

The first term in Eq. (A4) has -1 because (to 
order n - 2) A satisfies assumption (ii) of the in¬ 
ductive hypothesis. The second term in Eq. (A4) 
is the residue obtained when the matrix y 5 is com¬ 
muted from its original position on the far right 
of Fig. 6(a), through the string of electron propa¬ 
gators and photon vertices, to a position immed¬ 
iately to the right of the vertex f s . The square 
bracket in this term is easily seen to be propor¬ 
tional to the electron mass m t giving the second 
term an extra power of convergence with the re¬ 
sult that it, too, has -1. This completes the 
demonstration that, to order w, all subintegrations 


S 


/ 

F 




FIG. 6. (a) Diagrams in which the electron-positron 
lines emerging from f s and f 5 connect directly with the 
external electron-positron lines entering the kernel K . 
Internal photon lines in K are not shown, (b) Diagrams 
in which the electron-positron lines emerging from r s 
and f 5 do not connect directly with the external electron- 
positron lines entering the kernel K. Again, internal 
photon lines in R are not shown. 


contributing to the right-hand side of Eq. (A3) have 
D ^ -1. By Weinberg's theorem, this implies that, 
to order n, A has the properties (i) and (ii) stated 
above, thereby completing the induction. 

We note that in making the proof we have not as¬ 
sumed the omission of internal photon self-energy 
parts. Our result, that A is finite, is a fortiori 
still valid when this simplification is made. 

APPENDIX B 


We derive here the results quoted in Eq. (41) of 
the text, giving the behavior under gauge transfor¬ 
mation of the quantities F C I, F 2f C 2r a, and y. 
Our starting point is Eq. (40), which we repeat for 
convenience: 


S' F (*, £') = exp{(£' _£)[*.(*)_ A(0)]}s;(*,£), 


M*) = 


ie 0 2 
(2tt) 4 





(Bl) 


Before proceeding with the derivation, we give 
some useful properties of x(#). Letting x be space¬ 
like, and using the symmetrical integration formula 

j dOf-” = JM V) ,/2 (-* 2 ) ,/2 >, 


(B2) 


= Bessel function of order unity), we find the 
following representation for x(%): 

M*) = -f 2 ^ p2 _^2 m 2 ( p 2 Jtip) ■ (B 3) 

From Eqs. (Bl) and (B3) we learn that 

M*) 0 , 


A(z) £3 ln(A Vm 8 ) +0(m 2 /A 2 ) + 0 (* 2 In* 2 ); 

x -*0 47T 

A(*) =- lim HX) - =2* J“ J t (p), (B4) 

m ^ [ ln (- + 2y B - 1] +0(x 2 In x 2 ), 


y B = Euler's constant. 

We begin by deriving the results of Eq. (41a), 
giving the gauge-transformation behavior of the 
renormalization constants and Z 2 . Introducing 
the wave-function renormalization Z 2 U) and the re¬ 
normalized electron propagator Sjr(x, £), 

S^Xft,)=Z 2 (t,Y l S' F (Xft)f (B5) 

we can rewrite Eq. (Bl) in the form 
S^(XfZ')=Z 2 (Z)Z 2 (t')- 1 

xexp{U' - £)[A(*) - A(0)] }§;(*, 0 . 

(B6) 

Let us consider the limit of Eq. (B6) as x — «>. 
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Because we have supplied an infrared cutoff /i, the 
renormalized electron propagator approaches in 
this limit the free electron propagator for physical 
mass m, 


~Sr(x>0 r—QW-jjzrf y.p-n 


(B7) 


Since the right-hand side of Eq. (B7) is independent 
of gauge, we get, using the results of Eq. (B4), 


fU^=exp[-U'-«)*(0)] 


z 2 (0 


yy2\(a 0 /4ff)(c'-S) 


■($ 


(B 8 ) 


Comparing Eq. (B 8 ) with Eq. (22) in the text, we 
learn that 

y'-y = (a 0 /2Tr)U'-Z) i 
Cl /?^ 2 \< a o/4Jr )(£'-0 

c; = U*) 


To get the gauge transformation properties of the 
bare mass we consider the small-# limit of the 
unrenormalized equation, Eq. (Bl), The small-# 
behavior of the unrenormalized position-space pro¬ 
propagator is determined by the large-/) behavior 
of the unrenormalized momentum-space propagator 
by the Fourier-transform relation 


yp- 


-I p*X 


J (2*) 4 v*-»*u)-£(*,«r 

But as p - for fixed cutoff A, we have 
£(/>,£)^ A £' Ix (p° wers of in/> 2 ), 
so we get from Eq. (BIO) 


(BIO) 


(Bll) 


Cd 4 p e’ ip ‘ x 

5; ( ^«)=j( 2jr ) 4 r . / ,_ Wo ( 4 ) + o(/.- I x( ] 


x (powers of in/) 2 )) 


= J W* e + + °(P' a x (Powers of ln/» 2 ))J 

1 y • # i w 0 (£) . , , , 

= 2^2 + 4^2 ~x^~~ +0 ^ X x (P° wers of In* 2 )) . 


(B12) 


Substituting Eq. (B12) into the small-# limit of Eq. (Bl), and using Eq. (B4) for a small-# estimate of A(#) 
- A( 0 ), we get 


=1 

m 0 wioU) 

Comparing with Eq. (22) in the text, we find 

a' = a, ^ = 1. 

C 2 


(B13) 


(B14) 


Next, we derive the results of Eq. (41b), giving the gauge-transformation behavior of the functions F x and 
F 2 appearing in the asymptotic form of the renormalized propagator. To proceed, we need the renormal¬ 
ized version of Eq. (Bl), obtained by eliminating Z 2 (Z)Z 2 (%')~ 1 from Eq. (B 6 ) by use of Eq. (B 8 ) and then 
dropping terms in A(#) which, for fixed #, vanish as A - «>. This gives 

s;(*,«') = exp[u'-ox(*)] §;(*,«). (bis) 

with A(#) given in Eq. (B4). We now take the small-# limit of Eq. (B15), using Eq. (B4) for the small-x be¬ 
havior of A(#) and extracting the small-# behavior of 5j^(#, £) from the large-/) asymptotic form of $ F ~ l (p, €) 
given in Eq. (38), 


~ SF(X ’ O= S0 e ' ,P 'i ( ' P Ffir n r;p + mF 2 C 2 (-p 2 /m 2 )- a/2 + o^., x(powers of j^))] . 

We can evaluate the integrals appearing in Eq. (B15) by using Eq. (B2) and the formula 18 

i dxx " ,J M ) - 2 ~ y r(i + iJ) ' 


(B16) 


(B17) 


Substituting the result into Eq. (B4) and equating the coefficients on left and right of the two most singular 
terms as #- 0, we get the gauge-transformation formulas quoted in Eq. (41b) of the text. 
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We review and extend earlier work dealing with the short-distance behavior of quantum elec¬ 
trodynamics. We show that if the renormalized photon propagator is asymptotically finite, 
then in the limit of zero fermion mass all of the single-fermion-loop 2n-point functions, re¬ 
garded as functions of the coupling constant, must have a common infinite-order zero. In the 
usual class of asymptotically finite solutions introduced by Gell-Mann and Low, the asymp¬ 
totic coupling a 0 is fixed to be this infinite-order zero and the physical coupling a <a 0 is a 
free parameter. We show that if the single-fermion-loop diagrams actually possess the re¬ 
quired infinite-order zero, there is a unique, additional solution in which the physical cou¬ 
pling a is fixed to be the infinite-order zero. We conjecture that this is the solution chosen 
by nature. According to our conjecture, the fine-structure constant is determined by the 
eigenvalue condition F^a) ■ 0, whereis a function related to the single-fermion-loop 
vacuum-polarization diagrams. The eigenvalue condition is independent of the number of 
fundamental fermion species which are assumed to be present. 


I. INTRODUCTION AND SUMMARY 

The fundamental constant regulating all micro¬ 
scopic electronic phenomena, from atomic physics 
to quantum electrodynamics, is the fine-structure 
constant a. Experimentally, the current value 1 
a - 1/(137.03602 ±0.00021) is one of the best de¬ 
termined numbers in physics. Theoretically, the 
reason why nature selects this particular numer¬ 
ical value has remained a mystery, and has pro¬ 
voked much interesting speculation. The specula¬ 
tions may be divided roughly into three general 
types: (a) those in which a is cosmologically de¬ 
termined, either as a cosmological boundary con¬ 
dition (which makes a undeterminable) or as a 
function of time-varying cosmological parameters 
(which makes a a function of time) 2 ; (b) theories 
in which a is a constant which is determined mic¬ 
roscopically through the interplay of the electro¬ 
magnetic interaction with interactions of other 
types, either strong, weak, or gravitational. 3 
Since these interactions are currently even less 
well understood than is the electromagnetic inter¬ 
action, such theories seem at present to offer 
little promise of an actual computation of a; (c) 


finally, theories in which a is microscopically 
determined through properties of the electromag¬ 
netic interaction alone, considered in isolation 
from other interactions. It is this restricted 
class of theories to which we will address our¬ 
selves in the present paper. 

The idea that a may be determined electromag- 
netically is an old one. In the early days of re¬ 
normalization theory there were hopes that a 
could be fixed by requiring the logarithmic diver¬ 
gences appearing in higher orders of perturbation 
theory to cancel or “compensate'* the second-order 
divergence in the photon wave-function renormali¬ 
zation Z 3 , 4 so that the renormalized photon prop¬ 
agator would be asymptotically finite. These hopes 
received a setback, however, when Jost and Lut- 
tinger 5 calculated the order- a 2 logarithmically 
divergent contribution to Z 3 and found that it has 
the same sign as the order-a divergence. Of 
course, it was obvious that the question could not 
be settled by calculations to any finite order of 
perturbation theory. A systematic nonperturbative 
attack on the problem was made by Gell-Mann and 
Low 0 in their classic 1954 paper on renormaliza¬ 
tion-group methods. They showed that there is 
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indeed an eigenvalue condition imposed by requir¬ 
ing that the renormalized photon propagator be¬ 
have as 

ad^-^/w 2 , a)= Oo+M-tf 2 /™ 2 , a), U) 

with a 0 finite and with h vanishing as -c?!n? - «>. 
However, the condition takes the form tp(a 0 )-O t 
and determines the asymptotic coupling a 0 rather 
than the physical coupling a. Their analysis leaves 
a a free parameter of the theory, restricted only 
by the condition a< a 0 coming from spectral-func¬ 
tion positivity. This essential conclusion was re¬ 
tained in the subsequent important work of Johnson, 
Baker, and Willey, 7 who showed that if the eigen¬ 
value condition is satisfied all the renormalization 
constants of electrodynamics (m 0 and Z 2 as well as 
Z 3 ) can be finite, and who applied a simple argu¬ 
ment based on the Federbush-Johnson theorem 0 
to obtain a greatly simplified form of the eigen¬ 
value equation for a 0 . Thus, the prevailing view 
since 1954 has been that it is not possible to de¬ 
termine a within a purely electrodynamical con¬ 
text. 

Our aim in the present paper is to give a reex¬ 
amination and extension of the work of Gell-Mann 
and Lov: and of Johnson, Baker, and Willey, which, 
we believe, reopens the possibility of an electro- 
dynamic determination of a. We continue to work 
within the same basic framework as these previous 
authors in that we assume, as they do, that asymp¬ 
totically vanishing terms encountered in each or¬ 
der of perturbation theory do not sum to give an 
asymptotically dominant result. Our basic obser¬ 
vation is that the work of Johnson et al., assumes 
that a 0 is both a simple zero, and a point of regu¬ 
larity, of the Gell-Mann-Low function tp(y). In 
actual fact, we find that an extension of the argu¬ 
ment given by Baker and Johnson to obtain their 
simplified eigenvalue condition indicates that 
neither of these assumptions is correct. We show 
that if ip has a zero at all it must be a zero of in¬ 
finite order - i.e., an essential singularity. This 
infinite-order zero in the coupling constant must 
also appear in all of the single-fermion-loop 2w- 
point functions calculated in electrodynamics with 
zero fermion mass. The presence of an essential 
singularity has the important consequence that dif¬ 
ferent orders of summing perturbation theory can 
lead to inequivalent forms of the eigenvalue con¬ 
dition. One natural method of summing perturba¬ 
tion theory is to proceed ‘Vacuum-polarization- 
insertion-wise” One first sums all internal-pho¬ 
ton self-energy parts, and then inserts the result¬ 
ing full photon propagators in the vacuum-polariza¬ 
tion skeleton graphs. This order of summation is 
the one used by Johnson, Baker, and Willey, and 
leads naturally to the class of asymptotically finite 
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solutions introduced by Gell-Mann and Low, in 
which a 0 is fixed to be the infinite-order zero and 
a< a 0 is undetermined. An alternative summation 
method is to proceed “loopwise”: One first sums 
all single-fermion-loop vacuum-polarization 
graphs, then one sums all two-fermion-loop vacu¬ 
um-polarization graphs, and so forth. If we as¬ 
sume that the single-fermion-loop 2w-point func¬ 
tions do actually have the infinite-order zero in the 
coupling constant as described above, then by using 
loopwise summation we show that there is a unique 
additional asymptotically finite solution, in which 
the physical coupling ct is fixed to be the infinite- 
order zero. We conjecture that this is the solution 
actually chosen by nature. According to our conr 
jecture , the fine-structure constant a may be com¬ 
puted as follows . Let F [l \y) be the coefficient of 
the logarithmically divergent part of the sum of 
single-fermion-loop vacuum-polarization diagrams 
illustrated in Fig . i. We conjecture thatF^(y) is 
analytic in an interval extending from y = 0 to y- a, 
where it has an infinite-order zero as y approaches 
a from below along the real axis . If the function 
has no infinite-order zero, then the renor¬ 
malized photon propagator cannot be asymptotically 
finite. Our conjecture has the obvious virtue that 
it stands or falls according to the outcome of the 
mathematical problem of calculating the function 
F fl] (y). This problem will be well posed in per¬ 
turbation theory if F [ll (y) is a function of the class 
which is uniquely defined by the coefficients of its 
formal power-series development in y. 9 

The paper is organized as follows. In Sec. II we 
give a review of previous work on the short-dis¬ 
tance behavior of electrodynamics. We derive 
the Gell-Mann-Low equation for the asymptotic 
behavior of the photon propagator, discuss its 
properties, and establish its relation to the recent 
work of Callan and Symanzik. 10 We then review 
the program of Johnson, Baker, and Willey for the 
removal of infinities from electrodynamics. In 
Sec. in we show that the zero of the Gell-Mann- 
Low function must be an essential singularity and 
discuss the implications of this for the conventional 



* finite + (y) x logorithm 


FIG. 1. Sum of single-fermion-loop vacuum-polariza¬ 
tion diagrams which determines the function F^(y), with 
the dependence on the coupling constant y indicated ex¬ 
plicitly. Throughout the paper we will adhere to the con¬ 
vention of using solid lines to denote fermions, wavy 
lines to denote photons. 
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eigenvalue condition on a 0 and for the asymptotic 
behavior of the renormalized electron propagator. 

In Sec. IV we introduce the idea of “loopwise” 
summation and show that, assuming the presence 
of the essential singularity, there is an asymp¬ 
totically finite solution of electrodynamics in which 
a, rather than is fixed to be the infinite order 
zero. In Sec. V we motivate our conjecture that 
nature picks the solution in which a is fixed, and 
we suggest a possible connection of our work with 
a conjecture of Dyson 9 concerning singularities in 
electrodynamics at a = 0. We also point out that 
our conjecture leads to a determination of a which 
is independent of the number of fundamental fer¬ 
mion species, and based on this fact, give a spec¬ 
ulative argument Justifying the neglect of the strong 
interactions in formulating our eigenvalue condi¬ 
tion for a. In Appendix A we give a summary of 
notation, while in Appendix B we derive the Callan- 
Symanzik equations for massive photon (i.e., in¬ 
frared-cutoff) spinor electrodynamics in an arbi¬ 
trary covariant gauge, and briefly sketch the ap¬ 
plication of these equations to deriving the Johnson- 
Baker-Willey asymptotic form for the electron 
propagator. 

II. REVIEW OF PREVIOUS WORK 

We begin with a survey of the papers of Gell- 
Mann and Low, of Callan and Symanzik and of 
Johnson, Baker and Willey dealing with the short 
distance behavior of electrodynamics. Our partic¬ 
ular aim will be to examine the underlying assump¬ 
tions which these authors make and to discuss the 
connections between their approaches. 


(o) 



(b) 


+ 

permutotions 


(c) 




+ 

permutotions 


+ 

permutotions 


+ 


FIG. 2 . (a) Lowest-order vacuum-polarization contri¬ 
bution to it( 4%!, - (b) Vacuum-polarization loops with 
four or more vertices, (c) Vacuum-polarization contri¬ 
butions to 7rft 2 ) which involve the loops with four or more 
vertices illustrated in (b). 


A. Cutoff (Unrenormalized) and Renormalized 
Quantum Electrodynamics 


In order for the renormalization constants and 
the unrenormalized propagators and vertex parts 
to be well-defined, it is necessary to introduce 
cutoffs. In addition to an ultraviolet cutoff A, we 
will eliminate infrared divergences by giving the 
photon a nonzero mass m- The infrared cutoff 
will be needed for the derivation of the Callan- 
Symanzik equations for the electron propagator 
given in Appendix B. Where no infrared diver¬ 
gences are encountered, such as in the discussion 
of the asymptotic photon propagator which occupies 
the bulk of the paper, the photon mass m will be 
set to zero. Specifically, we introduce the cutoffs 
as follows: 

(i) The propagator for a bare photon of four- 
momentum q is given by 



qy \ i -a 2 

g 2 / g 2 - Mo 2 g 2 - A 2 


+*,( 1 - 1 )^ 




-A 2 

q*-A 2 ’ 



with M 0 the bare photon mass and £ a guage param¬ 
eter. The reason for the peculiar choice of the 
longitudinal term in Eq. (2) will become apparent 
very soon. 

(ii) The lowest-order vacuum-polarization con¬ 
tribution to the photon proper self-energy ir(q i ) ilu 
comes from the fermion loop diagram illustrated 
in Fig. 2(a). We calculate this contribution in the 
following manner: First we impose gauge invari¬ 
ance to remove the quadratic divergence, and then 
we regulate the fermion loop, with fermion regu¬ 
lator mass A, to remove the logarithmic diver¬ 
gence. 

(iii) All vacuum polarization loops with four or 
more vertices [see Fig. 2(b)] are calculated by 
imposing gauge invariance, which makes them 
finite. The requirement of gauge-invariant calcu¬ 
lation of fermion loops, together with the photon 
propagator cutoff specified in (i), renders conver¬ 
gent the vacuum polarization contribution to tK^ 2 )^ 
of the type illustrated in Fig. 2(c). The photon 
propagator cutoff also makes finite all electron 
self-energy parts and vertex parts, so our cutoff 
procedure is sufficient to make the unrenormalized 
theory well defined. 

We can now proceed to define renormalization 
constants and renormalized (i.e., A-independent 
in the limit A— «°) w-point functions. The renor¬ 
malized electron propagator and electron-photon 
vertex are introduced in the standard 11 manner; 
we review in detail only the construction of the 
renormalized photon propagator. Since rules (ii) 
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and (iii) guarantee the gauge invariance of the pho¬ 
ton proper self-energy part, we may write 

(3) 


Letting a b denote the canonical (bare) coupling and 
summing the series illustrated in Fig- 3 to get the 
full unrenormalized photon propagator D' F (q) uut we 
get 


D 





+ 


QvQy \ 

<? ) 


X“2-2 

“M 0 2 


_1_ -A 2 

+ <x b (f + 0(q*/A 2 )] <f- A 2 

Qti Qv 1 -A 2 

<? q*-y. z q 2 - A 2 ‘ 


(4) 


We fix the unrenormalized photon mass m 0 2 by re¬ 
quiring Eq. (4) to have a pole at (f- m 2 > i.e., 

M 2 - Mo 2 + 2 ) * 0 . (5) 

We then make the algebraic rearrangement 


tf 2 - Mo 2 + tvrV? 2 ) 

= g 2 - M 2 + ajy iKg 2 ) - mMm 2 )] 

= (g 2 - m 2 )[ 1 + a»ir(M 2 )] + a^^g 2 ) - jKm 2 )] 
= -ZTH? 2 - m 2 + «g 2 [^(g 2 ) - >Km 2 )]} , 

( 6 ) 

which introduces the photon wave-function renor¬ 
malization constant 

Z~ x =i + a b n((j. 2 ) t (7a) 

and the renormalized coupling constant a, 


a=a & -Z 3 = 


l + a b ir(ii 2 ) ' 


(7b) 


Comparing Eq. (7) with Eq. (5), we note that the 
photon bare mass can be reexpressed as 

Mo 2 =£,-v, (8) 


indicating that it is not an independent renormal¬ 
ization constant. To get the full renormalized pho¬ 
ton propagator, we multiply Eq. (4) by a b and let 
the cutoff A become infinite, giving 


- 

-a b TTtq 2 )^ * 

FIG. 3. Series which defines the full unrenormalized 
photon propagator D' F (g ) nv . 


a8p(q)„v = lim cttD'piq)^ 

_/ „ . Qn Q A _<*_ 

\ gvv g 2 /g 2 - ii 2 + acfn c (tf) 

+ a( «- 1) ^F^- (9) 

with 

%(<?)- lim^? 2 )-tt(m 2 )] • (10) 

A-^OO 

We can now see why the longitudinal part of the 
bare propagator had to be chosen as in Eq. (2): 
Because of the transversality of the lon¬ 

gitudinal part of the full propagator [Eq. (4)] is the 
same as the longitudinal part of the bare propagator. 
Therefore, in order for the longitudinal part of the 
renormalized propagator to be finite, the longitu¬ 
dinal part of the bare propagator must become 
finite when multiplied by a & . This dictates the 
overall factor of Z 3t and the use of /i 2 rather than 
M 0 2 in the denominator. The fact that the gauge 
parameter £ always occurs in the combination 
(£ - 1)Z 3 will be of importance in the derivation of 
the Callan-Symanzik equations for the electron 
propagator given in Appendix B. On the other 
hand, the form of the longitudinal terms is irrel¬ 
evant to the subsequent discussion of the Gell- 
Mann-Low and Callan-Symanzik equations for the 
photon propagator, because the photon proper self- 
energy is strictly gauge-invariant (rather than be¬ 
ing merely gauge-covariant, as is the case for 
the electron propagator and the electron-photon 
vertex) and hence receives no contribution from 
the longitudinal terms. 

To conclude this section, we state the special¬ 
ization of Eqs. (7)—(10) to the case of massless- 
photon electrodynamics (m 2 - m o 2 = 0). We have 

<xDf(q)»u = ^ y - — 

+ a(£-l)^5. (11) 

with 

d c (-g 2 /»n 2 , a) = [l + aff c (g 2 )] -1 (12) 

a dimensionless function which contains all the dy¬ 
namical effects of vacuum polarization, and with 
ir c (q 2 ) now given by 

^c(^)* lim - ^(0)]. ( 13 ) 

A-*<*> 

B. The Gell-Mann-Low Equation 

We turn next to a review of the Gell-Mann-Low 
equation, which describes the asymptotic properties 
of the photon propagator. It will be useful to define 
an “asymptotic part” of the renormalized photon 
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propagator, which we denote by ctd/°(-(f/m 2 , a), in 
the following manner: We develop ad c (-(f/m 2 t a), 
in a perturbation expansion in powers of a and in 
each order of perturbation theory drop terms which 
vanish as -q*/m 2 -» while retaining terms which 
are constant or which increase logarithmically. 12 
The resulting sum of constant and logarithmic 
terms is the "asymptotic part” and clearly has the 
form 

ad~(-(?/m 2 , a) = q(a) +p(a)\n(-q 2 /m 2 ) 

+ r(a)ln 2 (-^ 2 /wi 2 ) + • • • . (14) 

Throughout the analysis which follows we will make 
the assumption that the nonasymptotic terms which 
we have neglected in each order of perturbation 
theory do not sum to give a result which dominates 
asymptotically over the logarithmic series in Eq. 
(14). That is, we assume that the asymptotic be¬ 
havior of the “asymptotic part* 9 ctdf 3 correctly de¬ 
scribes the asymptotic behavior of the exact prop¬ 
agator ad c . 13 

To facilitate the derivation of the Gell-Mann- 
Low equation we introduce a notation which explicit¬ 
ly indicates the point where the subtraction in the 
photon proper self-energy is made. Thus, letting 
x = -q*/n?, we write 

ad c [x,w, a]*a{l + a(ir[x] - i^w])}’ 1 , 

7T [x] 2 ir[-m 2 x) = 7 T(q*) . ^ 

In terms of the new notation, the usual renormal¬ 
ized photon propagator is 

d c (x, a) = dj[x , 0, a], (16) 

with the renormalized charge a given by 

a= adJO, 0, a]. (17) 

Let us now imagine that, instead of making expan¬ 
sions in powers of the usual fine-structure con¬ 
stant a, we use as expansion parameter a new 
fine-structure constant a w defined by 

a w ~ cidj[w 9 0, a] = a{l + a(ir[w] - tt[0])} _1 . (18) 


further reference to the point zero. 

Let us now let x and w both become large. Ac¬ 
cording to our earlier discussion, the right-hand 
side of Eq. (20) becomes the logarithmic series 
ad^(x, a). For the left-hand side, we introduce 
the asymptotic assumption that the only dependence 
on x and w, when both are large, is through the 
ratio x/w. An equivalent statement of the asymp¬ 
totic assumption is that when x^-cf/rr? and 
w = -q ,2 /m 2 are both large, the quantity a w d c [x, y , aj, 
regarded as a power series in a w , becomes inde¬ 
pendent of the electron mass m. 14 This assumption 
can actually be justified order-by-order in pertur¬ 
bation theory, either by using the analysis of 
Callan and Symanzik (see below) or by invoking 
the theorem on cancellation of infrared singularities 
of Kinoshita 15 and Lee and Nauenberg. 15 Equation 
(20) now becomes 


[x/w, a w ] = ad"(x, a ), (21) 

where, since w is large, we may rewrite Eq. (18) 
for a w as 

ot w - ctdf(w, a) 

~q(a)+p(a)lttw+r(a)lnhv+ •• * . (22) 

Equation (21) gives a functional relation for d c °°, 
which may be rewritten in a more useful form as 
follows: We introduce the function ip(z) by the def¬ 
inition 



v = l 


(23) 


Differentiating Eq. (21) with respect to x, and then 
setting x = w, we get the differential equation 



Rewriting this as 

dw da w 
w ~</>(aJ ’ 


(24a) 


(24b) 


From the definition of Eq. (15), we may write 

<*wdj[x, w, a J = a w {l + a w (ir[x] - tt[m ;])} _1 

= (ot~ l + tt[x] - tt[w]) ~ l , (19) 

which on substitution of Eq. (18) becomes 

a w d c [x, w, a J ={a _1 + ?r[y] - tt[0] + tt[x] - ir[w]} -» 

= ad c [x, 0, a] = ad c (x, a) . (20) 

On the right-hand side we have the usual photon 
propagator, which involves a subtraction at the 
nonasymptotic point zero; Eq. (20) states that this 
can be reexpressed in terms of the new charge a w 
and the photon propagator subtracted at w , with no 


integrating with respect to w from 1 to x and using 
the boundary condition 

a wL-i = 9(«) = 0K?r(l, 0£) (25) 


we get finally the Gell-Mann-Low equation, 



dz 

ip(z) ' 


(26) 


It is also useful to have the inversion formula 
relating the coefficients q(a), p(a), . .., in the log¬ 
arithmic expansion of Eq. (22) to the Gell-Mann- 
Low function ip(z). To get this, we write z = at x 
= adj°(x, a) and make a Taylor expansion of z with 
respect to lnx. 
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2 = 


00 


£ 


n=0 


(In*)" d n 


n\ d(lrtf)' 


According to Eq. (24), 
be rewritten as 


z (27a) 

In x = 0 

the derivative d/d( lnx) may 


d dz d_ 
d(lnx) d(lnx) dz 



(27b) 


giving the desired formula 6 


ad“(x, a) 

n = o 



(28) 


The function ip(z) spearing in these formulas is 
not explicitly known beyond its expansion to sixth 
order of perturbation theory, which is 18 


tp(z) = z 



+ 


.il _!L 

4tt 2+ 8ji 3 




(29) 


with f(3) the Riemann f function. 

As Gell-Mann and Low have shown, Eq. (26) pro¬ 
vides a powerful tool for analyzing the asymptotic 
behavior of the photon propagator, and leads one 
naturally to distinguish the following two possibili¬ 
ties: (a) The integral fdz/ip(z) in Eq. (26) does not 
diverge until the upper limit becomes infinite. In 
this case ad“(x, a) — °° as x -<», and so the photon 
propagator is asymptotically divergent, (b) For 
some finite value z = a 0 , the function ip(z) develops 
a sufficiently strong zero for f a °dz/ip(z) to di¬ 
verge. In this case ad“(x,a)-a 0 as^-oo and the 
photon propagator is asymptotically finite. We will 
restrict our attention from here on exclusively to 
case (b), for which, as noted in the Introduction, 
we may write 


ad*(x, aHao. (31c) 

In this case the Gell-Mann-Low equation degener¬ 
ates to an integral over an interval of vanishing 
size located at the point where ip ~ l is infinite. 

Type 2. The physical fine-structure constant a 
differs from a x . The coefficients of the logarithmic 
terms in Eq. (28) then do not vanish and ad™(x, a) 
is a nontrivial function of x which approaches a 0 in 
the limit asx-«>. In this case the Gell-Mann-Low 
equation is nondegenerate, with the integral extend¬ 
ing over an interval of finite size, and a is an un¬ 
determined parameter. 

Clearly, as far as behavior of the photon propa¬ 
gator is concerned, the more general class of 
asymptotically finite solutions with type-2 behavior 
is just as satisfactory physically as the solution 
with type-1 behavior. (We will find additional evi¬ 
dence for this statement when we study the asymp¬ 
totic electron propagator below.) Hence following 
Gell-Mann and Low, we conclude that requiring 
asymptotic finiteness of the renormalized photon 
propagator fixes a 0 , but does not determine the 
fine-structure constant a . 

To conclude this discussion of the Gell-Mann- 
Low equation we give a simple, concrete illustra¬ 
tion of type-2 asymptotic behavior. Let us make 
the customary assumption that ip(z) is regular and 
vanishes with a simple zero and negative slope at 
z = a 0 . We ignore the fact that ip(z) also vanishes 
for small z and replace by a linear approxima¬ 
tion over the integration interval in the Gell-Mann- 
Low equation, 

4iz)~ i|)'(oo)(a 0 -«), f(a 0 )<0. (32) 

Then Eq. (26) can be immediately integrated to give 


ad^(x, a) = a 0 +h(x , a), (30) 

lim h(x, a) = 0. 

Within the category of case (b), we wish to fur¬ 
ther distinguish between two different types of pos¬ 
sible asymptotic behavior of the theory: 

Type 1. The physical fine-structure constant a 
is equal to the particular value a t which satisfies 

q(<*i )= 0£o- (31a) 


lnx 


- 1 lu \ ad nbc, a) - aa ] 

^(O£ 0 ) L q{a)-ot Q J ’ 


(33) 


which can be rewritten as 

<*d “(x, a) = otQ + [q(a) - ae^x * Ccto) 


-vi 

n=l 


(34) 


According to Eq. (21), the coefficient of (lnx)" with 
n> 1 is then 



^(0£)) 


! iW'ir- 




We see that the logarithmic series for ad*(x, a) is 
nontrivial, with all powers of In* present, but that 
it sums to a function which approaches a 0 asymp - 
totically. The nonasymptotic pieced, which is 
given in our example by 


= ^(o£ 0 ){ ■ ■ ■} = 0, (31b) 


h(x, a) = [ 9 (a) — a,,]** ( “ 0> , 


(35) 


and the logarithmic series reduces to its constant vanishes asymptotically as a power of x indepen- 
term alone, dently of the value of a. 
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C. The Callan-Symanzik Equation 

A very powerful and elegant method for studying 
the asymptotic behavior of renormalized perturba¬ 
tion theory has recently been developed by Callan 
and Symanzik. 10 We briefly review here the deri¬ 
vation of the Callan-Symanzik equation for the re¬ 
normalized photon propagator, 17 and indicate its 
connection with the Gell-Mann-Low equation dis¬ 
cussed above. Our starting point is the formula 
relating the renormalized and unrenormalized pho¬ 
ton propagators, 

d c (-q z /m z , a) -1 = Z s (\ 2 /m 2 , a)[l + a b ir(q 2 )], (36) 


with the canonical (bare) coupling a b and the cutoff 
A held fixed. Under this variation the bare elec¬ 
tron mass m Q and the physical coupling a both 
change, since the renormalization conditions give 
both of them an implicit dependence on m . Thus, 
insofar as d c and Z 3 are concerned, variation of 
m is described by 




da d 

+ m --— — , 
dm da 


(37) 


while for the bare propagator 1 + a b ir(q 2 ), which 
depends on m only through the mass variation 
is described by 


where we have explicitly indicated the cutoff de¬ 
pendence of Z 3 . Since the photon propagator is 
gauge invariant, the quantities appearing in Eq. 
(36) have no dependence on the gauge parameter £. 
Let us now vary the physical electron mass m, 



= m 


dm a 9 
dm 9 niQ * 


(38) 


Equating the mass variations of the left- and right- 
hand sides of Eq. (36) gives 



+ m 


da 9 \ 
dm 9a/ 





=m-~Zg[l + a b ir\ + Z 3 m + oc b 7r] 


dm 


~ -1 _ d ~ j -i dm Q 9 

= Z, 7n-^— Z,a e + am -r-^ -r — it. 
3 dm 3 c dm 9m 0 


(39) 


The term Bir/dm^ on the right-hand side of Eq. (39) 
is simply interpreted as a photon-photon-scalar 
vertex part, with the scalar current carrying zero 
four-momentum. It can be shown 18 that this ver¬ 
tex part is made finite by multiplication by the re¬ 
normalization constant and so we can write 


g 

m ° B 77 = f )7s( < 7 2/m2 > «) • (40) 

It can be further shown 10 * 18 that the quantities /3(a) 
and 6(a) defined by 


/3(a) 

l + 6(a)=m 0 -»m^»i 0 


(41) 


are cutoff-independent and therefore, as indicated, 
are functions of a alone. Finally, we can relate 
the quantity mda/dm appearing on the left-hand 
side of Eq. (39) to 0(a), as follows: 


m 


da 

dm 


= m 


d 

dm 


(a b Z 2 )-a b m 



= aZ 2 ~bn 



= a/3(a). 


(42) 


\m - l)]d c (-9 2 /a)' 1 

= a[l+5(«)]f Ij! (? ! /»< 1 , a). 

(43) 

Let us now let become infinite. Order by 

order in perturbation theory, the left-hand side of 
Eq. (43) becomes 

3m + ^ a) ( a 3a" 1 )] d "( - 4 2 ' /,n2 > “ ) ~ 1 > ( 44) 

while a simple application of Weinberg’s theorem 19 
shows that, again order by order in perturbation 
theory, the right-hand side of Eq. (43) vanishes. 

So we learn that d c °° satisfies the differential equa¬ 
tion 




a) -1 = 0. 

(45a) 


Interestingly, when we substitute Eq. (37) for 
mdfdm into Eq. (42), we learn 17 that ^(AVm 2 , a) 
satisfies a differential equation identical in form 
to Eq. (45a), 




Z^A 2 /™ 2 , a) = 0. 


(45b) 


Putting everything together we get the Callan- 
Symanzik equation for the photon propagator. 



R29 


375 


3028 


STEPHEN L. ADLER 


5 


For the subsequent discussion, it will be useful to 
reexpress Eq. (45a) as a differential equation for 

4T, 

a Ta + *)] d ~ a) = 0. (46) 

We will now show that Eq, (46) is completely 
equivalent to Eq. (26), the Gell-Mann-Low equa¬ 
tion. Letting x , as before denote -tfVm 2 , we re¬ 
write Eq, (46) in the form 

- 2x J^ +a ^'^ i ad c( X ’ “) = °- ( 47 ) 

This equation has the integral 

ad "( x ’ a )- $ '‘[i nx+ r^f)]’ (48) 

with the function $ determined by the x=l boundary 
condition 

$[ad"(l, a)] = $[g(a)]= J (49) 

and with c an arbitrary constant of integration. 

(The presence of c merely reflects the freedom of 
changing $ by an arbitrary additive constant.) In¬ 
verting Eq. (48), we thus can write 

1 nx = $[ad c °°(x, a)]-$[g(a)] (50) 


where o = 0 £ 1 , we have /3(a) = 0. Equation (47) then 
reduces to 

* 3 ^ <*d c °°(x, a) = 0, (54) 

which has, as expected, the integral 

ad~(x t a) = a 0 . (55) 

Similarly, Eq. (45b) tells us that when /3(a) =0, 
the photon wave-function renormalization Z 3 is 
cutoff-independent. As shown in Appendix B, the 
Callan-Symanzik equation for the renormalized 
electron propagator also has the function /3(a) as 
coefficient of the 9/9a term. Consequently, in 
the case of type-1 asymptotic behavior this equa¬ 
tion also simplifies, and leads, by a simple argu¬ 
ment, 18 to an elementary scaling form for the as¬ 
ymptotic electron propagator. Clearly, in the case 
of type-2 asymptotic behavior we have /3(a) #0 and 
must deal with the Callan-Symanzik equations in 
their full complexity. Even so, we find in Appen¬ 
dix B that the scaling form for the asymptotic elec¬ 
tron propagator still holds, again indicating, as 
asserted above, that there is no reason for favor¬ 
ing the type-1 solution over the more general class 
of asymptotically finite solutions with type-2 as¬ 
ymptotic behavior. 


which, if we write $[w] in the integral form 



ip{z)~[$'{z )]’ 1 , 


(51) 


can be further recast as 


lnx = 



dz 


(52) 


which is just the Gell-Mann-Low equation. Clearly, 
the derivation which we have just given does not 
involve the asymptotic assumption made in the dis¬ 
cussion immediately following Eq. (20); in effect, 
the Callan-Symanzik route to the Gell-Mann-Low 
equation replaces a statement about infrared be¬ 
havior (m independence of a w dj[x, w , a J as m- 0) 
with a statement about ultraviolet behavior (asymp¬ 
totic vanishing of r^) which can be justified in 
perturbation theory by the use of Weinberg’s the¬ 
orem. 

Comparing Eq. (51) with Eq. (49), we can read 
off the following functional relationship between 
the Callan-Symanzik function /3(a) and the Gell- 
Mann-Low function ip(z). 


/Xa)- 


2 ip(q(a)) 
aq'(a) 


(53) 


Thus, in the case of type-1 asymptotic behavior, 


D. The Johnson-Baker-Willey Program 

We conclude our review by surveying the recent 
work of Johnson, Baker, and Willey (JBW) dealing 
with the asymptotic properties of electrodynamics. 
As noted in Sec. I, this work has led to two prin¬ 
cipal results: a simplified form of the Gell-Mann- 
Low eigenvalue condition for a 0 , and a demonstra¬ 
tion that if the eigenvalue condition is satisfied, 
then the electron bare mass m 0 and wave-function 
renormalization constant Z 2 can be finite. We dis¬ 
cuss these two aspects in turn. 

1. Simplified Eigenvalue Condition 

A key ingredient in the JBW formulation of the 
eigenvalue condition is the use of 'Vacuum-polar- 
ization-insertion-wise” summation of the photon 
proper self-energy part n. The basic idea is to 
first write down a modified skeleton expansion for 
the photon proper self-energy in which all diagrams 
with internal vacuum polarization insertions are 
omitted. Some typical diagrams which appear in 
this expansion are illustrated in Fig. 4; note that 
they still contain internal electron self-energy and 
electron-photon vertex parts. The next step is to 
replace all internal photon lines appearing in the 
expansion by full renormalized photon propagators 
<*Df(q)\xv (we indicate explicitly the coupling con¬ 
stant a associated with the ends of the photon line). 
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FIG. 4. Typical diagrams which appear in the modified 
skeleton expansion for the photon proper self-energy part 
ir. All proper diagrams are included which do not have 
internal photon self-energy insertions. Diagrams (a) 
have a single fermion loop, while those labeled (b) con¬ 
tain two or more fermion loops. 

This recipe leads to a “vacuum-polarization-in¬ 
sertion-wise” summed expression for the photon 
proper self-energy tt, which, it is easy to see, 
correctly includes all of the relevant Feynman 
diagrams. 

We next introduce the assumption that the renor¬ 
malized photon propagator is asymptotically finite, 
which allows us to write it in the form 

+ h(-q 2 /ni l , a)] 

+ aU-l)ffi, (56) 

with h vanishing asymptotically. In order for this 
assumption to be self-consistent, we require that 
the renormalized photon proper self-energy part 
7T C , obtained by inserting Eq. (56) in the skeleton 
expansion as outlined above, must itself be asymp¬ 
totically finite. That is, no powers of In (-(f/n?) 
may be present in the asymptotic behavior of tt c . 

To determine the asymptotic properties of tt c , we 
must consider each contributing graph (or more 
exactly, each gauge-invariant set of graphs related 
by permutation of photon vertices) and examine 
the convergence properties of both the over-all in¬ 
tegration, involving all lines in the graph, and the 
subintegrations involving subsets of these lines. 

In doing this we maintain our “vacuum-polarization- 
insertion-wise” summation by treating internal 
photon propagators as complete entities, described 
by Eq, (56), rather than also breaking these up 
into contributing graphs. By our assumption of 
Eq. (56), the internal photon propagators cannot 
give rise to any logarithmic terms. Subintegrations 
associated with electron-self-energy and electron- 
photon vertex parts also lead to no logarithms, be¬ 
cause as shown in Sec. HD 2, there is a gauge (the 
Landau gauge) which makes these asymptotically 
finite. It can be shown 7 ' 18 that there are no other 
troublesome subintegrations; hence logarithmic 
behavior of a graph contributing to tt c can only be 
associated with the over-all integration involving 


all lines in the graph. Referring to Eq. (56), we 
see that each internal photon line in the over-all 
integration contributes two parts, a part propor¬ 
tional to a 0 and a part proportional to h. As we 
have seen in Eqs. (32)—(35) above, the conventional 
assumptions about the nature of the zero of ip(z) 
imply that h decreases as a power of -(f/m 2 as 
°°. As a result, any contribution to the 
over-all integration involving one or more factors 
h converges, and leads to an asymptotically finite 
contribution to tt c . Hence the asymptotically loga¬ 
rithmic part of 7 t c is correctly obtained by neglect¬ 
ing h in each internal photon propagator, so that 
Eq. (56) becomes 

a Dp(q)vv s -Sw -^+gauge term. (57) 

Thus, we are led to a simplified model for 7 t c (the 
so-called JBW model) in which no internal photon 
self-energy insertions appear; all internal pho¬ 
tons are described by free propagators coupling 
with the asymptotic coupling strength a 0 . An anal¬ 
ysis 7,18 of the asymptotic behavior of tt c in this 
model shows that a single logarithm is present 
(corresponding to the fact that a single subtraction 
suffices to make the over-all integration converge), 
so we get finally 

*c /v, g(a 0 ) + /(a 0 )ln(-?y»n 2 ). (58) 

Self-consistency of the assumption of asymptotic 
finiteness of tt c now requires 

/K) = 0, (59) 

which is the JBW form of the eigenvalue condition. 
Let us reiterate that Eq. (59) does not involve all 
vacuum polarization graphs [as does the Gell- 
Mann-Low eigenvalue condition ip(a Q ) =0] but rather 
only the restricted class, illustrated in Fig. 4, 
which have no internal photon self-energy inser¬ 
tions. 

Implicit in the derivation of Eq. (59) are rather 
stringent convergence assumptions. These arise 
because the argument leading to Eq. (59) involves 
replacing the limit of an infinite sum [the exact 
t is an infinite sum of skeleton graphs with 
photon self-energy insertions] by the sum of the 
limits of the individual terms, [Eq. (58) is the sum 
of skeletons with the photon self-energy insertions 
replaced by their asymptotic limits.] A necessary, 
but by no means sufficient, condition for the inter¬ 
change of limit with sum to be valid is that the re¬ 
sulting series /(a 0 ) be convergent. This fact will 
be of importance in the discussion of “loopwise” 
summation given in Sec. IV below. 

In their recent papers, 7 Baker and Johnson have 
extended in two respects the treatment of the eigen- 
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value equation sketched above: First, they have 
shown that /(a 0 ) = 0 implies that a 0 is also a zero 
of the Gell-Mann-Low function ip(y), and secondly, 
they have shown (again assuming “vacuum-polar- 
ization-insertion-wise” summation) that Eq. (59) 
can be replaced by the much simpler eigenvalue 
condition 

F [ll (a 0 )=0, (60) 

where F [11 (y) is the single fermion loop part of f{y) 
introduced in Sec. I. The first assertion is proved 
by an argument (which we omit) based on properties 
of the modified skeleton expansion, showing that 
ip(y) and f(y ) are functionally related, 

n= i 

= Ay)c 1 (y)+f 2 (y)c 2 (y) + -- •. (61) 

Hence a zero of / is necessarily a zero of ip. The 
second assertion follows from a simple argument 
based on the Federbush-Johnson theorem; we give 
details in the case, since the results are central 
to the discussion of Sec. ID below. We assume that 
the Gell-Mann-Low eigenvalue equation ^(y) = 0 has 
a solution y- so that the renormalized photon 
propagator takes the form of Eq. (1). If we now 
let the electron mass m approach zero, we learn 
from Eq. (1) that the renormalized photon propa¬ 
gator d c approaches its asymptotic value a 0 for 
any ^“*0. This means that in a theory of massless 
spin-% electrodynamics satisfying the eigenvalue 
condition, the full renormalized photon propagator 
is exactly equal to the free photon propagator, with 
coupling constant a 0 . Consequently, the absorptive 
part of the photon proper self-energy vanishes; 
i.e., we have 

<0| (Afy„(y)|0> = 0, (62) 

where is the electromagnetic current operator. 
By exploiting positivity of the absorptive part of 
the full photon propagator, Federbush and John¬ 
son 8 * 20 have shown that the vanishing of the two- 
point function in Eq. (62) implies that j M (x) annihi¬ 
lates the vacuum, and hence the general 2w-point 
current correlation function vanishes as well, 

<0|T[j Mi (x 1 )j Ma (x 2 )---; (Ian (x 2o )]|0> = 0, n» 2. (63) 

Equation (63) is the essential tool which allows us 
to simplify the eigenvalue condition. Let us take 
the difference between the photon self-energy part 
7 t c evaluated at four-momenta (f and q t2 . Since the 
full photon propagator is equal to the free photon 
propagator in the massless theory, this difference 
may be calculated from the skeleton diagrams of 
Fig. 4, and according to Eq. (58) is given by 

- Tr c (q f2 ) = A<x 0 )M(f/q' 2 ). (64) 


The contributions to Eq. (64) may be divided into 
two basic types: those containing a single closed 
fermion loop [Fig. 4(a)] and those containing two 
or more closed fermion loops [Fig. 4(b)]. The sum 
of contributions of the second type can be recast 
as a sum involving current correlation functions 
which have been linked together by photon lines, 
and therefore vanishes by Eq. (63). Thus, the 
vanishing of the logarithmic term in Eq. (64) im¬ 
plies that the sum of contributions of the first type 
must vanish by itself, which gives the simplified 
eigenvalue condition 

F [ll (a„) = 0. (65) 

Clearly, the same argument applied to Eq. (63) 
shows that the sum of single closed fermion loop 
contributions to the general 2n-point current cor¬ 
relation function («» 2) must vanish by itself when 
the coupling is a 0 and the fermion is massless, a 
result which will be of great utility in the next sec¬ 
tion. We stress in closing that the powerful results 
which we have just described are consequences of 
positivity of the spectral function of the photon 
propagator. In particular, since the single closed 
fermion loop contributions to tt c do not by them¬ 
selves have a positive spectral function, the meth¬ 
ods which we have used cannot be used to prove the 
converse result that a zero of is necessarily 

a zero of f(y) and ^(y). 21 


2. Asymptotic Electron Propagator and 
Finiteness of andm 0 


To analyze the asymptotic electron propagator, 
JBW employ the simplified model described above, 
in which the asymptotically vanishing part h of the 
photon propagator is neglected. Each internal pho¬ 
ton is thus represented by a free propagator, cou¬ 
pling with the asymptotic coupling strength a 0 . In 
this model it is straightforward to determine the 
asymptotic behavior of the renormalized electron 
propagator and of the renormalization constants 
Z 2 andm^ either by using renormalization group 
methods 7 or by use of the Callan-Symanzik equa¬ 
tion, 18 with the results 

/ a 2 \t(«i)/2 

S ^)- 1 ^ aM-$s) 

p — m \ m / 


X 



mFAaJC^/m 2 , a L ) 



= Oj) 


> 


( 66 ) 


/ *2\-6(a 1 )/2 

m o~ C 2 (ti 2 /m 2 J 

In writing Eq. (66) we have used the fact that in the 
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mapping q(a) - a, and so Eq. (30) tells us that CONSEQUENCES 


ct 0 = q(a 1 )~a 1 . (67a) 

The function Sfaj) is defined in Eq. (41), while the 
definition of ^( 0 ^) is given in Appendix B. The 
transformation properties of Eq. (17) under changes 
in the gauge parameter £ can be explicitly worked 
out , 18 and for the exponents y and 6 we find (primed 
quantities refer to gauge parameter £', unprimed 
to gauge parameter £) 


6 '- 6 = 0 . 


(67b) 


Thus, if we choose £' to satisfy 

(aJ2vM' - 0+r(«i, £) = o, 

then we have y* »y(a lf £0 = 0 and the electron wave 
function renormalization Z f 2 remains finite as 
A—°°. Furthermore, if6(a 1 )>0, the electron bare 
mass m 0 vanishes in the limit of infinite cutoff, in¬ 
dicating that the physical mass of the electron is 
entirely electromagnetic in origin. The apparent 
logarithmic divergence of in perturbation theory 
results only when 

m o~ C 2 (m 2 / w|2 i a^expf-i^a^lnfA 2 /"* 2 )] (68) 

is expanded in a power series in a 2 = a 0 and illegal¬ 
ly truncated at a finite order. Thus, in the model 
with the photon propagator replaced by its finite 
asymptotic part, all perturbation theory infinities 
can be eliminated, provided only that 6 ( 0 !!) > 0 . 

A little caution is required, however, in applying 
the results of the JBW model to the full theory, 
where the photon propagator contains the nonas- 
ymptotic piece h in addition to the asymptotic part 
ct 0 . Because the renormalization counterterms 
which are subtracted in going from the unrenor¬ 
malized to the renormalized electron propagator 
are evaluated at the nonasymptotic four-momentum 
P=m, it is easy to see that h makes a nonvanishing 
contribution to the asymptotic renormalized elec¬ 
tron propagator. Thus Eq. ( 66 ) does not necessarily 
apply to the full theory. In Appendix B we analyze 
the effect of h on the asymptotic behavior of 
Spipy 1 . Assuming that h vanishes asymptotically 
as a power of we find that the form of Eq. 

( 66 ) and of the exponents y and 6 are unaltered, 
all of the effects of h being confined to changes in 
the constants C x and C 2 . 22 Hence, when h vanishes 
as a power, the conclusions obtained from the 
JBW model regarding the finiteness of Z 2 and m 0 
apply to the full theory as well. 


We continue in the present section to work with 
the “vacuum-polarization-insertion-wise” summa¬ 
tion scheme described above in Sec. IID1. We 
show that the argument leading to the simplified 
eigenvalue condition of Eq. (65) has the further im¬ 
plication that F [lI (y) vanishes at y= a 0 with a zero 
of infinite order, i.e., an essential singularity. 

We find that as a result, the nonasymptotic piece 
h of the photon propagator vanishes asymptotically 
much more slowly than any power of -tfVm 2 , and 
we discuss consequences of this both for the eigen¬ 
value condition and for the asymptotic behavior of 
the electron propagator. 

A. Existence of an Essential Singularity 

Since our argument makes extensive use of the 
properties of the single-fermion-loop 2w-point 
functions, we begin by introducing a compact no¬ 
tation for these. Let us denote the sum of single¬ 
fermion-loop contributions to the photon proper 
self-energy by 

m, y), (69) 

where we have explicitly indicated the dependence 
on the fermion mass m and on the coupling con¬ 
stant y. The series of diagrams defining tt^ 11 has, 
of course, already been exhibited in Fig. 1. Ac¬ 
cording to the results of Sec. IID, when -^/m 2 
approaches infinity 77* 11 has the asymptotic be¬ 
havior 

TT [ c l] ((f; m, y) = G [l] (y) + F lll (y)\n(-tj*/m 2 ) 

+vanishing terms, (70) 

and the assumption that the Gell-Mann-Low func¬ 
tion ip vanishes at y= a 0 implies that the coefficient 
of the logarithm in Eq. (70) also vanishes for this 
value of the coupling, 

F [ll (a,)«=0. (71) 

Let us next denote the sum of single-fermion-loop 
contributions to the general 2«-point current cor¬ 
relation function (w£ 2) by 

■ • P2n^ 11 " " " * ^2n 5 ***» » 

7 1 +--- + 72n = 0; (72) 

the series of diagrams defining T^ 11 is shown in 
Fig. 5. In each order in the power series expan¬ 
sion in y, all distinct permutations of external and 
internal photon vertices are included in T^ 11 . As a 

result, T^ 11 is independent of the gauge parameter 
£ appearing in the internal photon propagators and 
satisfies current conservation with respect to the 
external photon indices, 
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FIG. 5, Sum of single-fermion-loop diagrams which 
defines the 2n-point function 7^ appearing in Eq, (72), 
with the dependence on the coupling constant y indicated 
explicitly. 



.(il 


Man" ?2 2 7V 

~ <7 2n n 7’uJ • • u = 0 ■ 


(73) 


As was shown in Sec. nD, when the fermion mass 
m is zero and when y is equal to a# the general 
2w-point current correlation function vanishes, 


T rii 

1 Mi* 



9 Qirtf ^o) ” 0 • 


(74) 


Finally, let us define a modified two-point function 

(75) 

by the procedure of linking 2w - 2 = 2(w - 1) external 
vertices of the general 2w-point function with n - 1 
free photon propagators and integrating over the 
four-momenta carried by these propagators, thus 
leaving a vacuum-polarization-like tensor of sec¬ 
ond rank. Because we have enforced current con¬ 
servation [Eq. (73)] and because there are no pho¬ 
ton self-energy insertions, this second-rank tensor 
has only an over-all logarithmic divergence which 
can be made finite by a single subtraction. A sim¬ 
ple way to perform the subtraction is to use the 
usual Pauli-Villars procedure of taking the dif¬ 
ference of Eq. (75) for two distinct values of the 
fermion mass, giving the finite expression 


/ , d 4 a d 4 a / / »>M2rt-3M2rt-2\ 

(2^- (2^\ - TTj A gn-I 2 -/ 


• * M2n-3M2n-2M*'^I’ * * * * Qn-lf Qf *91 y ) 

— • • »2n~3^2n~2^ v ^ lf • * * * Qn-lt ~Qn~lt 7 * y)] ■ 

(76) 


For the sake of compactness in writing the internal 
photon propagators, we have restricted ourselves 
to the Feynman gauge, a convention to which we 
will adhere henceforth. 

Our next step is to establish the following funda¬ 
mental identity 23 relating the modified two-point 
function to a derivative of the photon proper 
self-energy part uj 11 , 

y) - y )] 

= m, y) - Antf-,™', y) ■ 


(77) 

To prove Eq. (77), we develop the right-hand side 
and the bracket on the left-hand side in power 
series expansions in y, 


y ) - y) 


= S AJl 

J-0 

(78) 


y) - irlhHq*; m', y) = £ Aai! A<f', m,m'), 

i =0 

so that Eq. (77) asserts that 

2 n-i O'+« 1)1 7 T^ 1 J+It _ 1 (^ 2 ; m, m') = /g 2 -, m, m'). 

1 (79) 

To verify Eq. (79), we proceed in two steps: First, 


we show that the functions on the left- and right- 
hand side are the same, apart from a multiplicative 
constant, and then we give a simple combinatoric 
argument to show that this constant is in fact 
2 n ‘ 1 0’ +W -l)l/j!. 

To prove the first assertion, we refer to Fig. 1 
defining t^ 17 as a power series in y. We see that 
^cli+n-iW^w') is just the sum of all distinct 
single fermion loop vacuum polarization contribu¬ 
tions containing exactly j+n-1 internal virtual 
photons (with the logarithmic divergence eliminated 
by taking the difference of expressions with fermion 
masses m and m f ). Next, we refer to Fig. 5 and 
Eq. (76) which respectively define and 
Since the y dependence of nJi 1 comes entirely from 
Tt 1 , we see that j contains j internal virtual 
photons (the ones which appear in the yf term of 
T^J 1 ) plus the n -1 additional virtual photons inserted 
by the definition of Eq. (76), or a total of j + n -1 
in all. Thus is also a sum of (mass 

differenced) single fermion loop vacuum polariza¬ 
tion diagrams containing exactly j +n - 1 internal 
virtual photons. Furthermore, it is readily seen 
that all of the relevant diagrams appear in the sum 
with equal weight because is completely sym¬ 
metric in the variables of the 2w external photons. 
Hence 7must be a multiple of 4!>+„_!, the con¬ 
stant of proportionality K reflecting the fact that 
in obtaining the two-point function by linking 2w - 2 
external vertices of the 2«-point function, there 
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will be multiple counting and each relevant dia¬ 
gram of the two-point function will appear many 
times. 

To complete the derivation of Eq. (79) we must 
calculate the proportionality constant. This is 
easily done by noting that 




(80) 


the numerator and denominator in Eq. (8o) being 
the total number of distinct Feynman graphs ap¬ 
pearing in j and in respectively. Let 

us define N 2ntJ to be the total number of distinct 
Feynman graphs with j internal virtual photons 
which contribute to the single fermion loop 2«-point 
function. Then from the definitions given above 
we clearly have 

tf/ju \. N ( 8l > 

The combinatorics of calculating N 2rttj goes as 
follows. We hold one external vertex fixed on the 
fermion loop to define a starting point. There are 
then (2w + 2j -1)1 diagrams obtained by permuting 
the remaining 2w -1 external vertices and the 2 j 
vertices which terminate internal photon lines. 
However, diagrams obtained by permuting any of 
the j internal photon lines, or interchanging the 
ends of any of these lines, are identical, and so 
we must divide by a factor of 2 *jl to get the num¬ 
ber of distinct Feynman diagrams. Thus we get 24 


'2 n, j 


and hence 


_ (2« + 2j- 1)! 


2 J j\ 


(82) 


K = 


(2n + 2j - 1)1 / f 2 + 2(j + « - 1) - 1)1 


2 } ji / 5 

= 2"- 1 0' + n-l)l/7l, 


U+ii-i 


(j+»- 1)1 


(83) 


completing the proof of Eq. (77). 

We now have all the apparatus needed to show 
the existence of an essential singularity. Let us 
take the limit m,m f ~ 0 in Eq. (77), with m/m* and 
tf fixed and with y- a 0 . The left-hand side can 
be evaluated from the asymptotic expression in 
Eq. (70), giving 




In (m' 2 /m 2 ). 


(84) 




To evaluate the limit of the right-hand side, we 
refer to the definition of 

^'(g 2 ;**, y) - ^’(g 2 ; m’, y) 

given in Eq. (76). We would like to be able to inter¬ 
change the subtraction in the square bracket on the 
right-hand side with the integrations, giving 

[^'(g 2 ;™, y) - ^'(g 2 ; *»', y)](-<fg„ v +q»q v )=i m -i m -, 

(85) 

with 


= f 

J (2ff) 4 


/ / z - g "2n-3M2n-2 \ m 

(277) 4 ^ 9l 2 J {- q n _? j'w 


> , 2n-3> i 2n-2i iv ^ 1 ’ • • • > 9.-H -g.-i, q> ~q‘> rn > y) 

( 86 ) 


For general values of y t this interchange is not allowed, because I m is a logarithmically divergent integral 
of the general type 


J o P +m 

and hence the right-hand side of Eq. (85) is an ambiguous expression of the form 00 
ever, the situation is different, because Eq. (74) tells us that 

* " M 2 „- 3 M 2 rt—• • • i Qn-u a Qi ~Q\ 8, 0!,,) = 0 
and consequently 

' * M 2 n— 3 ^2n—2^~7ir • • • » Qn-li ~Qn- l» Qi ~Qf a o) 


(87) 


- 00 . When y = Oq, how- 


( 88 ) 


(89) 


is proportional to m 225 As a result, the conver¬ 
gence of Eq. (86) is improved by two powers of 
momentum over what it is for general values of y f 
and hence when y = a or l m becomes a convergent 
integral of the type 26 

f°° cnPdp 


f crrrdp 

Jo (p+m^p + cm 2 ) ' 


The interchange in Eq. (85) is now legal, 27 and 
taking the limit m f m* - 0 gives 

Um [7T^ 1 (g 2 ;m,}i)-4i 1 (g 2 ;>n', y)](-g s g(1 „ + 9 ft,) 
fn j nt ^ 0 
"i/ix' fixed 


= lim/_ - 


m— o 


lim I m > - 0. 

m o 


(91) 


(90) 
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Substituting Eqs. (91) and (84) into Eq. (77) we get, 
finally, the fundamental result 




= 0 , 2 . 


(92) 


y=cc, 


It is important to note that Eq. (88) does not imply 
the stronger result 


lim/ m = 0, (93) 


as is readily seen by taking the ra - 0 limit of the 
specific example in Eq. (90), 


lim 

m-+ 0 



cm 2 dp 

(p +m 2 )(p + cm 2 ) 


clnc 
c — 1 * 


(94) 


Thus, our argument gives us no information about 
G Cll (y), the fermion-mass independent part of the 
asymptotic expression for y) given in 

Eq. (70). 

To summarize, we have learned that the function 
F fll (y) and all of its derivatives are zero at the 
point y - a 0 , where a 0 is the zero of the Gell- 
Mann-Low function ip(y). In other words, van¬ 
ishes with an essential singularity at a 0 . It is 
clear that a similar argument can be applied to the 
general 2n-point function t£], by using an identity, 
analogous to Eq. (77), which relates the derivative 
d m T[]}/dy m to an integral over the (2« + 2m)-point 
function Thus we additionally learn that 

when the fermion mass m is zero, the single fer¬ 
mion loop 2n-point function and all of its y deriva¬ 
tives also vanish at a 0 . This fact, together with 
our result for F fl1 , gives us information about all 
of the loop diagrams appearing in the modified 
skeleton expansion for /(y), from which we learn 
that / also has an infinite order zero at a 0 . Final¬ 
ly, referring to Eq. (61), we conclude that the 
Gell-Mann-Low function ^(y) vanishes with an es¬ 
sential singularity at y = a 0 . 28 In Fig. 6 we sum¬ 
marize the complete chain of reasoning which we 
have used. Clearly, our conclusion shows that the 
customary assumption, that a 0 is a simple zero 
and a point of regularity of ip, is in fact incorrect. 


B. Asymptotic Behavior of h 


gy vanishes with power law behavior, 

h~ x ^<*o) f (95) 

as x--q 2 fm 2 becomes infinite. Now that we know 
that ip actually vanishes with an essential singular¬ 
ity, and not with a simple zero, we must reex¬ 
amine the reasoning leading to Eq. (95). We give 
first a general, qualitative argument to show how 
Eq. (95) must be modified. Let us use the Gell- 
Mann-Low equation in the form of Eqs. (50) and 
(51), 

\nx=$[ad*(x y a)]- $[g(a)] , 

(96) 

If ip has a zero at z - a„, then $[u] becomes infinite 
at u- a 0 , and hence the large- x behavior of ad* is 
governed by the behavior of $ in the vicinity of a„. 
Now if ip(z) vanishes more rapidly than a 0 - z as 
z-a 0i then v = $[u\ will become infinite faster 
than ln(a 0 -tt) as M-a 0 . This implies that ^[v] 

- a 0 is a function which is weaker than an exponen¬ 
tial a.3 v - or equivalently, 

ad*(x t a) -a 0 = *“ J [lnx] - a 0 (97) 

is weaker than a power law as x- °°. So we obtain 
the qualitative conclusion that if jp vanishes more 
rapidly than with a simple zero as z - a„, h(x t a) 
will decrease more slowly than a power law as x 

_ oo 

To obtain more specifically the connection be¬ 
tween the functional form of ip near a 0 and that of 
h near x = °°, we resort to the study of exactly in¬ 
tegrate examples. As in the discussion of Eqs. 
(32)—(35), in constructing these examples we can 
ignore the fact that ip(z) vanishes at z = 0, since 
this region is not relevant to the asymptotic be¬ 
havior of h . As our first illustration, we consider 
the case where ip vanishes with a zero of finite 
order higher than the first. Substituting 

4i(z) = A(a 0 - z y+ € (98) 

into the Gell-Mann-Low equation [Eq. (26)] and 
integrating, we get 


As we have seen in Sec. IIB, the customary as¬ 
sumption about the zero of jp implies that the non- 
asymptotic piece h of the photon proper self-ener- 


ad* - a n - 


q(a)-ct„ 


(99) 


^( a 0 )=0 


General 2n-point 
function(n-2) * 
vonishesat y*a 0 , 
m=0 



(a o )=0 

=0 

y=a 0 

m=0 


F t,] (a o )=0* 


I? 

2 n 


y =a o 

m*0 


= 0 " 


•f (a D )= 


FIG. 6. Chain of reasoning which summarizes the discussion of Secs. IID1 and in A. The abbreviation 0*° 
denotes a zero of infinite order (i.e., an essential singularity) in the y variable. 
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which, as expected, falls off more slowly than any 
power of x in the limit x — 00 . As our second illus¬ 
tration, we study the case where ip vanishes with 
an essential singularity of the form 

. ( 100 ) 

To get an exactly integrable expression we multi¬ 
ply Eq. (100) by a power of a 0 - z> giving 

4>(z) = e~ A /la o-‘ )f . (101) 

ABp 

Substituting Eq. (101) into Eq. (26) and doing the z 
integration, we get 

-A l/P 

ad c - a 0 - (i n [ s - 1 i nA:+ exp\A/[a 0 - g(ajp}]) i7 * 

^ (\n\nx)- 1/p . (102) 

We see that when ip vanishes with an essential sin¬ 
gularity at a 0 , the asymptotic vanishing of h in the 
limit of large x is very slow indeed. In Table I we 
summarize the connection between the type of zero 
of ip and the asymptotic behavior of h that we have 
inferred from our examples. 29 


C. Consequences of the Slow Decrease of h 


In both the justification of the JBW form of the 
eigenvalue condition [Eq. (59) and the discussion 
preceding it in Sec. HD] and the derivation of the 
scaling form of the asymptotic electron propaga¬ 
tor [Appendix B] we assume that a 0 is a simple 
zero, and a point of regularity, of the Gell-Mann- 
Low function ip , and that h decreases asymptotical¬ 
ly with power-law behavior. Now that we have 
seen that these assumptions are false, we must 
reexamine our treatment of the eigenvalue condi¬ 
tion and of the asymptotic behavior of the electron 
propagator, to study the consequences of the es¬ 
sential singularity which we have found in ip and of 
the concomitant very slow asymptotic decrease of 
h . For the sake of definiteness, we will assume 
behavior as in Eqs. (101) and (102) with p~ 1, so 
that h decreases asymptotically as 


lnln (-q 2 /m 2 ) 


(103) 


This restriction, while convenient to make, is not 
crucial to the discussion which follows. 

Let us first reconsider the eigenvalue condition, 
picking up our discussion of Sec. HD at the point 
where we established that logarithmic behavior of 
a graph contributing to tt c can only be associated 
with the over-all integration involving all lines in 
the graph. As we noted, each internal photon line 
in the over-all integration contributes two parts, a 
part proportional to a 0 and a part proportional to 
h . Let us separately group together all contribu¬ 
tions to v c involving no factors of A, all contribu¬ 
tions involving exactly one factor of h , all those 
involving exactly two factors of h , etc., as indi¬ 
cated in Fig. 7. The shaded blobs in Fig. 7, to 
which the insertions of h are attached, are two- 
point, four-point, six-point, etc. functions cal¬ 
culated with all internal photons described by free 
propagators coupling with the asymptotic coupling 
strength a 0 . The piece with no factors of h is just 
the one retained in our earlier discussion, which, 
as we have seen, makes the contribution 

g(ot 0 ) + f (a 0 )\n(-q 2 /m 2 ) (104) 

to the asymptotic behavior of tt c . Heuristically 
speaking, the logarithm in Eq. (104) can be thought 
of as arising from the integral 

(105) 

Jm 2 P 


in this language, the leading asymptotic behavior 
of the piece of 7 t c containing n factors of h corre¬ 
sponds to the integral 



(106) 


When h vanishes as a power of p for large p, the 
integral in Eq. (106) converges at the upper limit 
as -q 2 /m 2 — 00 . Asymptotic finiteness of tt c then 
only requires the vanishing of the coefficient of 
the integral in Eq. (105), giving the JBW condition 
/(o£ 0 ) = 0. When h vanishes much more slowly than 
a power law, as in Eq. (103), the situation is rad¬ 
ically changed. 30 The integral in Eq. (106) is now 



_ d p 

p[lnln(p/»n 2 )]" ’ 


(107) 


TABLE I. Connection between behavior of near 
z = a 0 and behavior of h (x, a ) near x * *>. 


Behavior of p near a 0 

Asymptotic behavior of h 


x*' 

(“ o - * ) 1+£ 

(lnx) _1/t 


(lnlnx) _1/ * 



FIG. 7. Grouping of ir c into contributions involving 
no factor h, exactly one factor h f exactly two factors 
h, etc. The shaded blobs denote two-point, four-point, 
six-point, etc. functions calculated with all internal 
photons described by free propagators coupling with 
the asymptotic coupling strengtha 0 . 
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which for all n is divergent at the upper limit as 
-q 2 /m 2 -*>. Thus, asymptotic finiteness of ir c re¬ 
quires now that an infinite number of conditions be 
satisfied: in addition to the coefficient of Eq. (105) 
vanishing, the coefficient of the contribution rep¬ 
resented heuristically by Eq. (107) must vanish 
for all n. It is remarkable that when a 0 is chosen 
to be the root of f(ot 0 ) = 0, this infinity of conditions 
is in fact satisfied. The reason is the argument 
based on the Federbush-Johnson theorem given in 
Eqs. (62)-(63) of Sec. IID1, which shows that 
when f(a 0 ) = 0 and the fermion mass m is zero, the 
general 2«-point current correlation function van¬ 
ishes for w^2. Hence when a 0 satisfies /(a 0 ) = 0, 
each shaded blob in Fig. 7 is proportional to m 2 
and therefore contributes a convergence factor 
m 2 /p to the integral in Eq. (107). The integral 
then becomes 25 

C '" 2 __ ( 10 8 ) 

J m 2 p^lnlnipM 2 )]”’ {1 0) 

which is asymptotically finite as — q 2 /m 2 —The 
asymptotically divergent integral in Eq. (107) of 
course reappears when a 0 is chosen to have any 
value other than the root of /(a 0 ) = 0. We conclude, 
then, that Eq. (103) still permits one to deduce the 
JBW eigenvalue condition/(a„) = 0, but only by a 
more involved mechanism than is required in the 
case of a power law vanishing of h . 

Let us next examine the implications of the es¬ 
sential singularity at a 0 and of Eq. (103) for the 
argument leading to the scaling form for the as¬ 
ymptotic electron propagator. As we have noted, 
the approach used to derive the scaling form in 
Appendix B depends very specifically on the as¬ 
sumptions of regularity of the theory in the vicin¬ 
ity of a 0 and power law vanishing of h. To deal 
with the situation where a 0 is a point of essential 
singularity, we give an alternative approach, 
based on reasoning similar to that which we have 
just used In our discussion of the eigenvalue con¬ 
dition. Let us consider the unrenormalized elec¬ 
tron propagator S^)" 1 in the limit in which -/^ 
and the cutoff A 2 are both becoming infinite rela¬ 
tive to the fermion mass m z . To study this, we 
collect together all contributions to the electron 
proper self-energy involving no factors of h , in¬ 
volving exactly one factor of h> exactly two factors 
of h t etc., as shown in Fig. 8. As before, the 
shaded blobs are calculated with all internal pho¬ 
tons described by free propagators coupling with 
the asymptotic coupling strength a 0 . The piece 
with no factors of h is Just the unrenormalized 
electron proper self-energy in the JBW model. A 
straightforward analysis 81 using the methods of 
Ref. 17 shows that if this piece alone is retained, 
the unrenormalized asymptotic electron propaga- 



FIG. 8. Grouping of the electron proper self-energy 
into contributions involving no factors h, exactly one 
factor h, exactly two factors h, etc. The shaded blobs 
are calculated with all internal photons described by 
free propagators coupling with the asymptotic coupling 
strength a 0 . 


tor has the scaling form 


sfor 1 ~ 

V/m 2 — 



,>(-§P“ 


A 2 / m 2 



m 0 F 2 (ai) 



(109) 


Together with the fact that S* F and the scalar ver¬ 
tex r s are multiplicatively renormalizable, Eq. 
(109) implies 31 the results of Eq. (66) for both the 
renormalized electron propagator and the renor¬ 
malization constants and m 0i with the modifica¬ 
tion, already noted in Sec. IID, that the constants 
Ci and C 2 in Eq. (66) become dependent on nonas- 
ymptotic quantities. We must now examine whether 
the asymptotic expression of Eq. (109) is modified 
by the pieces containing one or more factors of h . 
To this end, it is useful to note that the powers in 
Eq. (109) arise in perturbation theory from infinite 
sums of logarithms, 

, (U0) 


and heuristically, the logarithms can be thought of 
as arising from integrals of the form 


J-P P 


( 111 ) 


In this language, the piece of the electron proper 
self-energy containing « factors of h will involve 
integrals of the form 



( 112 ) 


If h vanishes as a power of p for large p, the inte¬ 
gral in Eq. (112) vanishes as -p z /m z > A 2 /m 2 
and the scaling form of Eq. (109) is unmodified. 30 
On the other hand, if h vanishes as in Eq. (103), 
then Eq. (112) becomes 


L 


A 2 


d£_ 


p p[lnln(pAn 2 )]" * 


(113) 
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which does not vanish 30 in the limit of asymptotic 
-p 2 y A 2 and which could therefore give rise to cor¬ 
rections to Eq. (109). We again can salvage the 
situation if we can use the Federbush-Johnson 
theorem to argue that the Compton-like shaded 
blobs in Fig. 8 vanish when/(a„) = 0 and the fermi¬ 
on mass m is zero. However, this involves an ex¬ 
tension of the Federbush-Johnson theorem outside 
the charge-zero sector, which is the only place 
where a satisfactory proof in the case of electro¬ 
dynamics has'been given. 8,32 If such an extension 
is allowed, we gain a convergence factor m 2 /p in 
Eq. (113), giving 

r A2 _ d pml _ ( 114) 

J-? f?[lnln(p/m 2 )r ’ 

which vanishes as A 2 /m 2 -°°. 

We conclude, then, that the JBW eigenvalue con¬ 
dition and, possibly, the scaling form for the as¬ 
ymptotic electron propagator remain valid in the 
presence of the essential singularity, but only by 
virtue of an additional infinity of conditions being 
satisfied simultaneously. This, of course, poses 
troublesome questions of convergence (basically, 
is Oxoo effectively 0 in these problems?) which we 
have not attempted to settle. 

IV. LOOPWISE SUMMATION AND AN EIGEN¬ 
VALUE CONDITION FOR a 

Up to this point we have consistently employed 
the “vacuum-polarization-insertion- wise” summa¬ 
tion scheme, both in our review of the JBW results 
in Sec. IID and in our deduction of the presence of 
an essential singularity in the preceding section. 

As we have seen, this scheme leads to a one-pa¬ 
rameter family of asymptotically finite solutions, 
in which the asymptotic coupling a 0 is determined 
to be the zero y 0 of the Gell-Mann-Low function 
4>(y) [and simultaneously a zero of the simpler 
functions f(y) and F r %?)], while the physical cou¬ 
pling a is a free parameter, restricted only by 
the condition a < ot 0 ~ y 0 following from spectral 
function positivity [see Eq. (129) below.] The usu¬ 
al assumption is that this one-parameter family 
represents the most general type of asymptotical¬ 
ly finite solution which can occur. In the present 
section, we show that the presence of a simulta¬ 
neous zero in all of the single fermion-loop dia¬ 
grams makes possible one additional asymptotical¬ 
ly finite solution t which has the very appealing 
feature that the physical coupling a is fixed to be 
y 0 . Our procedure is not strictly deductive, in 
that we continue to accept the results concerning 
properties of the single fermion-loop diagrams 
which were found in Sec. IIIA, while dropping the 


identification of a 0 with y 0 which was made there. 
We will also introduce a new order of summing 
the perturbation series, involving “loopwise” 
rather than “vacuum-polarization-insertion-wise” 
summation. Specifically, we make the following 
two assumptions: 

(1) The function F ai (y) defined by Fig. 1 and the 

2w-point current correlation function with zero 
fermion mass, T .. ^ 2n (q u •. * > Qi n J m = 3>), 

vanish simultaneously at y s y 0 . As we have seen 
in Sec. HI A, the simultaneous vanishing of F ri1 and 

implies that they vanish with a zero of infinite 
order. 

(2) The photon proper self-energy can be cor¬ 
rectly obtained by “loopwise” summation. That is, 
we assume convergence of the sum 

ir.-Sir? 1 , < 115 > 

0 = 1 

where tt^ 1 is the contribution to the photon proper 
self-energy containing exactly n closed fermion 
loops. The burden of the present section will be 
to show that these two assumptions imply asymp¬ 
totic finiteness of the photon propagator when the 
physical fine structure constant is chosen to have 
the value a=y 0 . Furthermore, we will show that 
for this particular value of a the function 0(a) ap¬ 
pearing in the Callan-Symanzik equation vanishes 
(when summed loopwise) and so the theory has 
type-1 asymptotic behavior . 

To proceed, we introduce some additional defi¬ 
nitions. Let 0^1(a) be the contribution to 0(a) with 
exactly n closed fermion loops, and let ^ be the 
part of Tr^in which exactly r closed fermion loops 
remain when all internal photon self- energy parts 
are shrunk down to points [see Fig. 9.] In terms 
of these definitions, we can write 



FIG. 9. (a) Typical diagrams contributing to the 

part of the two-fermion-loop photon proper self-energy 
which contains only one fermion loop after the internal 
photon self-energy part (enclosed by dashed lines) is 
shrunk down to a point, (b) Typical diagrams contribut¬ 
ing to , the part of the two-fermion-loop photon 
proper self-energy which still contains two fermion loops 
after shrinking away the Internal photon self-energy parts. 
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0(a) = £0W(a), 

n-1 

wM = bir l ;- r] ■ 

r= 1 


(116) 


We now begin our argument by considering the 
case tt- 1. Because we are dealing with the renor¬ 
malized theory, the coupling constant which ap¬ 
pears is the physical fine structure constant a, 
and so (using our earlier notation) we must study 
the asymptotic behavior of tt [ l\q 2 \m,a). Refer¬ 
ring back to Eq. (70), we see that for asymptotic 
—q z /m z we have 

tt £Xq 2 ; m , a) = G fll (a) + F ril (a) ln(-g 2 /m 2 ) 

+ vanishing terms; (117) 


hence choosing a = y 0 guarantees the asymptotic 
finiteness of it . Next we consider the case n = 2, 
for which we can write 


TT 




+ TT 


[ 2 . 2 ] 
c f 


(118) 


with the two terms in Eq. (118) corresponding re¬ 
spectively to the diagrams in Figs. 9(a) and 9(b). 
Because the single fermion-loop vacuum-polariza¬ 
tion insertion has already been shown to be asymp¬ 
totically finite, we can use the argument which 
was employed above in getting Eq. (58) to show 
that TT [ * ,1] , as well as tt%' 2 \ grows asymptotically 
at worst as a single power of ln(-q 2 /m 2 ) f 


^ [2 ' l \q 2 \m t a) = G b,11 (a) + F fe ’ 1] (a)ln(-g 2 /»n 2 ) 


+ vanishing terms, 


(119) 


TT^ 2 \q 2 ;m f a)^G^ t 2 \a) + F^- 2 \a)ln(- q ym 2 ) 

+ vanishing terms. 


Furthermore, the same argument tells us that the 
potential logarithm is associated with the subinte¬ 
grations involving all lines in tt?’ 21 , and all lines 
in tt!?- 11 which remain after the internal photon 
self-energy part has been shrunk down to a point. 
Clearly, these subintegrations always involve at 
least one single fermion loop 2 j-point function 
(with 2) which, we have assumed, vanishes when 
a = y 0 and the fermion mass m is zero. As a re¬ 
sult, the potentially dangerous subintegrations are 
really two powers of momentum more convergent 
than indicated by naive power counting [cf. Eq. 

(90)] and hence cannot actually lead to logarithmic 
asymptotic behavior. So we learn that when a -y 0 , 
we have f' [a,l] (a!) a l i ’ b,a] (a) a O f and therefore t rj? 1 
is asymptotically finite. Note that the argument 
which we have just given does not determine the 
actual limiting values of vJJ 3 or t t* 1 , i.e., we learn 
nothing about the values of G ri] (a), G 6,11 (a), or 
G b * 2] (a) at o£=y 0 . This is expected, because the 


G’s depend on the nonasymptotic theory (where m 
cannot be neglected) as a result of the subtraction 
at q 2 = 0 which renormalizes the photon proper 
self-energy. Since knowledge of the G*s would al¬ 
low one to calculate a 0 through the formula 

b EG r "’ rl (a) = a 0 - l -a - 1 , (120) 

n =1 r-l 

we see that in our solution with a fixed, a 0 cannot 
be determined through asymptotic considerations 
alone. 

The next step in the argument is to prove the 
vanishing of 0 [l \a) at a * y 0 . We do this by using 
the Callan-Symanzik equation in the form given by 
Eq. (43) which, on substituting Eq. (12) for d c ~ x . 
and dropping the asymptotically vanishing term 
proportional to f yySt becomes 

-0(a) + jm g^- + 0(a)|a-^- ^Ja7T c ~0. (121) 

The one-fermion-loop part of this equation is 
-/3 [ll (a)+ wi^-aJrS^-O. (122) 

dm 

Substituting Eq. (70) for 7 r ? 1 this becomes 

0 ril (a) = -2aF w (a), (123) 

from which we immediately learn that 0 [l] (a) van¬ 
ishes at a ~y 0 . 

We now continue the argument inductively. We 
assume that when a = y 0 the pieces tt ..., tt ^ of 
the photon proper self-energy are asymptotically 
finite, while the pieces 0 [1 \ ..., 0 w of the Callan- 
Symanzik function 0 are zero. We wish to extend 
these assertions to the pieces tt [ ” + i] and 0 fn+l1 
which contain one more closed fermion loop. We 
write 

7r [ r il = s ir!r i - rl , ( 124 ) 

r = l 

where, according to our induction hypothesis and 
the argument preceding Eq. (58), the piece 7T r " +1 * rl 
can grow asymptotically at most as a single power 
of In (-q 2 /m 2 ) t 

+ F [ ^ l ' r Xa)ln(-q z /m 2 ) 

+ vanishing terms. (125) 

Again, the argument leading to Eq. (125) tells us 
that the potential logarithm is associated with the 
subintegration involving all lines in TT [ ? +lmr] which 
remain after the internal photon self-energy parts 
have been shrunk away. This subintegration al¬ 
ways involves at least one single fermion-loop 2j- 
point function (j > 2) which, whena = y 0 , improves 
the ultraviolet convergence of the subintegration 
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by two powers of momentum and hence prevents a 
logarithm from actually appearing in Eq. (125). 

So we conclude that F [n+1 ' r \a) = 0 when a = y 0 , 
r* 1 ,...,»+1 , and hence 7T r " +l1 is asymptotically 
finite. To prove the vanishing of /3 r " +ll (a), we con¬ 
sider the part of Eq. (120) involving exactly n + 1 
closed fermion loops, 

—/3 [n+ 1] (a) + m ■—- an * +l1 


+ 




(126) 

Using the induction hypothesis on /3 this equation 
simplifies, when a * y ot to 


-/3 [n+1] (a) +m — a7r^ +l1 ~0. 
dm 


(127a) 


But asymptotic finiteness of 7tells us that 


a 0 =a+ f aw(p/m 2 t a)d(p/m 2 ), (129) 

J o 

and hence a 0 >a. This inequality raises an appar¬ 
ent paradox when we turn to the “vacuum-polariza¬ 
tion-insertion-wise” summation scheme, which if 
applicable would imply that a 0 obeys the same 
eigenvalue condition as does a, F ril (a 0 ) * 0. The 
paradox is resolved, however, when we note that 
since y 0 is an essential singularity of F ril (y) 9 the 
point a 0 >ot =y 0 lies outside the radius of conver¬ 
gence of F^Xy), and so the interchange of limit 
and sum leading to the eigenvalue condition on a 0 
is unjustified. Another way of stating this is ob¬ 
tained by writing down the formal Taylor expan¬ 
sion connecting the eigenvalue conditions for a and 
O£ 0 , 



y=y 0 =a 


(130) 


m^-av l r a ~0, (127b) 

dm 

so we learn that /3 r " +ll (a) = 0 when a = y 0 , complet¬ 
ing the induction. 

To summarize, we have learned, for all w, that 
7 r [ c nl is asymptotically finite and that & [nl vanishes 
when a =y 0 . Invoking now our assumption of con¬ 
vergence of the “loopwise” summations in Eq. 

(115) and Eq. (116), we learn that when a -y 09 the 
total photon proper self-energy tt c is asymptotical¬ 
ly finite, and the total Callan-Symanzik function 
/3(a) vanishes. The vanishing of the Callan-Syman¬ 
zik function means that our solution with a =y 0 has 
type-1 asymptotic behavior. According to the dis¬ 
cussion of Appendix B, the asymptotic electron 
propagator must then have the simple scaling form 
of Eq. (66) (with a l = a), leading, as we have noted, 
to the possibility of a finite m 0 and Z 2 . 

In conclusion, we briefly discuss the relation of 
the asymptotically finite solution which we have 
Just found to the “vacuum-polarization-insertion- 
wise” summation methods used earlier. As we 
have seen, in our “loopwise” solution a is deter¬ 
mined by the condition F w (a) = 0, with the asymp¬ 
totic coupling a 0 determined from a by the func¬ 
tions G r " ,r1 (a) according to Eq. (120). A priori , we 
can say nothing about the value of a 0 except that 
positivity of the spectral function w(p t a) appearing 
in the KSllen-Lehmann representation 33 for the 
photon propagator, 

d c (-q 2 /m 2 ,a) = l + q 2 f w(p/m 2 t a) ^ 

Jo q — p— i€ 

(128) 


Since and all its derivatives vanish at y ot na¬ 
ive application of Eq. (130) tells us that F ril (a 0 ) = 0. 
This conclusion is of course incorrect, because 
the Taylor expansion in Eq. (130) attempts the 
analytic continuation of F ri] outside its region of 
regularity, and therefore is mathematically mean¬ 
ingless. In other words, because of the essential 
singularity, we cannot freely rearrange the “loop- 
wise”-summed theory, with a = y 0 , into a “vacuum- 
polarization-insertion-wise”-summed theory. 

V. DISCUSSION 

We have learned that there are two possible 
ways of having an asymptotically finite electro¬ 
dynamics. The first is the usual one-parameter 
family of solutions, in which the asymptotic cou¬ 
pling a Q is fixed to be y 0 and the physical coupling 
a <a 0 is a free parameter. The second is the 
unique additional solution found in the preceding 
section, in which the physical coupling a is fixed 
to be y 0 . We conjecture that nature in fact chooses 
this second type of solution, and hence that the 
fine structure constant may be calculated by de¬ 
termining the location of the infinite order zero 
y 0 of the function F^(y). u [Of course, if the func¬ 
tion F w (y) does not have an infinite-order positive 
zero, then electrodynamics cannot be asymptoti¬ 
cally finite.] We can advance two possible rea¬ 
sons why nature may choose the solution which 
fixes a over the solutions which fix a 0 : 

(1) The “vacuum-polarization-insertion-wise” 
summation procedure needed to get the solutions 
which fix a 0 may be divergent for all nonzero val¬ 
ues of a. In other words, electrodynamics may 
exist only when summed ‘loopwise,” with the spe- 


implies the sum rule 34 
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cific choice of physical coupling a = y 0 . 

(2) Both types of solution may be mathematically 
valid, but there may be stability arguments which 
tell us that when other interactions (such as weak 
or gravitational interactions) are switched on, the 
theory chooses the largest possible value of a, 
that is a~y 0 . 

We emphasize that we have given no arguments 
which distinguish which, if either, of these pos¬ 
sible reasons is correct. 

We conclude the paper by giving a brief, spec¬ 
ulative discussion of some further implications of 
the work of the preceding sections. First, we point 
out a possible connection of our work with Dyson’s 6 
old conjecture suggesting singularities in electro¬ 
dynamics at a = 0. Then, we discuss the fact that 
the conjecture stated at the beginning of this sec¬ 
tion gives a species -independent determination of 
a, and give an argument based on this which may 
justify our neglect of strong interaction correc¬ 
tions. 

A. Dyson's Conjecture 

Dyson has argued that the renormalized pertur¬ 
bation theory of quantum electrodynamics, re¬ 
garded as a power series in a, cannot have a non¬ 
zero radius of convergence. For if it did, the the¬ 
ory would still exist when analytically continued 
to negative a, which corresponds to a physical 
situation in which like charges, rather than unlike 
charges, attract. But in the analytically continued 
theory, the usual vacuum, defined as the state 
containing no particles, would be unstable. To see 
this, we note that if we create N electron positron 
pairs, with N very large, and group the electrons 
together in one region of space and the positrons 
together in another separate region, we can create 
a pathological state in which the negative potential 
energy of the Coulomb forces exceeds the total 
rest energy and kinetic energy of the particles. 
Although this state is separated from the usual 
vacuum by a high potential barrier (of the order of 
the rest energy of the 2N particles being created), 
quantum-mechanical tunneling from the vacuum to 
the pathological state would be allowed, and would 
lead to an explosive disintegration of the vacuum by 
spontaneous polarization. This instability means 
that electrodynamics with negative a cannot be 
described by well-defined analytic functions; hence 
the perturbation series of electrodynamics must 
have zero radius of convergence. 

If one assumes, as we do in this paper, that 
electrodynamics is by itself a complete theory, 38 
then physical quantities in electrodynamics are 
described by well-defined, calculable functions of 
a when a is positive. According to Dyson's argu¬ 


ment however, these functions cannot be continued 
to negative a, and therefore must have a singular¬ 
ity at a = 0. Because the singularity originates in 
a tunneling phenomenon, and because tunneling 
amplitudes are typically negative exponentials of 
a barrier-penetration factor, it is plausible that 
this singularity should be an essential singularity 
of the form e~ c ^ a . 

We can now attempt to make a connection with 
the results of the preceding two sections. As we 
recall, we argued there that the single-fermion- 
loop function F^\a) should possess an essential 
singularity (perhaps of the form exp[-c^y 0 - a)], 
resembling a tunneling amplitude) at the point a 
= y„>0. In establishing a connection with Dyson's 
work, there appear to be two possibilities. One 
possibility is that the singularity at y 0 is not Dy¬ 
son's singularity, but that electrodynamics exists 
for a range of positive a and that F [l \a) (or per¬ 
haps some other function in the theory) has a sin¬ 
gularity at a = 0 which prevents continuation to 
negative a. An alternative possibility is that F w (o} 
is regular at a = 0, but that the full photon propa¬ 
gator simply does not exist for values of the phys¬ 
ical coupling a other than y ot preventing continua¬ 
tion of the complete theory to negative fine-struc¬ 
ture constant. In this case, the singularity of 
at y 0 would be a mathematical manifestation of Dy¬ 
son’s argument. In this connection, it is intrigu¬ 
ing that the class of single-fermion-loop vacuum- 
polarization diagrams which we assert to possess 
an essential singularity are just the simplest dia¬ 
grams describing the creation of an arbitrarily 
large number of pairs from the vacuum, and there¬ 
fore are the simplest diagrams leading to Dyson's 
pathological state. For, as shown in Fig. 10, the 
single-fermion-loop diagrams describe the crea¬ 
tion of an arbitrary number of pairs from the 
vacuum, but with only one fermion world line ac¬ 
tually present. 

B. Species Independence 

Up to this point we have assumed the presence 
of only one species of fermion interacting solely 



FIG. 10. Ordering In which a single-fermion vacuum- 
polarization loop diagram describes the creation of an 
infinite number of pairs from the vacuum. (We have not 
drawn in any of the internal photons.) 
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with the electromagnetic field. Let us now consid¬ 
er the more general case in which there are j ele¬ 
mentary charged spin-^ fermion species which, 
for the moment, we still assume to interact only 
electromagnetically. Although these fermions 
may have different masses, the contributions of 
mass-difference terms to the photon proper self- 
energy are guaranteed, just by power counting, to 
be asymptotically finite. Hence to study the effect 
of having j fermions on the asymptotic behavior of 
the photon propagator, it suffices to consider the 
special case in which they all have a common 
mass w. Then, because each closed fermion loop 
in the photon proper self-energy appears j times, 
the piece of tt c containing exactly n closed fermion 
loops is multiplied by j", and so Eq. (115) is mod¬ 
ified to read 


< 131 ) 

n-1 

Clearly, because choosing a = makes each of the 
tt^ 1 individually asymptotically finite, this choice 
of coupling makes the total tt c asymptotically finite 
as well, independent of the species number j . 
Stated in another way, when j fermion species are 
present the single fermion loop function determin¬ 
ing y 0 is just j F fl] (y), and so the value of y 0 deter¬ 
mined is the same as in the one-species case. 

Thus we reach the important conclusion that our 
eigenvalue condition for determining a is indepen¬ 
dent of the fermion species number. Whether this 
species independence is maintained in the presence 
of elementary charged spin-0 boson fields is not 
clear. The requirement is obviously that the func¬ 
tion defined by summing the single charged 

boson loop diagrams of Fig. 1 in analogy to our 
definition of F w (y), must vanish with an infinite 
order zero at the same point y 0 where F w (y) van¬ 
ishes. All that is known about F fl] (y) and F^(y) at 
present is the first few terms in their respective 
power-series expansions, 37 



Equation (132) tells us that the functions F fl] (y) and 
Fj^y) are not identical, but of course says nothing 
about their behavior when summed to all orders. 

Returning, now, to our model with several 
charged fermion species, let us suppose that some 
of these fermions have strong interactions med¬ 
iated by neutral boson exchange (the gluon model). 




FIG. 11. Fermion vacuum-polarization loop modified 
by internal gluon (dashed line) radiative corrections. 


Although the bosons do not themselves contribute 
vacuum-polarization loops, they could modify the 
fermion vacuum polarization loops when they ap¬ 
pear as internal radiative corrections (see Fig. 

11.) However, let us now invoke the experimental 
observation of scaling in deep-inelastic electron 
scattering, one explanation for which 38 is that the 
exchanges which mediate the strong interactions 
are actually much more strongly damped at high 
four-momentum transfer than is the free boson 
propagator (< 7 2 +ji 2 ) -1 . If such an explanation proves 
correct, 39 then vacuum-polarization diagrams with 
gluon radiative corrections will by themselves be 
asymptotically finite, and so the presence of 
strong interactions will not alter our eigenvalue 
condition for a. Our scheme is clearly incom¬ 
patible, however, with the presence of fractionally 
charged fermions such as quarks 40 ; all elementary 
charged fermions must have the same basic elec¬ 
tromagnetic coupling (±) Va. 

Note added in proof. R. F. Dashen has pointed 
out to us that in order y 3 and higher the vacuum 
polarization structure of charged spin-0 boson 
electrodynamics will differ from that of the spin- 
\ case, as a result of the presence of a boson- 
boson scattering counterterm in the Lagrangian. 
Hence the analysis which we have given above for 
the case of spin-^ electrodynamics cannot be di¬ 
rectly applied to the spin-0 case. The JBW argu¬ 
ment for finiteness of the bare mass also fails in 
spin-0 electrodynamics. [See D„ Flamm and 
P. G. O. Freund, Nuovo Cimento 32, 486 (1964).] 


ACKNOWLEDGMENTS 

I want to thank R. F. Dashen and W. A. Bardeen 
for discussions which led to this work, and F, J. 
Dyson, B. Simon, and S. B. Treiman for a careful 
critical reading of the manuscript. I am grateful 
to the following people for helpful conversations, 
for criticism and/or for reading the manuscript: 

C. G. Callan, M. L. Goldberger, D. J. Gross, 

K. Johnson, A. Pais, A. Sirlin, F. Strocchi, A. S. 
Wightman, K. Wilson, and D. R. Yennie. I wish to 
thank Angela Gonzales of the National Accelerator 
Laboratory staff for preparing the figure drawings. 


R29 


389 


3042 


STEPHEN L. ADLER 


APPENDIX A: PARTIAL SUMMARY OF NOTATION 


JBW 

a 

<*w 

^3 

m 

ad c (-q 2 /m 2 f a) 
h(—q 2 /m 2 t a) 
ad?(-q 2 /m 2 ,a) 


ip(y) 
F [1 \y) 


y 0 

A 

M, Mo 

I 

Who 

7T c (^ 2 )= lim[7r(^ 2 ) - 7r(/i 2 )] 

A -*• 00 

* 

WIo, ^2 

0(a) 

/(o£ 0 ) 

in 

S' F (p) 
y(a), 6(a) 

^Zn “ * * • t Qznt 3 J ) 


Johnson-Baker-Willey 

physical coupling (fine-structure constant) 
new coupling constant defined by subtraction at w 
asymptotic coupling constant 

canonical or bare coupling constant, related to a by a b = Z 3 ~ l a 
root of qictj-cto, with q(y) = yd~(l t y) 
photon wave-function renormalization constant 
electron physical mass 

renormalized photon propagator; d c (-q 2 /m 2 f a)~[l + aTr c (q 2 )]' 1 

difference between ad c and its asymptotic limit a 0 

“asymptotic part” of the renormalized photon propagator, obtained by 
dropping in each order of perturbation theory terms which vanish as 
-q 2 /m z ~'x>, but keeping terms in each order which are constant or in¬ 
crease logarithmically 

Gell-Mann-Low function 

coefficient of logarithmically divergent part of the sum of single-fermi¬ 
on-loop vacuum polarization diagrams 

point where F w (y) has an infinite-order zero (essential singularity) 
ultraviolet cutoff 

physical photon mass (infrared cutoff), bare photon mass 
bare photon propagator 

gauge parameter (coefficient of longitudinal part of photon propagator) 

photon proper self-energy 

full unrenormalized photon propagator 

full renormalized photon propagator 

subtracted photon proper self-energy 

dimensionless variable -q 2 /m 2 

electron bare mass and wave-function renormalization constant 
coefficient of B/Ba in the Callan-Symanzik equation 

coefficient of the logarithmically divergent part of the photon proper self- 
energy in the JBW model 

electromagnetic current operator 

renormalized electron propagator 

coefficient functions appearing in the Callan-Symanzik equation for the 
electron propagator 

parts of 7 r Cf 0 with exactly n closed fermion loops 

single-fermion-loop 2«-point function (w^2) with coupling y 

modified 2-point function defined as a contraction on T 


390 


Adventures in Theoretical Physics 


5 


SHORT-DISTANCE BEHAVIOR OF QUANTUM... 3043 


it 


\n ,r ] 
c 


w(p/m 2 , a) 

F™(y) 


n 


part of in which exactly r closed fermion loops remain when all in¬ 
ternal photon self-energy parts are shrunk down to points 

Kailen-Lehmann spectral function for the photon propagator 

coefficient of the logarithmically divergent part of the sum of single 
charged boson loop vacuum polarization diagrams 

combination a(£- 1) through which gauge dependence occurs 


APPENDIX B: CALLAN-SYMANZIK EQUATIONS 
AND APPLICATION TO THE ELECTRON 
PROPAGATOR 


In this Appendix we derive the Callan-Symanzik 
equations for massive photon (i.e,, infrared cutoff) 
spinor electrodynamics in an arbitrary covariant 
gauge. We are particularly interested in the equa¬ 
tions for the electron propagator and the electron- 
scalar vertex, which can be used to derive the 
JBW asymptotic form for the electron propagator 
discussed in Sec. IID 2. To begin, we recall that 
the gauge parameter £ enters into the theory only 
via the quantity ot b D° P (q) lwt which according to Eqs. 
(2) and (7b) can be written as 


a 




-A’ 


Mo <7 - A' 


+ a(4 _l)i^ 




(Bl) 


In particular, we see that £ always appears in the 
combination a(£- 1), a fact which we exploit by 
displaying the arguments of the renormalized elec¬ 
tron propagator 3£ -1 and the electron wave func¬ 
tion renormalization Z 2 in the form 

= n,m,a, tj] , 

Z 2 = Z 2 [A, ii,m,a,i)] , (B2) 

7} = a(£- 1). 


d a a 

m — = m -— + 0 m a -— , 
dm dm 3 ct 

d 9 9 


with and /3 U defined by 


da „ _i d „ 

a ^ =m i^ =z * m i^ z » 


«^ u =M^ L =^3' 1 MTr^3, 


dM 


dM 


(B4) 


(B5) 


in analogy with Eq. (42). Applying these differen¬ 
tial operators to Eq. (B4), and observing that the 
unrenormalized propagator ft -m 0 - E depends on 
m and p implicitly through its dependence on m 0 
and and explicitly through the factor 1 Aq 2 -p 2 ) 
in the gauge term, we get the Callan-Symanzik 
equations for the electron propagator, 



= -(1 + 6 m )f s +ii 2 af} m f s , , 






(B6) 


= -S^fs+M^a^p -2)f s , +2 m 2 (£ -l)f s „ . 


To derive the Callan-Symanzik equations for the 
electron propagator, we start by writing down the 
equation connecting the renormalized and unre¬ 
normalized electron propagators, 

$r[p,V, m ,ci,n]=Z£F~ l 

(B3) 

with E the electron proper self-energy part. We 
now make independent variations in the physical 
electron and photon masses m and p, keeping A 
and a b fixed and simultaneously making a gauge 
transformation which keeps q = a(£ - 1) fixed. 
These variations are described by the respective 
differential operators 


In writing this equation we have introduced the fol¬ 
lowing additional definitions: 

Z2 " m d^ Z2= - 7 ”’ 

= 1 +6m ’ (B7) 

r s = moZtfl +—) , 


R29 


391 


3044 


STEPHEN L. ADLER 


5 


f 


s' 


_^2 


as 


Z$ 3m 


2 t 


m ~ + ap(a) j 

dm da J\m / 



The vertex part r s rr is defined as the sum of 
terms in which each internal photon propagator 
ot b D 9 F (q)^v * s replaced in succession by 

a(<7 M <7„/V)(<7 2 -M 2 )- 2 l-A 2 /(<7 2 -A 2 )]. (B 8 ) 

Note that the derivative a/aa in Eq. (B 6 ) acts only 
on the a dependence explicitly displayed in Eq. 

(B2) and not on the a dependence which is implicit¬ 
ly present as a result of the dependence on 77 . Let 
us now simplify Eq. (B 6 ) in two ways. First we 
pass to the region of asymptotic which 

allows us to drop the terms f s « and f s » on the 
right-hand side, since these vanish asymptotical¬ 
ly. Secondly, we observe that we are really only 
interested in keeping the infrared cutoff /a 2 where 
it appears in divergent terms proportional to a 
power of in 71 2 . We get these divergent terms cor¬ 
rectly even if we neglect those terms in Eq. (B 6 ) 
which vanish as 0(/i 2 (In ja 2 )") as /i 2 -0, Making 
these simplifications and adding the second equa¬ 
tion in Eq. (B 6 ) to the first gives the desired form 
of the Callan-Symanzik equations for the asymptot¬ 
ic electron propagator, 

[” 1 ^ + + a/3 ^3o' + 7 ( a » -1 1 + ^s > 

[^ +y »] S *- l ~ 0 . (B9) 
where 41 


0(a) =/3 m | m 2 =0( 

6(a) = 6 m | m 2 =0 , (BIO) 

y(a, 7j) = (y ni + y M ) | M 2 * 0 . 


A precisely analogous procedure 18 yields the Cal¬ 
lan-Symanzik equations for the asymptotic elec¬ 
tron-scalar vertex, 


r a a n t \ a 



(Bll) 


Finally, in the limit as /i 2 _* 0 Eq. (B7) for Z 2 and 
m 0 can be rewritten in the form 


f a 3 . , a 

[ M 3 ^ + 7 ^] Z2 = 0 ’ 



z 2 =o, 


closely analogous to Eq. (45b) for Z 3 given in the 
text. 

Let us now use Eqs, (B9)-(B12) to study the as¬ 
ymptotic behavior of and the large -A behavior 
of m Q and Z 2 in the cases of type-1 and type-2 as¬ 
ymptotic behavior (cf. Sec. IIB). 

Type In In this case the physical coupling a is 
equal to the value a x which satisfies < 7 (aj) = a 0 , 

/3(aj) =0 and, as shown in Sec. IIC, the asymptotic 
renormalized photon propagator ad“ is exactly 
equal to a 0 . Because /3(a) =0, the a/aa terms dis¬ 
appear from Eqs. (B9)-(B12), and so these equa¬ 
tions become the simplified Callan-Symanzik equa¬ 
tions used in the analysis of Ref. 17 (apart from 
the change that the asymptotic coupling a 0 used in 
Ref. 17 is replaced now by the physical coupling 
a = aj). For the asymptotic behavior of S' F (p) and 
the large -A behavior of m 0 and Z 2 we thus get 
the scaling expressions of Eq. ( 66 ). Furthermore, 
we find the gauge transformation properties de¬ 
rived in Ref. 17 to be in accord with the conclusion 
which we have reached above, that the gauge pa¬ 
rameter £ appears only in the combination r\ 

= a(£ - 1). 

Type 2, In this case a *aj and so /3(a) #0. We 
proceed to analyze the asymptotic behavior under 
the conventional assumption that a 0 is a simple 
zero, and a point of regularity, of the Gell-Mann- 
Low function ip, or equivalently 42 [cf. Eq. (53)] 
that a x is a simple zero and a point of regularity 
of /3. As we have seen in Eqs. (32)-(35), this as¬ 
sumption corresponds to power law vanishing of 
the nonasymptotic piece h of the renormalized 
photon propagator. To study Eqs. (B9)-(B11) for 
S' F and f S( we separate out they -matrix structure 
by writing 


SJ -" 1 = $F+mG , 
mT s -^H+mJ f 

which gives the equations 


(B13) 


[ m i^ + ^ +a ^ a) h +v(a ‘^ F ~° • 

[ a a a n 

m Sm + M 3^ + a<3 ^ — + y(a, 7})J»n G -■- [1 + 6(a)] mj , 

(B14) 

[” r ^ + ^ + 6(«)J J_0 - 


(B12) 


The first of these three differential equations has 
the general integral 
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with $ F [u> ii 1 2 * 4 /m 2 ,r}] an arbitrary function of its 
arguments. Let us now consider the behavior of 
Eq. (B15) as a-a v Since 0 has a zero at z = a yt 
the argument of the exponential prefactor and the 
argument u of the function $ F both become infinite. 
The only way for the function F to remain regular 
at a = aj is for the singularities of the exponential 
and of $j?at a = a y to precisely cancel. This can 
happen only if $ F has the following asymptotic 
behavior as u becomes infinite, 

<b F [u, n 2 /m 2 ,ri] ^ C r (n 2 /m 2 , a lt i]) 

xexp[^y(aj, r})u] . (B16) 

If we assume Eq. (B16), then when a is near a y 
we get 


F *=exp|^J" dz - y ( a i y x finite terms, 

(B17) 

which is regular because /3 vanishes with only a 
simple zero at z =a y . Let us now consider what 
happens as —p 2 /m 2 becomes infinite, with a fixed 
at its physical value, different from a r Again u 
becomes infinite, this time because of the term 
ln(-p 2 /m 2 ) in Eq. (B15), and so invoking Eq. (B16) 
gives us 


Bis) 

Thus, we see that even when a * a lt in the asymp¬ 
totic limit F exhibits scaling behavior with a scal¬ 
ing exponent y characteristic of the value ai at 
which vanishes. 43 An identical argument can be 
used to integrate the equations for G and J in Eq. 
(B14) and those for and mjm in Eq. (B12), and 
finally the equation 



— 0 (B19) 


can be used to relate the 4 dependence of the re¬ 
sulting constants of integration. The procedure 
unfolds in complete analogy 44 with the treatment 
of the JBW model given in Ref, 17, and the results 
obtained are of the same form as in Eq. ( 66 ), 
apart from the more complex structure of the inte¬ 
gration constants seen in Eq. (B18), 

To conclude, we reemphasize that in order to 
derive Eq. (B18), we need the twin assumptions of 
a simple zero in /3 and of regularity of the theory 
around a,, If /3 vanishes more rapidly than with a 
simple zero at a u the exponential factor in Eq. 
(B17) is still not regular at a lt and so the argu¬ 
ment for requiring $ F to have the particular as¬ 
ymptotic form given in Eq. (B16) is no longer com¬ 
pelling. For an alternative derivation of scaling 
behavior of the asymptotic electron propagator, 
which may be valid even when ip (or equivalently, 

0) has a higher-order zero, see Sec. in C of the 
text. 
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lower limit on the possible value of cm 2 . 

2T One might ask why, even for y * a 0 , one cannot simply 
add and subtract 

2 ' ** M 2n - 3^2n-2 M y *. 0,y) 

in the integrand of Eq. (76), thus leading to the following 
modified version of Eq. (85), 

, y) - 4V (9 S'”' ,:y)](-«V M „ + 9 m 9„) =I m -Ij , 

with 


1 , ... ifefl-i (jst'l* 

m J (2 tt) 4 (2 tt) 4 \ q? 

X ^?t^ 1 2*"» 1 2n-3» 1 2n-2My 
- r M 1 tM2*"M2n-3» 1 2n-2My 




The answer is that although l m is now ultraviolet conver¬ 
gent, the subtraction term makes I m a logarithmically 
divergent integral in the infrared of the general type 
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p(p+m 2 ) 



cam 2 dp 



In c 

= a + ca-- > a , 

c -1 


Thus, the modified version of Eq, (85) is still an ambigu¬ 
ous expression of the form » - ». The significance of 
the special condition, Eq. (74), which holds when y=a 0 
is that it improves the ultraviolet behavior of I m without 
simultaneously making the infrared behavior worse. This 
feature has been incorporated in the illustrative example 
given in Eq. (90). 

28 After this work was completed, we learned that 
K. Johnson (unpublished) knew a related argument sug¬ 
gesting a zero of infinite order in ^(y), obtained by work¬ 
ing directly with the modified skeleton expansion de¬ 
scribed in Sec. IIDl, 

29 Equations (99) and (102) clearly illustrate the distinc¬ 
tion between type-1 and type-2 asymptotic behavior. 

When a Q * q{a), the asymptotic behavior is type 2, and 
Eqs. (99) and (102) can both be developed as power series 
in In*. When a Q = q{a), Eqs. (99) and (102) both degener¬ 
ate to ctdj? =a 0 , as expected for type-1 asymptotic behav¬ 
ior. 

30 The discussion which follows depends only on the fact 
that the Gell-Mann-Low function vanishes with a zero of 
infinite order, and does not hinge crucially on the choice 
of Eq. (103) for A. To see this, we note from Table I 
that if the Gell-Mann—Low function vanishes with a zero 
oi'finite order N, the function h{x t a) vanishes asymptoti¬ 
cally as (lnx)'‘ I/ ^“ I * . Consequently, h n vanishes as 
(lnx)'* n/to '*^ and the integral in Eq. (106) diverges for all 
n ^ N — 1. Letting N-+ °°, we learn that In the case of an 
infinite-order zero of the Gell-Mann-Low function, the 
integral in Eq. (106) diverges for all n. Note that in 
making the distinction between the case where h vanishes 
as a power of x and the case where h vanishes more slow¬ 
ly, it is important to adhere to our convention of “vacu- 
um-polarization-insertion-wise** summation, which re¬ 
quires us to sum the logarithmic series defining d™ be¬ 
fore passing to the asymptotic limit. This is particularly 
important in the case of Eq. (102), where the logarithmic 
series has only a finite radius of convergence and so can¬ 
not be used to describe the asymptotic region. 

31 W. A. Bardeen (unpublished). 

^The question of whether the Federbush-Johnson theo¬ 
rem can be extended outside the charge-zero sector in 
electrodynamics is an important one and deserves fur¬ 
ther study. If it can be extended sufficiently to imply the 
vanishing of the electron-photon vertex part, then using 
the Ward identity to relate the vertex part to the asymp¬ 
totic electron propagator in Eq. (66) implies that 

linjF t C t m-7=0 

when the eigenvalue condition is satisfied. If F t * 0, this 
equation then tells us thatZ 2 must vanish. 

33 G. Kalldn, Helv. Phys. Acta 25, 417 (1952); H. Leh¬ 
mann, Nuovo Cimento U, 342 (1954). 

M Note that there is no contradiction between the fact 
that a 6 > a and the assertion, essential to the argument 
of Sec. IIDl, that the spectral function vanishes as 
m 2 -» o. For illustrative purposes let us follow the ex¬ 
ample of Eq. (90) and take 

u>(x,a)=c/[{l+x){l + cx)] 
with c> 0. Then Eq. (129) becomes 


but for fixed p the spectral function (i.e., the integrand) 
vanishes as m 2 -* 0. 

35 The fact that y 0 appears as an infinite-order zero of 
F^ means that it may be possible to determine y 0 from 
the limiting behavior of the nth term in the perturbation 
expansion for F^ as n —«>. For example, suppose that 
F^ actually has a convergent power series expansion 
around y *0 with radius of convergence y 0 , 

f q ^)=Z; c„y. 

n = 0 

Then y 0 is given by the limit formula 

i*'"- 

n -*®o 

Since c n describes the fermion loop with n internal virtu¬ 
al photons, it is thus conceivable that y Q can be computed 
in a semiclassical (large-photon-number) calculation. 
3G Dyson (Ref. 9) also considers the alternative possibili¬ 
ty, that electrodynamics by itself is not a complete theo¬ 
ry, and becomes consistent only when other interactions 
are taken into account. 

3T The sixth-order result for F^ is due to J. L. Rosner, 
Phys. Rev. Letters 17, 1190 (1966), and Ann. Phys. (N.Y.) 
44 , 11 (1967). The fourth-order expansion for F^(y) is 
due to Z. Biafynicka-Birula, Bull. Acad. Polon. Sci. 13, 
369 (1965). Conflicting results in the fourth-order boson 
calculation have been claimed by I.-J. Kim and C. R. 
Hagen, Phys. Rev. D 2, 1511 (1970). However, D. Sin¬ 
clair (unpublished) has located an error in the work of 
Kim and Hagen which, when corrected, gives Bialimicka- 
Birula’s result. Sinclair has also rechecked this result 
independently by Rosner’s method of calculation. 

38 For a review of this point of view, see D. J. Gross and 
S. B. Trelman, Phys. Rev. D 4, 1059 (1971). 

39 For an alternative explanation, which regards scaling 
as an intermediate energy manifestation of composite¬ 
ness of the nucleon, see S. D. Drell and T. D. Lee, Phys. 
Rev. D 5, 1738 (1972). 

40 SinceF* 2 p(y) is different from F^(y) our scheme could, 
in principle, accommodate a fractionally charged ele¬ 
mentary boson. 

4 *The renormalization constants m 0 and Z 3 are gauge-in¬ 
variant and are also infrared finite as /i 2 -*o. These prop¬ 
erties account, respectively, for the facts that 6(a) and 
0(a) are independent of tj and that 6^ and 0^ vanish as 
M 2 -^0. In Ref. 18, it is shown that the gauge dependence 
of y(a, V) is strictly additive, i.e., y(a, 7))-y(a, i}') 

= (^-^')/(27r). 

42 We assume that the mapping q{a) is well behaved near 
a u in particular, that q'ia^^O. 

43 This was first pointed out by K. G. Wilson, Phys. Rev. 

D 3, 1818 (1971). Our treatment is suggested by the pro¬ 
cedure of C. G. Callan, Ref. 10. 

44 In particular, comparison of the second and third equa¬ 
tions in Eq. (B14) shows that G ■ - J is a particular inte¬ 
gral of the differential equation for G , and a simple appli¬ 
cation of Weinberg’s theorem shows that one cannot add 
a solution of the homogeneous equation 

+a«a)^; + v{a,i))J W iC =0 

to the particular solution. 
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The various coupling-constant-dependent numbers describing anomalous commutators are 
constrained by the nonrenormalization of the axial-vector—current anomaly. The axial-vector 
current continues to behave anomalously even if the underlying unrenormalized field theory 
is finite due to the vanishing of the Gell-Mann-Low eigenvalue function. 


I. INTRODUCTION 

It has now been established that the canonical 
formalism of quantum field theory frequently 
yields results that are not verified in perturbation 
theory. 1 These “anomalies” are of two distinct 
kinds. Firstly there are failures of the Bjorken- 
Johnson-Low (BJL) 2 limit: Equal-time commuta¬ 
tors between operators, when evaluated by the BJL 
technique in perturbation theory, usually do not 
agree with the canonical determination of these 
commutators. A well-known consequence is the 
failure of the Callan-Gross sum rule for electro¬ 
production. 3 Secondly there are violations of Ward 
identities associated with exact or partial symme¬ 


tries; the two known examples being the triangle 
anomaly of the axial-vector current and. the trace 
anomaly of the new improved energy-momentum 
tensor. 1 (When a Ward identity is anomalous, 
there is also a corresponding BJL anomaly.) The 
Sutherland-Veltman low-energy theorem for neu¬ 
tral pion decay is falsified as a consequence. 4 
Both categories of anomalies arise from the diver¬ 
gences of unrenormalized perturbation theory, 
which require the introduction of regulators to de¬ 
fine the theory. The BJL anomaly reflects the 
noncommutativity of the BJL high-energy limit 
with the infinite regulator limit which must be tak¬ 
en to define renormalized, physical amplitudes. 
Failures of Ward identities arise when no regulator 
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exists which preserves the relevant symmetry. 

Although the common cause for both classes of 
anomalies is evident, it has not been appreciated 
that an intimate relationship exists between the 
BJL anomalies and the failures of Ward identities. 
In this paper we demonstrate that the very inter¬ 
esting analysis by Crewther 5 of the triangle anom¬ 
aly in terms of Wilson's short-distance expansion 6 
can be extended to exhibit this relationship. Fur¬ 
ther we show that in lowest nontrivial order of 
perturbation theory the ^-number anomaly in the 
equal-time commutator of space-components of 
currents can be completely determined in terms of 
the c-number anomaly in the equal-time commuta¬ 
tor between the time component and space compo¬ 
nent of the current, i.e., the ordinary Schwinger 
term. Finally we inquire to what extent the canon¬ 
ical formalism can be reestablished if the unre¬ 
normalized theory becomes finite due to the van- 

_ i 

Ttf c a (x, y, z)=NA^ a (x, y,z) + ---. 


ishing of the Gell-Mann-Low 7 eigenvalue function. 
Our conclusion, at least for the axial-vector cur¬ 
rent, is that naive manipulations continue to lead 
to error. 

II. THE CREWTHER ANALYSIS 

Assume that one is dealing with a theory which is 
conformally invariant at short distances. Consider 
the vector-vector-axial-vector current amplitude 

y, ^) = <01 “(^)) 10> . (2.1) 

Schreier 0 has shown that a conformally invariant 
three-index pseudotensor of dimension 9 must be 
proportional to the fermion triangle graph con¬ 
structed from massless fermions in free field the¬ 
ory. Hence (2.1) is given by 9 

(2.2a) 
(2.2b) 


TrrVy Vy W(*- y)dy ~ Z )A Z - *)* 

[(* - y ) 2 - ie] 2 [(y - 2 ) 2 - i«] 2 [(2 - xf - ie J 2 ‘ 


Here N is a number and the dots in (2.2a) represent less singular, non-scale-invariant contributions to 
T Xbc a ( x > V* z \ which vanish in the scale-invariant (= conformally invariant) limit. The precise assumption 
about these subdominant terms is that they can be identified and separated from A% bc a (x, y, z ) in sequential 
short-distance limits, 

lim lim T%£ x (x 9 y 9 z) = lim lim NA% b *(x 9 y, z) + less singular terms, (2.3) 

x t ■*** x t ~* x k x t ~* x k x i ~* x k 

where {x ir x j9 x k } are any of {x 9 y 9 z}. Thus we know that 


lim lim Tj£“(x, y, 0) 

x -*•0 V -*o 

lim lim TSTtx, 0,*) 


47T 6 
Nd, 


va&€ 


+leSS Sin&Ular termS > 


nhc € Mi ; 6 € X € ( 2 Z a Z h - g * Z 2 ) . . . , 

T i J +les s Singular terms. 


47T° (x 2 - ie) 2 (z 2 -ie) 


(2.4a) 


(2.4b) 


Next a scale-invariant short-distance expansion for current commutators is postulated, 


[V»(x),VUQ)} = -iS rr b ah (g'"’x?-2x' i S) 

x~0 


€(X°)& W (X^) . 


67T 3 


7T 


iKyyd abo € tll, aR Aq( 0 ) x l : : 8 “ + • * * 


(2.5a) 


[A£(x), AJ(0)] = - 2x»x") + iK AA d abc t»\ B Af(Q)x* + ... , (2.5b) 


x ~o 


[vr(x),A;(0)] = iKy A d ab e^ ai vmx S + • • • . 

x~0 71 


(2.5c) 


The dots indicate less singular contributions, or operators with quantum numbers and symmetries differ¬ 
ent from the exhibited terms. S and K are constants which appear in the following equal-time commuta¬ 
tors, 


[V2W, Vl(O)]| I 0 =0 = ifl rt S rr A8 , S s g)- J^^Syyd'd^O 3 ®*---, 

\v' tt (x), vi(0)]| lN =iM aic £ i, ^(0)« s (i ) + - 1 

etc. 


(2.6a) 

(2.6b) 


A is a quadratically divergent constant, and the omitted terms have different quantum numbers. An expan- 
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sion similar to (2.5) is written for T products, 




_ „ 2 x*S) r . d abc i»\,A?{ 0 )x R 

~ * VV 2TT 4 (j?-U) 4 ~ K yv j,l 2 - +* > 




2ir 2 [x i - i € \- 


T(i4 M (r' / l 1 '(0)) - S ^(g 11 x 2 ~ 2x v x v ) dgbct M "g6-Af (0)* 8 . 

nA.{x,A t w» - S AA 2irV-i€) 4 ~ AA 2n<*L** -fej* + ’ 


x~0 


T(V ll (x)A''(0)} - —K ^abc^ 11 aB^cW* 

nV tt WA h m- a Kva 277 2 (x 2 -£ € ) 2 + • 


(2-7a) 

(2.7b) 

(2.7c) 


Crewther’s observation is that the constants N, S, and K are not independent. 5 From (2.1) and (2.7c) it 
follows that 


d e Va xft 

lim T% a (x, y, 0) = -K ya <01 TiVS(x)nm 10) 


V -0 

while (2.6a) implies that 


(2.8a) 


d a^ va&€ yA2x>‘x 6 -^) 


i 1 ” J 1 ” T » 6 ““ (X ’ y ’ 0) =S rr K r* - ie ) 4 


(2.8b) 


Hence upon comparing (2.8b) and (2.4a), one finds 

N^S YY K YAt (2.9a) 

Additionally, from (2.1), (2.4b), (2.7a), and (2.7b) 

it follows that 

N~S AA Kyy. (2.9b) 

If a similar analysis is performed on the axial- 
vector-axial-vector-axial-vector current ampli¬ 
tude one gets 

N'=S aa K aa , (2.10) 

where N' is the proportionality constant defined 
analogously to (2.2a). 10 As emphasized by Crew- 
ther, the interest in relations (2.9) and (2.10) de¬ 
rives from the fact that 5 and K are measurable 
(in principle) in various deep-inelastic processes - 
thus the anomaly in low-energy processes, at an 
unphysical point, is directly determined by experi¬ 
mental high-energy behavior. 

HI. CONSTRAINTS ON ANOMALIES 

We shall use (2.9) and (2.10) to probe the struc¬ 
ture of anomalies in various models. Consider 
first a free massless field theory with 

VS(x) = :^(x)y* 1 ^X a <p(x)'-> 

A£(x) = t .'4>(x)iy il y 5 \\ a ip(x ):. 

It is trivial to verify that, as already stated by 
Crewther, 5 (2.9) and (2.10) are satisfied with N=K 
= 5 = 1- The nonvanishing of 5 and N is convention¬ 
ally described as anomalous. A naive evaluation 
of the equal-time commutator (2.6a) yields a van¬ 
ishing result; the nonvanishing of 5 measures the 
famous Schwinger-term anomaly. Similarly a na¬ 


ive evaluation of BfiTjSffa y, 2 ), 8? T£ b v e a (x, y , 2 ), and 
d M a T££?(x,y 9 z) yields zero since the currents are 
conserved. Nevertheless, as Schreier has shown, 8 
one cannot consistently set all divergences of 
&Hb “( x 9 y* z ) t° zero, because this quantity is sin¬ 
gular when all three points coincide. In momentum 
space this corresponds to the well-known violation 
of Ward-Takahashi identities of the fermion, axial- 
vector triangle graph. Hence N measures the axi¬ 
al-vector-current anomaly. 1 Since a naive deter¬ 
mination of K from the equal-time commutator 
(2.6b) also gives K - 1, we have a connection be¬ 
tween the anomalies: N = S\ the triangle anomaly 
is a consequence of the Schwinger-term anomaly. 

Next consider a fermion theory with an SU(3)-in- 
variant Yukawa interaction of strength g involving 
neutral vector gluons. (Spin-zero gluons render 
the axial-vector current infinite; hence we do not 
consider them.) In order to apply Crewther’s 
analysis, 5 it is necessary to satisfy his hypotheses: 
(1) the existence of finite currents; (2) the exis¬ 
tence of a scale-invariant expansion for products 
of currents, (2.5) and (2.7); and (3) the existence 
of a conformally invariant short-distance limit for 
T£b?( x > y, 2 ), (2.2). No complete calculation of 
Tab?( x > y> z ) in higher order has been performed 
which can be used to check the third hypothesis. 

We shall nonetheless assume that this result is 
valid, provided the other two are satisfied. This 
assumption is motivated by the fact that the trian¬ 
gle anomaly has no higher-order corrections, 11 
and is almost certainly true for the class of 
graphs, discussed in detail below, which contain 
only a single fermion loop. (See the Appendix.) 
Therefore we set N = N* = 1, even in the presence 
of interactions . In order to satisfy the first hy¬ 
pothesis, we must not consider SU(3) singlet vec- 
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tor currents, since these are not well defined in 
perturbation theory. The interaction with the vec¬ 
tor gluon gives rise to infinite vacuum polarization, 
which modifies the singlet current. 

In lowest-order perturbation theory in this mod¬ 
el, a scale-invariant expansion for current prod¬ 
ucts exists. This can be seen as follows. A BJL- 
limit determination of the equal-time commutator 
(2.6b) yields a finite expression. 1 Hence the q- 
number portion of the expansion (2.5) and (2.7) ex¬ 
ists. That the c-number part also exists follows 
from the Jost-Luttinger calculation of the proper 
vacuum polarization tensor. 12 Their result is that 
in second order of perturbation theory this object 
is no more singular than in the free-field model. 

[in momentum space both the free-field graph and 
the lowest order graphs of Fig. 1 go as a single 
power of ln(-fe 2 ) for large k.] However, S and K 
depart from their free-field values. The Jost- 
Luttinger formula 12 for 5 is 1 + 3^ 2 /16tf 2 . Because 
N = 1, we must have K = (1 + 3^ 2 /16tf 2 ) -1 «1 - $g 2 / 
16tt 2 ; and the BJL calculation of K gives indeed 
this answer. 1 Hence we see that the BJL anomaly 
in the commutator of two spatial components of 
currents is determined by the higher-order terms 
in the c-number Schwinger term. 

Beyond lowest order, perturbation theory no 
longer satisfies the hypotheses (1) and (2). The 
axial-vector current ceases to be well defined, 
since the graph of Fig. 2 is not rendered finite by 
external wave-function renormalization factors. 1 
Also the c-number portion of the expansion (2.5) 
and (2.7) is not of the assumed scale-invariant 
form, since the proper vacuum polarization tensor 
acquires quadratic and higher powers of ln(-fe 2 ). 

(No calculations have been performed on the q- 
number part of the expansion; but we expect that 
it too is no longer scale-invariant.) However, sub¬ 
sets of graphs can be chosen which probably con¬ 
tinue to satisfy the hypotheses. For example if 
fermion creation and annihilation is ignored, then 
the vacuum expectation value of the currents is 
given by the one-fermion-loop graphs. In this ap¬ 
proximation we have 13 


(a) 


V 


V 


I Wf e>QX (0 i or“* 

= (^y-<7 V) 2 ^b ah F(g 2 )In 
+ less singular terms. (3.1a) 

Here F(g 2 ) is the Baker-Johnson function, 13 whose 
first three terms in a power series expansion are 
known: 


^ a )- 1+ wS-5T2^ + -”- (31b) 

Also the axial-vector current is no longer infinite, 
since the graph of Fig. 2 is absent. Evidently the 
c-number term in the expansion (2.5) and (2.7) is 
scale invariant with S=F(g 2 ), and it is likely that 
so also is the ^-number term. Hence we conjec¬ 
ture that if the current commutator were computed 
in the BJL limit, without including fermion cre¬ 
ation or annihilation processes, one would find 


K(g 2 ) 


1 ,J gl 

F(g 2 ) ■ 167T 2 + 512 tf 4 


(3.2) 


IV. ANOMALIES IN THE GELL-MANN - LOW LIMIT 

We consider quantum electrodynamics, and as¬ 
sume that the Gell-Mann-Low eigenvalue function 
possesses a zero, so that Z 3 is finite. 7 Now one 
can discuss currents, since the vacuum polariza¬ 
tion no longer diverges. In this limit it should be 
possible to set the electron mass m to zero and 
scale invariance becomes exact. 14 (Z 2 , the elec¬ 
tron wave-function renormalization constant, can 
be made finite by appropriate choice of gauge.) We 
examine this (hypothetical) theory in the context of 
the ideas developed in Secs. II and HI. It will be 
seen that singular behavior survives even in this 
finite theory and that naive canonical reasoning 
continues to be inapplicable. 15 [All our previous 
formulas hold with SU(3) indices suppressed, and 
the following replacements: V% - AJ - J£, d abc 
- 2, 6 ab - 2.] Observe first that, since the triangle 
anomaly has no radiative corrections, N continues 
to be equal to unity. However, because Z 3 is finite, 
the quadratically divergent Schwinger term is ab- 


(b) V 


V V 


v <^> v 


FIG. 1. Contributions to (0|T(V'i 1 (at) V'g(0))10> 

which go as ln(— k 2 ) for large k. (a) Free-field theory 
graph, (b) Lowest-order perturbation theory graphs. 



FIG. 2. Graph which render" the axial-vector current 
infinite. 
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sent; i.e., 5 = 0. Since K = 1/5, we see that the co¬ 
efficient of the axial-vector current in the short- 
distance expansion of the product of two currents 
is infinite. In other words the c-number singular¬ 
ity (2.5) is weaker than x~ 6 , but the ^-number sin¬ 
gularity is stronger than x -3 , so that their product 
remains singular as x~ 9 . Consequently the BJL 
limit, which naively gives (2.5), is anomalous, and 
this is true regardless whether or not the electron 
mass is set to zero. 

Further difficulties emerge if we set the electron 
mass to zero. In that limit all vacuum matrix ele¬ 
ments of current products vanish by the Feder- 
bush-Johnson theorem, 14 

(0|J' M| (x 1 ) * ■ -J Mn (x n )|0) =0. (4.1) 

Nevertheless we now show that one cannot conclude 
the strong statement that J M (x) 10) = 0. For if this 
were true then 

T» v ^x,y,z) = (0\T(J»(x)J v (y)J?(z))\ty (4.2) 

must be purely a seagull, since no matter what the 
values of x°, y°, and z° are, there is always an 
electromagnetic current adjacent to the vacuum. 


However, a seagull cannot give rise to the anoma¬ 
lous divergence, 1 which survives even in the Gell- 
Mann-Low scale-invariant limit, since it is mass 
independent and is not renormalized. Consequently 
the equation J M (x) 10) = 0 is false. Evidently only a 
weaker statement can be true 

(0 |OJ M (x) |0) = 0, (4.3) 

where O stands for some, but not all , operators. 

In particular, products of electromagnetic currents 
can comprise O, but O cannot be 

J?(z)J v (y) or ^(y^z). 

A further problem appears if we combine the 
Federbush-Johnson theorem with the results which 
we obtained above from Crewther’s analysis when 
fermion creation and annihilation were neglected. 
As Baker and Johnson 14 have shown, when the cou¬ 
pling g 2 is equal to the value g 0 2 which makes Z 3 
finite, and the electron mass is zero, (4.1) holds 
even in the one-fermion-loop approximation. In 
particular, g 2 is a zero of the function F(g 2 ) de¬ 
fined in (3.1a) and the four-point function satisfies 

(4.4) 


T» vXa (xQyz) = <0 |T(J M (x)J y (0)J \y)J a (z)) |0> =0 . 


But now let us take the limit x- 0 in (4.4) and substitute the short-distance expansion of (2.7a). In the one- 
loop approximation S vv =F(g 0 2 ) = 0, so the leading contribution comes from the second term in (2.7a) and 
is given by 


T» vka (x0yz) oc K(g 2 ) 

x~o 


rM*' a x S 
€ ocB x 

(x 2 -ie)‘ 


(0\T{J2(0)j\y)j 


a /- \\ | (\\ one-fermion-loop 

y Z) nv) g 2 =go 2 tm=0 - 


(4.5) 


This is infinite , since according to (3.2) the coeffi¬ 
cient K(g 0 2 ) is equal to F -1 (^ 0 2 ) =°°> while we have 
seen that the three-point function appearing in (4.5) 
cannot vanish. So we have reached the impossible 
conclusion that 0=°°! Evidently, if the theory has 
an eigenvalue g 0 which makes Z s finite, the naive 
short-distance expansion is invalid at the eigenval¬ 
ue , even though it may be true order-by-order in 
perturbation theory. In particular, the limiting 
operations g— g 0 and x- 0 do not commute. One 
can easily write down simple examples which have 
this property, e.g., 

F~ 1 (g 2 ) 

1 + f(x)F-*(g*)’ (4,6) 

where f * 0 for x*0 but/(0J = 0. For all nonzero x, 
(4.6) vanishes as F(g 2 ) in the limit g 2 — g 2 9 but 
for x=0, (4.6) diverges as F _1 (^ 2 ) in the same 
limit. Whether such behavior can actually emerge 
from field theory, when all the constraints im¬ 
posed by current conservation and conformal in¬ 
variance are taken into account, remains an open 


question, as indeed does the question of whether 
an eigenvalue g 0 2 exists in the first place. 

V. CONCLUSION 

We have shown that the coupling-constant-depen- 
dent numbers, describing various BJL anomalies, 
are constrained by the nonrenormalization of the 
triangle anomaly. Furthermore the axial-vector 
current continues to behave in a singular fashion 
even in the finite theory of Gell-Mann and Low. In 
particular the following three phenomena are in¬ 
compatible: 

(1) The triangle anomaly is unrenormalized. 

(2) There is an eigenvalue g 2 =g 0 2 which makes 
Z 3 finite. 

(3) Naive scale-invariant short-distance expan¬ 
sions involving the axial-vector current are valid 
at the eigenvalue. 

Crewther 6 also applies Wilson’s method to anom¬ 
alies of scale invariance. 1 Unfortunately there 
does not seem to be a “no renormalization theo¬ 
rem” for these anomalies since all regulators vio- 
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late scale invariance. (Chiral invariance is not 
violated by boson regulators; these render finite 
all graphs but the basic fermion triangle.) There¬ 
fore results analogous to the above cannot be de¬ 
duced for scale invariance anomalies. 16 

We have benefited from conversations with 
R. Crewther, K. Johnson and K. Wilson, which we 
are happy to acknowledge. SLA and CGC, Jr. wish 
to acknowledge the hospitality of the National Ac- 


Jdz(0\T(e(z)^ <1 \x 1 )- ■ ’4> (n) (* n ))|0 )=i 



a 

dx t 


f dzz^oine^'Hx,) ■ ■ •<»< n Vn))|o> 



where 0 is the trace of the “improved” energy- 
momentum tensor (hence containing only mass 
terms and other soft operators), d t is the canoni¬ 
cal dimension of the field <p {t) and is the corre¬ 
sponding intrinsic spin matrix. 

The basis for the naive argument for asymptotic 
conformal invariance is the observation that since 
0 must contain explicit factors of mass the left- 
hand sides of (Al) and (A2) must, on dimensional 
grounds alone, be less singular at short distances 
than the corresponding right-hand side. 

The work of Zimmermann, 17 Lowenstein, 10 and 
Schroer 19 indicates that when the unavoidable di¬ 
vergences of perturbation theory are properly tak¬ 
en into account, the above Ward identities are 
modified by the addition to © of operator contribu¬ 
tions of dimension four (nonsoft). These new terms 
have no explicit dimensional factors and need not 
vanish relative to the right-hand side in the short- 
distance limit. As a result, asymptotic scale and 
conformal invariance are not realized in renor¬ 
malized perturbation theory, except in special 
cases. 

The nonsoft contributions to © are associated 
with the various wave-function and coupling-con¬ 
stant renormalization subtractions needed to make 
the theory finite. The pieces associated with 
wave-function renormalization can in fact be ab¬ 
sorbed in (Al) and (A2) by replacing the canonical 
dimensions d i by coupling-constant-dependent 
“anomalous dimensions” d- t . The pieces associated 
with coupling-constant renormalization are pro¬ 
portional to the various interaction terms in the 
Lagrangian and simplify only in (Al)'. The inser¬ 
tion at zero four-momentum of an interaction term 
is equivalent to differentiation with respect to the 


celerator Laboratory, where part of this work was 
done. 

APPENDIX 

In this Appendix we shall give arguments for our 
assertion that the vacuum-polarization-free trian¬ 
gle is asymptotically conformal invariant. Our 
starting point is the Ward identities for scale and 
conformal invariance. At the naive canonical lev¬ 
el, these have the form 

+ d,'\(0|T(<#)' 1 ’(x 1 )---<#) ( ’' ) (x„))|0) ) (Al) 


+ 2xJ(d l ^ tI „ + 2< i J) UOln^HXi) ■ ••<#> (n, (* n ))|0) > (A2) 


r 


corresponding coupling constant. 

In the body of the paper we considered a theory 
of an SU 3 singlet vector-meson coupling via a con¬ 
served current to a fermion. The octet vector and 
axial-vector currents in such a theory require no 
renormalization subtractions, since they cannot be 
coupled to the singlet vector meson by vacuum po¬ 
larization bubbles of the type illustrated in Fig. 1. 
Thus, the SU 3 XSU 3 currents will, according to the 
preceding paragraph, act like fields with canonical 
dimensions. The same statement applies to both 
the electromagnetic and axial-vector currents in 
quantum electrodynamics with vacuum-polariza¬ 
tion insertions omitted. Since the vector meson 
couples via a conserved current, the usual Ward- 
identity argument guarantees that coupling-con¬ 
stant infinities arise only from vacuum polariza¬ 
tion graphs. If such graphs are excluded - either 
by fiat, or by looking at a sufficiently low order in 
perturbation theory-no coupling-constant renor¬ 
malization is needed, and 0 in (Al) and (A2) may 
be treated as a soft operator. Further, if we con¬ 
sider a Green's function involving only nonrenor- 
malized currents, so that the relevant dimensions 
are all canonical, the scale and conformal Ward 
identities assume their naive form and the argu¬ 
ment for asymptotic scale and conformal invari¬ 
ance becomes correct. 

Let us apply these remarks to the VVA triangle. 
To 0(g°) (g being the coupling constant of the glu¬ 
on), we obtain the bare triangle, which is trivially 
conformally invariant in the short-distance limit. 
To 0(g 2 ) we obtain the triangle decorated in all 
possible ways with one gluon. At this level, no 
vacuum polarization is possible and the above ar¬ 
gument indicates that asymptotic conformal invari- 
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ance still holds. But there is only one possible 
form for a conformal-invariant VVA amplitude. 
Therefore, in the short-distance limit, T VVA 
- (1 +cg 2 )Tv\ Af where stands for the asymp¬ 

totic limit of the bare triangle. On the other hand, 
the PCAC (partially conserved axial-vector cur¬ 
rent) anomaly is determined precisely by the 
short-distance limit of T VVA and is also known to 
be coupling-constant independent. This is possible 
only if C =0, which is to say that the 0(^ 2 ) graphs 
succeed in being conformal invariant by vanishing. 
Now consider the 0(g 4 ) contributions to T VVAt At 
this level there are vacuum-polarization graphs 
and the argument for conformal invariance breaks 
down. Nonetheless scale invariance survives. We 
argued that when coupling-constant renormaliza¬ 


tion is needed, (Al) is modified by adding a term 

P(g) (0|T(4> (l, (Xj) • • • <J> fa, (^))|0> 

to the left-hand side. It turns out that & is 0(g 3 ), 
so that if we need P(B/Bg)T VVA to 0(g 4 ) it suffices 
to know T yya to 0(g 2 ). We have just argued that 
the 0(g 2 ) contribution to T YYA vanishes more rap¬ 
idly in the asymptotic limit than naive power 
counting would suggest. Therefore, the left-hand 
side of (Al), computed to 0(g 4 ), still vanishes 
relative to the right-hand side in the short-dis¬ 
tance limit, leading to asymptotic scale invari¬ 
ance. In higher orders, scale invariance presum¬ 
ably breaks down as well. 
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We show that the Feynman rules for vacuum-polarization calculations and the equations of 
motion In massless, Euclidean quantum electrodynamics can be transcribed, by means of 
a stereographic mapping, to the surface of the 5-dimensional unit hypersphere. The re¬ 
sulting formalism is closely related to the Feynman rules, which we also develop, for mass* 
less electrodynamics in the conformally covariant 0(5,1) language. The hyperspherlcal 
formulation has a number of apparent advantages over conventional Feynman rules in 
Euclidean space: It is manifestly infrared-finite, and It may permit the development of 
approximation methods based on a semiclasslcal approximation for angular momenta on the 
hypersphere. The finite-electron-mass, Minkowski-space generalization of our results 
gives a simple formulation of electrodynamics in (4,1) de Sitter space. 


I. INTRODUCTION 

Conformal invariance in quantum field theory has 
attracted renewed interest recently, because of its 
connection with problems of asymptotic high-ener¬ 
gy behavior. 1 Important results on leading light- 
cone singularities, for example, have been ob¬ 
tained by the use of conformal invariance. 3 Another 
question to which conformal invariance is relevant 
is the study of eigenvalue conditions imposed by re¬ 
quiring renormalization constants to be finite. 3 To 
see this, let us consider the single-fermion-loop 
vacuum-polarization diagrams in spin-i quantum 
electrodynamics, illustrated in Fig. 1. If we work 
in coordinate space with separated points x , x ' we 
can freely pass to the zero fermion mass, or con¬ 
formal limit. In this limit, however, the struc¬ 
ture of the vacuum polarization is unique, 2 and 
hence the sum of diagrams in Fig. 1(a) must be 
proportional to the lowest-order vacuum-polariza¬ 
tion tensor in Fig. 1(b), 

<*) = -37rf* 1 Ha)ffp°i, ) (*i x ') • U) 

When Eq. (1) is Fourier-transformed to momentum 
space, using current conservation in the usual 
fashion to eliminate the quadratic divergence, the 
function F^(a) appears as the coefficient of the 
logarithmically divergent term. Requiring the 
photon wave-function renormalization Z 3 to be 
finite then imposes the eigenvalue condition F^ 1] (a) 
= 0. 4 

Our aim in the present paper is to study reformu¬ 
lations of massless electrodynamics which are 
made possible by its invariance under conformal 
transformations, with the goal of developing meth¬ 
ods which may allow one to calculate or approxi- 

6 


mate the function F llj appearing in Eq. (1). Be¬ 
cause the singularity structure in x and x f is not of 
interest (it is just that of the lowest-order vacuum 
polarization), we make the Dyson-Wick rotation to 
a Euclidean metric at the outset. Thus we deal 
with massless, Euclidean quantum electrodynam¬ 
ics. Our principal result is that the Feynman 
rules for vacuum-polarization calculations and the 
equations of motion in this theory can be simply 
rewritten in terms of equivalent rules and equa¬ 
tions of motion on the surface of the 5-dimensional 
unit hypersphere. In Sec. II we state the 5-dimen- 
sional rules and verify by explicit transformation 
that they are equivalent to the usual rules in Eu¬ 
clidean coordinate space (x space). We also con¬ 
struct and verify a 5-dimensional formulation of 
the Maxwell equations and the equation of current 
conservation, and discuss the physical meaning of 
rotations and inversions on the hypersphere. In 
Sec. m we discuss massless, Euclidean quantum 
electrodynamics in the manifestly conformal-co¬ 
variant 0(5,1) language. We develop the Feynman 
rules in this formalism, explore some of their 
peculiar features, and show that they are related 
by a simple projective transformation to the rules 
on the 5-dimensional hypersphere. In Sec. IV we 
discuss possible generalizations and applications 
of our results. We point out that the finite-elec¬ 
tron-mass, Minkowski-space extension of our hy- 
perspherical results gives a simple formulation of 
electrodynamics in (4,1) de Sitter space. The 
electron wave equation which we use is just the de 
Sitter-space equation originally proposed by 
Dirac, 5 but our treatment of the Maxwell equations 
is an improvement over that of Dirac, and does 
not require the imposition of homogeneity condi- 
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tions. There are a number of possible calculation- 
al advantages of the hyperspherical formulation of 
electrodynamics over the usual Feynman rules in 
Euclidean space. First, because the surface of the 
hypersphere is a bounded domain, the calculation 
of vacuum-polarization diagrams in the 5-dimen¬ 
sional formalism is manifestly infrared-finite. 
Second, because the wave operators on the hyper¬ 
sphere are constructed from angular momentum 
operators, there appears to be the possibility of 
making semiclassical approximations when virtual 
angular momentum quantum numbers are large 
compared to unity. This contrasts sharply with 
the situation in Euclidean space, where there is 
no natural distance or momentum scale which dis¬ 
tinguishes regions where one can approximate the 
wave operator. 

II. 5-DIMENSIONAL FORMALISM 

In this section we set out the 5-dimensional 
formalism and verify, by explicit transformation, 
its equivalence to the usual rules in x space. 

Secs. IIA—IIC contain a summary of the 5-dimen¬ 
sional Feynman rules and equations of motion, 
while in Secs. nD and IIE we discuss the transform¬ 
ation to x space and the interpretation of symme¬ 
tries on the hypersphere. 

A. Summary of Feynman Rules 
on the Hyper sphere 

In writing down the 5-dimensional rules and 
comparing them with their Euclidean counterparts, 
we adhere to the following conventions and nota¬ 
tion. 6 Five-dimensional unit vectors are denoted 
by /h, /j 2 , ...; 5-dimensional vector indices are in¬ 
dicated by lower case italic letters a, b, ... which 


L. ADLER 6 

take the values 1, ..., 5, and the 5-dimensional 
metric is the Euclidean metric 6^. Similarly, or¬ 
dinary 4-dimensional vectors are denoted by 
x ly x 2J ... with vector indices p, p, ... taking the 
values 1, ..., 4 and with a 4-dimensional Euclidean 
metric The usual 4x4 Dirac y matrices are 
taken to satisfy a Euclidean Clifford algebra 

{r,,, r l,}=2o ( i1 , (2) 

and are all Hermitian; explicit representations for 
these matrices are well known. In writing the 5- 
dimensional rules we need, instead of the y’s, a 
set of five Hermitian 8x8 matrices a a satisfying 
the Clifford algebra 

{«..“»} = 26^. (3) 

In terms of the y matrices and the Pauli spin ma¬ 
trices t 12s , an explicit representation of the a 
matrices is 

oe M =y M T i, 0£ 5 = t 3 . (4) 

Since the matrix 

a 6 = t 2 (5) 

satisfies a 6 2 = 1 and anticommutes with the a a , the 
trace of an odd number of a matrices vanishes. 
Physical quantities such as the electromagnetic 
current, vector potential, etc., will be denoted by 
capital letters (J ay A aJ ...) in 5-dimensional space 
and by lower case letters (j^ a p , ...) in Euclidean 
space. We let Jd 4 x = Jdx x dx 2 dx 3 dx^ denote the in¬ 
tegration of x over Euclidean space and we similar¬ 
ly let j denote the integration of tj over the sur¬ 
face of the 5-dimensional hypersphere. Finally, 
we use tr 4 and tr B to denote, respectively, the trace 
over the y matrices and the a matrices. 

The connection between the 5-dimensional co- 





+ a- 


■——- 


<o— 


'-permutations - 1 


• • 


(a) 


7Ti 


(0) 




(x,x') 



(b) 


FIG. 1. (a) Single-fermion-loop vacuum-polarization diagrams in spin-^ electrodynamics, 
polarization diagram. 


(b) Lowest-order vacuum- 
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ordinate tj describing a space-time point and its 
Euclidean equivalent x is given by the stereo - 
graphic mapping 7 


X ll =K'\, K = l+T) 5 , 

with the inverse transformation 

2 * u 1 -* 2 
^ P ~ 1 + x z J 1 + X 2 * 


( 6 a) 

( 6 b) 


The 5-dimensional electromagnetic current «4, 
which satisfies the constraint equation 

r) • J = »)„«/„ = 0, (7) 

is mapped into the usual electromagnetic current 

by 

/T 3 j M = Jp -x M J 5 , (8) 

with the inverse transformation 

(9) 

J 5 = -K~ 2 X'j . 


We can now state the 5-dimensional Feynman 
rules with, for comparison, their Euclidean count¬ 
erparts. These are given in Table I. The equiva¬ 
lence of the two sets of Feynman rules for vacu¬ 
um polarization (closed-fermion-loop) calculations 
is demonstrated explicitly in Sec. HD below. 


B. Photon Propagator Equation and Maxwell Equations 


To write the wave equation satisfied by the pho¬ 
ton propagator on the hypersphere we introduce 
the (anti-Hermitian) angular momentum operator 


■^ab “ Va 


d 

fyb 



( 10 ) 


When several coordinates rj l9 rj 2 , ... are present 
we denote the angular momentum acting at i] 1 by 
L lr so (Lj^afr = ( 7 Jj_) 0 3 /3(tJx)^ - ( 171 )^ 0 / 9 ( 171 ) 0 , etc. In 
this notation the photon propagator equation takes 
the form 

(11) 

where 0 S is the hyperspherical 6 function satisfy¬ 
ing 


J dSi Vi /(ij 1 )6 s (ij 1 -ij 2 ) = /(ij 2 ) (12) 

for arbitrary /. The constant multiplying the 6 S 
function in Eq. (11) can be verified by integrating 
Eq. (11) over the hypersphere: 

JdBjLS-4) (v ^ vJ2 


= ~ 4 f dSil (^hF 

L JVm(i-m 2 ) 

= _ 4 ( J_\ §3! 

4 \4/3/ 3 

= -8tt 2 , 





(13) 


where we have written M ~Vi • Vz and used the fact 
that 




dn( 1 — / 1 2 ) x azimuthal integrations . 


(14) 


Equation (11) can also be verified from the expan¬ 
sion of (1 -fi)' 1 in terms of Gegenbauer polynomi- 
als 


TABLE I. The 5-dimensional Feynman rules and their Euclidean counterparts. 



5-dimensional 

Euclidean 

Electron 

-i 4(a • ?h-l)-Ha* V 2 +l) 

-i y(* j-X,) 

propagator 

* 2 Oh-4> 4 

21T* {x t -x 2 ^ 

Photon 

1 


propagator 

4tT Z (Ti!-TJ 2 ) 2 

4^ dl-*2) 2 + gaUge termS 

Electron-photon 
vertex a 

ieot a = {ie[ot- rj if a a I 

ieTfs 

Each closed 
fermion loop 

- tr 8 

-tr 4 

Each virtual 
coordinate integration 

fda. 

jd*x 


a The two indicated forms of 5 -dimensional electron-photon vertex are equal when sandwiched between electron propa¬ 
gators. 
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1 ^ (2« + 3)Cn /2 Qi) 

1 -u 2 L / (w + 1)(w + 2) 


(15) 


n*0 

Using the relation 


£ Y„ m (r h )YL(n 2 ) = ^-CV\ Vl - Vz ), 


(16) 


where the y nm (rj) are orthonormalized hyperspheri 
cal harmonics, Eq. (15) becomes 

YMYM 


1 


. 2 ^ ^ ^ nm\ •!' Jim 

77 Ls Ls (w+l)(w + 2) 


(17) 


(Vi-rh? „ 

Then, using the differential equation for the hyper- 
spherical harmonics 

^ 2 Y nm (v i) = -2 n(n + 3)7 nm (ni), (18) 

we find from Eq. (17) that 

( V- 4) (T)1 *^2 = -8tt 2 E E YMYM 

= -8rr%(ri 1 -r) 2 ), (19) 

in agreement with Eq. (11). 

To write the Maxwell equations on the hyper¬ 
sphere we introduce the electromagnetic potential 
5-vector A aJ which satisfies the constraint 

q-A= 0 (20) 

and is related to the electromagnetic potential a 
in Euclidean space by 

K~ 1 a fJ = A m -* m A 5 . 

The electromagnetic field strength is described 
by the totally antisymmetric rank-three tensor 


( 21 ) 


Fabc = L ab A c + + L ca A b , 


( 22 ) 


which is dual to the antisymmetric rank-two ten¬ 
sor 




ab 6 c abode x cde 
d 


~ €abcieVc 


A a . 


(23) 


9 Va~ * 

The usual dual tensor / in Euclidean space is 
related to F ab by 

K-'% v =F llv - x l ihv-x v F II 5 . (24) 

In terms of the tensors F abc and P ab the Maxwell 
equations become 

L a b F attc = 2eJ CJ (25a) 

LabK^Kc, (25b) 

with J c the electromagnetic current, which satis¬ 
fies the conservation equation 8 


L ab Jb s J a • 


(26) 


An explicit demonstration that Eqs. (25) and (26) 


indeed do correspond to the Maxwell and current- 
conservation equations in x space will be given be¬ 
low in Sec. HD. 

When Eqs. (22) and (23) are used to express the 
Maxwell equations in terms of the potential A, Eq, 
(25b) is trivially satisfied, while Eq. (25a) be¬ 
comes 


F ca A a = 2e e/ c , 

with P ca the wave operator 


P ca = 2L cb L ba -6L ca + L*6 Qa . 


(27) 


(28) 


Using the angular momentum commutation rela¬ 
tions it is straightforward to verify thatp ca has 
the following properties: 


■^ftc-^ca “F bc L ca “P ba , 

VbF ba ~P(iaVa ~ 0 j 


(29) 


which guarantee the consistency of Eq. (27) with 
the constraints on J given by Eq. (7) and Eq. (26). 
Equation (27) can be further simplified if the po¬ 
tential A a is chosen to satisfy the condition 


L a bAb-A a , 


(30) 


which is the hyperspherical analog of the Lorentz 
condition. When acting on potentials which obey 
Eq. (30) the operator P ca becomes simply 
(L 2 -4)6 Cfl . Hence the wave equation becomes 


(L 2 -4)A =2eJ c , 


(31) 


and as expected involves the same wave operator 
as appears in the photon propagator equation, Eq. 

(ID- 

C. Electron Propagator Equation and Field Equation 

To write the electron propagator equation we 
introduce the matrix y ab defined by 

7.6= 4* [a.. <*»]• (32) 

Using the abbreviation y ab L ab =y • L, we find that 
the electron propagator obeys the wave equation 

= -2tt 2 6 s (rji - r) 2 )(a • rj 2 +1), 
(ocTh-lHa-TJa + l) (33) 


('ll — t» 2 ) 4 


(iyl^-2) 

=-2 tt 2 6 s (rji - rjjsHa • rji -1), 


where the coefficient of the 6 S function in Eq. (33) 
is obtained by averaging over the hypersphere, as 
in Eq. (13). An alternative method for obtaining 
Eq. (33) is to use the following relation between 
the electron and photon propagators, 
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(ofTh-lXofTk + l) 

(TJl ~tj 2 ) 4 

= -i(i7*L 1 + l)^-^j-^(a*i7 2 + l). (34) 

Applying the wave operator iy • L x + 2 to Eq. (34) 
and using the identity 

(iy ■ L j + 2)(iy • i, + 1) = -\(L? -4), (35) 

we find 


(iy Li + 2 ) 


(a -r)! - l)(a ‘i} 2 + l) 
(t? 1 “ T)a) 4 


= 4(il2 “ 4) (^T^F (a ' T,2+1) 

= -Tj 2 )(a • tj 2 +1), 


(36) 

where in the last step we have used the photon 
propagator equation, Eq. (11). 

The matrix y ab is a generalized spin operator 
for the electron. Writing 


SflJ ~ ^“Yab 


~i[nt„ at]. 


(37) 



with the adjoint equation 


X < t7ab 


Va(-£r + ieAt(r)Y 


dri. 


-V> 


B 


a V, 


+ /eA a (r)))J - 


X = X f 


= 0, 

(41) 


The electromagnetic current J c which appears in 
Eq. (31) is given by 

J c (v) = -lix[a-ri,a c ]x. (42) 

Using the relation 

[« ,T ). a]=[a-V, at a }-2iV a yL 

(43) 


we find that S and L satisfy identical commutation 
relations, 


[5^5,5^] — 6 ac 5(f^ S c b + 6^ c ^ad 5 ac , 

[ ^abf L<tb ” &ai L c b &bc ^ad ” ^bi^ac > 


(38) 


and that 


and Eqs. (40) and (41), we see that the current J c 
satisfies the current conservation condition of Eq. 
(26). The constraint imposed by Eq. (7) is also 
obviously satisfied. 

D. Transformation from the Hypersphere to x Space 


J (39) 

( L-S) 2 = 3L-S-iL 2 . 

The second relation in Eq. (39) leads immediately 
to the identity in Eq. (35). 

Finally, in terms of an 8-component electron 
spinor y the electron wave equation takes the form 


We give in this section the explicit transforma¬ 
tions which map the hyperspherical Feynman rules 
and equations of motion into the corresponding 
rules and equations of motion in x space. We be¬ 
gin with the Feynman rules of Table I and consider 
first a closed fermion loop coupling to 2 n photons, 
given by 


J_ V _t H 2<a ‘T)i -!)i(a -t) 2 +1) -j i(a'T) 2 -l)|(a'T)3 + l) 

2n psrmutationstf 1. 2 n ^ iV (Vl “Tfe) 4 ^TT 2 (t) 2 -T) 3 ) 4 

-i ka • V2 n - 1 )i(a •T)i + 1) 

X ** (Vm-Vi)* 



(44) 


In order to obtain the corresponding 2«-point func¬ 
tion in x space, we must transform each of the 2 n 
current indices according to the recipe of Eq. (8a), 
which means that we effectively make the replace¬ 
ment 

Ota,- ( 45 ) 

for each vertex a a{ . After this replacement has 
been made, we must then find that a purely alge¬ 


braic rearrangement of factors gives the 2n -point 
function computed from x-space Feynman rules. 
For the denominator in Eq. (44) the rearrange¬ 
ment is trivial, since substitution of Eq. (6) shows 
that 

(Vi -TJui) 2 =K iK i+1 (Xi -* 1+ i) 2 . (46) 

To rearrange the numerator we exploit the fact 
that the factors a *tj± 1 appearing in each propa- 
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gator are projection operators, allowing us to re¬ 
write the numerator of the general propagator ac¬ 
cording to 

a(a ■ rji — l)i(oc *i? i+1 + l) 

= i(a -tj, -l)ia * (tj i+1 -Tj,)i(a£ - rj i+1 +1). (47) 


We next introduce the matrix 0(x) given by 

x _ 1 + a 5 ac x 
° {x) = ( 1+x 2 ) l/2 ’ 


1 -a 5 a • x 


(48) 


ow ’ 1= a+^) i/2 


where a • x denotes the 4-dimensional scalar prod- 
uct ctpX Some straightforward algebra then 
shows that 


0(x,)!a * {ru -t/ i+1 )0(x i+1 ) -1 


CL*(X | -X t+1 ) 

= ( 2 K i 2 K i + 1 ) m a-(x i — Xi + 1 ) 

(49) 


and that 

0(x)i(l +a *i?)(a M -x M a 5 )i(a -l)O(x)" 1 

= 1(1 + a 5 )a M i(a 5 -1). (50) 

Substituting Eq. (4) for the a matrices, we can pull 
all factors i(a 5 ± 1) = 1(t 3 ± 1) to the left, where they 
combine to give a single factor i(t 3 +1). The fac¬ 
tors t x appearing in the matrices a M then cancel in 
pairs (t 1 2 = 1), leaving 

-tr 8 [ M t 3 + 1)X { y} ] = -tr 4 [X{y} ], (51) 

where contains y matrices only. The factor 
K i appearing in Eq. (45) precisely cancels the fac¬ 
tor (k { 1,2 /k 2 ) 2 arising from the substitution of Eq. 
(49) and Eq. (46) into Eq. (44). Thus, we have 
shown that when the replacements of Eq. (45) are 
made, Eq. (44) can be algebraically rearranged to 
the form 


1 y* _ j_ r zl 

2n oemu^ations 


-i y-(*i-* 2 ) , -i y‘(x 2 -x 3 ) 


(x 1 -x 2 ) i iey ^ 27T 2 (x 2 -x 3 ) 4 2 tt 2 (x 2 „-X 1 ) i 


-i y(*2„-*i) .. 


permutations 
of 1 . 2 n 


ley. 


Ml 


(52) 


which is just the 2w-point function calculated according to the Euclidean x-space Feynman rules. 
The next step is to verify that the hyperspherical rule, 

( T?1 _|j 2 )8 ’ 


/ 


(53) 


correctly describes the propagation of a virtual photon from to tj 2 . The use of the current J in Eq. (53) 
is of course just a convenient shorthand for describing the 2 «-point functions from which the photon is 
emitted (absorbed), with all variables other than those referring to the virtual photon in question sup¬ 
pressed. Calculating the Jacobian of the transformation of Eq. ( 6 ) by use of 5-dimensional spherical co¬ 
ordinates gives 


- K 4 d 4 x . 


(54) 


Substituting Eq. (54) into Eq. (53), using Eq. (46) to rewrite the denominator of the photon propagator and 
Eq. (9) to reexpress the current J a in terms of the combination of components = k 3 ( J 5 ) which is 

relevant to % space, we find that the factors k precisely cancel, leaving 

J d x x d p 1 (x 1 )yp 2 (* 2 ) 4^2 (55) 


with 




2 (*lV,(*l)to ^-^Ui (* 2)110 4(1 + *i * *2)(*l)pi(*2)pal 


'M2 


MlM 2 
3 


1 +Xi 


Ml v a 'M2 

1+V 

8 


Ml’ 


'M2 


(1 + x 2 )(1+x 2 2 ) 


(56a) 


(x 1 -x 2 ) 2+ a(x 2 ) ll2 [ ln{Xl * 2)2 2 B( Xl ) Ul ln{1+x ' ) \ 

+ ^[ lnUl - X2)2 i nkr 2 ln(1+ *’ 2) ] ^[^(WHna+V)]. (seb) 
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In Eq. (56a) we give the form of the x-space photon 
propagator (x 1? x 2 ) which emerges directly 
from the substitution of Eq. (9) into Eq. (53); in 
Eq. (56b) we show that A can be rewritten as the 
usual Feynman propagator plus total derivative 
terms (gauge terms), which make no contribution 
to Eq. (55) because of electromagnetic current con¬ 
servation. Hence Eq. (53) is completely equivalent 
to the usual x-space Feynman rules for propagat¬ 
ing a virtual photon. In Sec. HE we will show that 
the special significance of the gauge terms in Eq. 
(56a) is that they give A simple transformation 
properties under coordinate inversion. 

To transform the photon and electron equations 
of motion and the current conservation equation to 
x space, we rewrite the differential operators 
9/9rj a and L ab in terms of x derivatives according 
to 


9 9 

— = k _ i (6 -S,** ) — + terms proportional to t} a , 

a Va ° x p 

(57) 


_3 __ 3 _ 

L »v- x n aXv Xv d Xll ’ 

£7 +(1 ~ K ^8*7’ 


(58a) 


and use the following equation [obtained from Eq. 
(6)] to differentiate /c, 


9 k « 


(58b) 


_9__ _9_ 

9X x a ° 9x o fl>L 

“ 2^\.ativfjiv • (^®) 

Similarly, applying Eqs. (58) to Eqs. (25) and (26), 
we find (after considerable algebra) that these 
equations reduce to the usual Maxwell and current 
conservation equations in x space: 


Eq. (25a)=* 3= ej v , 

(60a) 

Eq. (25b)=* 3 m 4„ = 0, 

(60b) 

Eq. (26)=.3 M j M = 0. 

(61) 

To transform the electron wave equation [Eq. (40)] 
and the expression for the electromagnetic current 
[Eq. (42)] to x space, we first note that the Pauli 
matrix t 2 commutes with the wave operator in Eq. 
(40), and hence the 4-component spinors 

Xi = af 1 ± T 2 )x 

(62) 

also satisfy Eq. (40). Defining x-space 4- 
nent spinors by 

compo¬ 

= /c 3/2 0(x)x 4 , 

tes) 

we find that the projection of Eq. (42) into 
takes the form 

x space 

;„(*)=-Wx[a- V, -*„a 6 ]x 



(64) 

with 



(65) 


Applying Eqs. (58) to Eqs. (21)-(24) we find that 
Eq. (24) implies the usual connection between the 
x-space field strength / and potential a M , 


Since a direct (and again somewhat lengthy) calcu 
lation shows that the Dirac wave operator obeys 
the transformation 


/c 3/2 0(x)|iy aA [i7 a ^~- - <eA*(Tj)^—- feA a (rj)Jj +2 J/c’ 3/2 0(x)^ =t 2 /c"V * 


( 66 ) 


the x-space spinors satisfy the usual mass- 
zero Dirac equation 



This completes the demonstration that the hyper- 
spherical formalism is completely equivalent to 
the usual formulation of quantum electrodynamics 
in Euclidean x space. 

E. Interpretation of Symmetries on the Hypersphere 

We briefly discuss in this section the x-space 
interpretation of the rotational and inversion sym¬ 
metries on the hypersphere. As we have seen, 


the photon wave operator is L 2 -4, and this com¬ 
mutes with the ten generators L ab of rotations on 
the hypersphere. Similarly, the free Dirac wave 
operator iy • L + 2 commutes with the ten opera¬ 
tors 

J a b = Lab+S ab , ( 68 ) 

Which are the hyperspherical rotation generators 
when spin is taken into account. We can interpret 
the hyperspherical rotational symmetry as follows. 
Six of the rotational generators L^ v (or J M „) leave 
the 5-axis invariant, and therefore, by Eq. (6b), 
leave x 2 unchanged. These clearly correspond in 
x space to the generators of the homogeneous 
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Lorentz group [which of course, in the Euclidean 
metric which we use, has become the 4-dimension¬ 
al rotation group 0(4)]. The remaining four gen¬ 
erators L 5vf which change x 3 , correspond to rath¬ 
er complicated conformal transformations in x 
space. For example, the 5-dimensional rotation 


r? — r? r : 

= 1 h,2.3 * 

Tj 4 = T] 4 cosa -Tfesina, 
Tls=T| 4 sina+T| B cosa 


(69a) 


corresponds in x space to the conformal trans¬ 
formation 

x~* x': 

■*1,2,3 “ -*1,2,3/^ J 

, r i/i 2 \ ■ i/n ( 69b ) 

x l s [x A cosa - 2(1 - x 2 )sma]/D, 

D = l+x 2 sin 2 (ioF)+x 4 sinoF. 

From this point of view, the manifest covariance 
of the hyperspherical Feynman rules under rota¬ 
tions generated by L 5V is a reflection of the con¬ 
formal invariance of zero fermion-mass electro¬ 
dynamics. We note, finally, that the ordinary x- 
space translation x~* x f = x+a does not correspond 
to a linear transformation on 17 , but rather to the 
nonlinear transformation 


tj-t)': 

TJs =[tj 5 - i« 2 (l +T| 5 ) -T| -a]/D ', 
D' = 1 + |a 2 ( 1 +r) 5 ) +rj • a, 


(70) 


V■ 

Translation invariance of the x-space formalism 
guarantees that the 5-dimensional formalism is 
covariant under the conformal transformations of 
Eq. (70), even though this is not manifestly evi¬ 
dent. 

In addition to the continuous-parameter rotation 
group, there is an important discrete symmetry 
operation on the hypersphere, the inversion 

17 -- 17 . (71a) 

According to Eq. ( 6 ), this corresponds in x space 
to the inverse radius transformation 

x-*—x/x 3 . (7lb) 


L. ADLER 6 

vacuum polarization insertions are made) the ra¬ 
diative corrected 2 «-point functions in the hyper¬ 
spherical formalism are manifestly inversion in¬ 
variant. This is turn implies simple transforma¬ 
tion properties for the corresponding x-space 2 w- 
polnt function under simultaneous inverse radius 
transformation of the coordinates x u .. ., x 2n . To 
find the form of the x-space transformation, we 
follow the notation of Eq. (53), and let de¬ 
scribe the emission of a photon from the 2 «-point 
function at coordinate 77 , with the other 2 w -1 
variables suppressed. In this notation, the in¬ 
version invariance of the 2 w-point function reads 

Jjv'=-v) =J a(y), ( 72 ) 

where, of course, the suppressed variables are 
also inverted. Projecting J^tj) back to the x space 
gives 

k~%(x) = ^( 17 ) -x M 1 / 5 ( 17 ), (73) 

while projecting the inverted current J a (i 7 ') gives 

(*')-%(*') = <40?') > ( 74 ) 

with 



n> 


x' = l +/J 5 
= 1-Vs> 


(75) 



Using Eqs. (73) and (74), we can convert the equal¬ 
ity of Eq. (72) into a relation between ; M (x') and 
j M U)> giving 


j M (*) =(x 2 )~ 3 M )iV ( x)j v (x'), 


(76) 


where 


M llv (x)M va (x) = 6 lla . 


(77) 


Thus, the 2«-point function in x space is left in¬ 
variant under the combined operations of (i) simul¬ 
taneous inverse radius transformation x y -*> -x y /x y 2 , 
j = 1 ,..., 2 n, and (ii) application of the projection 
operator 


2 n 2 n 

n (V)- 3 n m 


j -1 ; =1 





Because the trace of an odd number of a matrices 
vanishes, the hyperspherical expression for the 
closed loop 2w-point function in Eq. (44) is in¬ 
variant under simultaneous inversion of all the 
coordinates tj 1 , .. ., tj 2 „. Similarly, the hyperspher¬ 
ical photon propagator is inversion invariant. 

Hence we conclude that (as long as no divergent 


to the vector indices. This recipe is just the one 
discussed by Schreier . 2 In terms of the matrix 
M v we can understand the significance of the 
gauge terms in the x-space photon propagator of 
Eq. (56): The gauge terms guarantee that under in¬ 
verse radius transformations the photon propaga¬ 
tor transforms covariantly, i.e.. 
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= ) * 

(73) 

The usual Feynman propagator, of course, does 
not satisfy Eq. (78). 

III. CONNECTION WITH THE MANIFESTLY 
CONFORMAL-COVARIANT FORMALISM 

In this section we discuss massless, Euclidean 
electrodynamics in the manifestly conformal-co¬ 
variant 0(5,1) language, and develop its connection 
with the 5-dimensional formalism of the preceding 
section. In Sec. HA we review the 0(5, 1) for¬ 
malism and in Sec. IIIB we develop, in a heuristic 
fashion, the 0(5, 1) Feynman rules for electrody¬ 
namics. In Sec. in C we show that the 0(5,1) rules 
are related to the 5-dimensional rules by a simple 
projective transformation. 

A. The 0(5,1) Formalism 

As has been greatly emphasized recently, 1 a 
large class of renormalizable field theories con¬ 
taining no dimensional parameters (masses or di¬ 
mensional coupling constants) are invariant under 
the 15-parameter conformal group of transforma¬ 
tions on space-time. In particular, quantum elec¬ 
trodynamics with zero fermion mass is conformal 
invariant. We recall that of the 15 conformal-group 
generators, 10 are the generators of the PoincarG 
group, 1 generates the dilatations 

Xp —Atfp f (79) 

and the remaining 4 generate the special confor¬ 
mal transformations 

x„ +c„x 2 

- e —5-J. (80 

p 1 + 2c • x + c 2 x 2 

Although the usual formulations of massless field 
theories are manifestly Poincare-invariant, their 
invariance under the nonlinear transformations of 
Eq. (80) is not manifestly evident. However, a 
very pretty way of achieving manifest conformal 
invariance was introduced by Dirac, 10 and has been 
further developed recently. The basic idea is to 
replace the usual field equations over the Minkow¬ 
ski space-time manifold x^ by equivalent field 
equations over a 6-dimensional projective mani¬ 
fold £ A . (We adopt the convention that 6-dimen- 
sional vector indices are indicated by capital Latin 
letters A,B,.. . which take the values 1,.. ., 6.) 

The coordinate x is related to £ A by the projective 
transformation 


«*««5+6a- < 81 > 

When the metric in x space is the Minkowski met¬ 
ric (1,1,1, -1), the £ space is endowed with the 
metric (1,1,1, -1,1, -1); correspondingly, when 
the metric in x space is the Euclidean metric 
(1, 1, 1, 1), the £ space is endowed with the metric 
(1,1,1,1,1, -1). In either case, if £ is restricted 
to the light cone 

£ 2 = 0, (82) 

then it can be shown that the 15-parameter linear 
group of pseudorotations on £ is isomorphic to the 
conformal group of nonlinear transformations on x. 
In the Minkowski case, the pseudorotations form 
the pseudo-orthogonal group 0(4, 2), while in the 
Euclidean case with which we are primarily con¬ 
cerned, they form the pseudo-orthogonal group 
0(5, 1). So to construct a manifestly conformal in¬ 
variant formulation of massless, Euclidean elec¬ 
trodynamics, we must write equations which are 
manifestly covariant under the operations of 0(5,1). 

Because excellent reviews are available in the 
literature, 11 we will not actually detail the develop¬ 
ment of the 0(5, l)-covariant formalism, but rather 
will simply summarize the results needed for the 
construction of Feynman rules. 

(1) The electromagnetic current is represented 
by a 6-vector J A (£), homogeneous in £ of degree 
-3 and satisfying the kinematic constraint 

{• J({)= 0. (83) 

The equation of electromagnetic current conserva¬ 
tion takes the form 9 

= Ja«)> (84a) 

with 

^ab = £a 1 (84b) 

and J a (£) is related to the ff-space current j^(x) by 
the recipe 

;„(*) = { + 3 { (?) - [ J„({) + J,({)] } . (85) 

Note that Eqs. (84) and (85) are both invariant un¬ 
der "gauge*' transformations of the form 

JAtt)-J A «) + { A ^({), (86) 

with Af(£) homogeneous in £ of degree -4. The in¬ 
variance of Eq. (85) follows immediately from Eq. 
(81), while Eq. (84a) is left unchanged because 

= (87) 

where in the second equality we have used the 
homogeneity of ilf (£). 




R31 


411 


3454 


STEPHEN L. ADLER 


6 


(2) The electromagnetic potential is represented 
by a 6-vector A B (t), homogeneous in I of degree 
-1 and satisfying the constraint I • A =0. The pho¬ 
ton wave equation takes the form 

U 9 A B (i)=eJ B (l), (88a) 

with 


_ 8 8 
n e = &i B &i B ' 


(88b) 


(3) The electron field is represented by an 8- 
component spinor x(£)> homogeneous in I of de¬ 
gree -2, which obeys the wave equation 







+ 2 U = 0. 


(89) 

The matrix y AB is defined by 

7ab = H l Pb] , (90) 

where the 8x8 matrices satisfy a Clifford 
algebra 

{J3 A ,J3 B }= 2 gAB (91) 

with g AB the metric tensor. An explicit represen¬ 
tation of the 0’s is 

- ~y = &C> = ^ 2 • (® 2 ) 

The electromagnetic current of the electron is 
given, in terms of the spinor x> by 

J A = X B XY BA X, X = X r &- (93) 

These equations completely specify the O(5,l)-co- 
variant formulation of massless electrodynamics, 
and, via Eq. (85), allow us to project 2«-point 
functions in the 6-dimensional language back into 
2w-point functions in x space. 


B. 0(5,l)-Covariant Feynman Rules 


We proceed next to deduce, in a heuristic fash¬ 
ion, Feynman rules for the 0(5, l)-covariant cal¬ 
culation of closed-fermion-loop processes. We 
will not actually directly prove the equivalence of 
these rules with the usual x-space rules, but rath¬ 
er will show this indirectly in Sec. mC by deduc¬ 
ing the 5-dimensional rules of Sec. n from the 6- 
dimensional rules which we now develop. To be¬ 
gin, we infer from Eq. (93) that the rule for a ver¬ 
tex where a current with polarization index A acts 


at coordinate $ is 

vertexoc eT A (Z) = et B \fi B , P A ] • (94) 

Clearly, this rule automatically satisfies the kine¬ 
matic constraint of Eq. (83). [in Eq. (94) and sub¬ 
sequent equations of the present section, we omit 
numerical proportionality constants,] Next, we 
must guess the rule for the electron propagator 
S(£j, £ 2 ). We first note that since x(£) is homoge¬ 
neous in £ of degree -2, S must be homogeneous 
of degree -2 in and | 2 independently. A check 
on this requirement is provided by the fact that 
since J A (£) is homogeneous of degree -3, a 2w- 
point function must be homogeneous of degree -3 
in each of the 2w coordinates. Since the vertex 
r A (£) is homogeneous of degree +1, this require¬ 
ment will be satisfied by propagator-vertex chains 
of the form 


su lf ^)r A2 (^ 2 >su 2 , .. - (95) 


only if the propagator is homogeneous of degree 
-2 in each of its arguments. The homogeneity re¬ 
quirement immediately restricts the choice of 
propagator to one of two possible forms: 


^i(?i> £ 2 ) = 


fi - til • U 

u,-y 3 ’ 


{,)- 


• uy 


(96) 


We can rule out S 1 as a possible choice, however, 
by noting that when Sj is sandwiched between the 
two adjacent vertices we get 


^(^(^ yr^(y 


Ux • y 3 


Ux'U 3 ’ 

(97) 

where we have used the fact that 03 • £) 2 = | 2 = 0. 

But as we have seen above, "gauge” terms of the 
form (£i) a or U 2 )a 2 project to a null current in 
x space, so use of as the propagator would lead 
to identically vanishing 2«-point functions in x 
space. We conclude that the correct choice of 
electron propagator is $ 2 , and that the 0(5,1)- 
covariant expression for a closed fermion loop 
coupling to 2n photons is given (up to proportion¬ 
ality constants) by 
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tr. 


permutations 
of l.2n 


(S, • t*? [/3 ’ ■ « 3 ) a ’' ’ ' £l ’ ^ 


(98) 


Although we have constructed our rules to satisfy the kinematic constraint of Eq. (83) and the require¬ 
ments of homogeneity, we must now check whether they are consistent with the equation of current con¬ 
servation, Eq. (84a). To do this, we first examine the effect of the free Dirac operator on the propagator 
forms Sj and S 2 . By direct calculation, we find that 

<tv A bU B + 2JS,({„ {,) ««,({„ UHiVjuXF* - 2) = 0, (99a) 


HyjLF* 2)S,({ 1 , y-aS^J,), 

y(fy^L 2 AB -2) = -2S 1 U 1 , | 2 ), 


(99b) 


all for *| 2 [at ^ = | 2 there are additional 6 function contributions, which we omit in writing Eq. (99)]. 

We see that the correct propagator $ 2 does not satisfy the Dirac equation, and that adding in an arbitrary 
multiple of Sj cannot fix things up. In effect, we see that is a null propagator (because it leads to a van¬ 
ishing current in x space) and that S 2 is a pseudopropagator, which when acted on by the Dirac wave oper¬ 
ator gives a multiple of the null propagator, but not zero. 

Let us now examine the effect of this peculiar state of affairs on the current conservation properties of 
Eq. (98). We consider the propagator-vertex chain linking the points £, and act with the differential 
operator L AB = l A */*l B -i B z/zi A , giving 


l ab‘ • 




^2) 


• » 


= ■ ■ ■&• - - • +Ra- ( 100 ) 

The first term on the right-hand side of Eq. (100) is just the result required by Eq. (84a), while the re¬ 
mainder R a is given by 


+ • • • \fi * !i, B AB^ 8 “ 2) (£ . | )2 [0 ' I 2 ) ' * * | 

Substituting Eq. (99b) and algebraically rearranging as in Eq. (97), we get 12 


( 101 ) 




Although Eq. (102) does not vanish, the first term 
in the curly brackets is a pure '‘gauge” term with 
respect to the index Aj, while the second is a pure 
"gauge” term with respect to the index A lf and 
hence both give a vanishing contribution to the 2w- 
point function when projected back to x space. So 
we see that because $ 2 is a pseudopropagator, Eq. 
(98) only satisfies a pseudocurrent-conservation 
condition: When we test current conservation on a 
given index, Eq. (84a) is not satisfied in the 6-di¬ 
mensional space, but does hold when we project 
on all of the remaining indices to transform back 
to x space. 

As an explicit illustration of this pseudoconser¬ 
vation property, let us consider the single-loop 
two-point function, which according to Eq. (98) is 


1 r . ) 

+ ••• u- ‘' j • 

( 102 ) 


given by 

tr 8 {[0 • • «.,&■.]} it • UgA, M - «,)*(&)* 

ttx ■ y 4 ({, • « 2 ) 4 


Acting on Eq. (84) with (L t ) A i A i gives 



iiSAiAt (^i)Aa(^a) 




«x • U 4 


(103) 


«x * «.****-UxW?.) 44 

(«x * «x) 4 


+ R, 


4{, ‘ Ulit^U)* 
(it • 


(104) 
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As expected, there is an extra term R which, be¬ 
cause it contains the factor (Sa)^, makes no con¬ 
tribution to the two-point function in x space. In¬ 
terestingly, there is no way of modifying Eq. (103) 
to make the extra term R vanish. To see this we 
note that the only other second-rank tensor with 
the correct homogeneity properties and which sat¬ 
isfies the kinematic constraint of Eq. (83) is 


<«i ' U 4 


(105) 


However, Eq. (87) tells us that this expression 
satisfies 




(£l 'Ia 1 (^)a2 


Ui'fiiuK 
«,•«*)* ’ 


(106) 


so adding a multiple of Eq. (105) to Eq. (103) can- 

_I 


not cancel away R. r We conclude that pseudo-cur- 
rent-conservation is an unavoidable feature of the 
0(5, l)-covariant formalism. 

Next, we turn our attention to the photon propa¬ 
gator D AlA2 Uit l 2 )* Because the photon field A fi (£) 
is homogeneous in £ of degree -1, the photon prop¬ 
agator D must be homogeneous of degree -1 in £ x 
and | 2 independently. In addition, in order to an¬ 
nihilate the extra “gauge” terms which appear 
when we test current conservation on indices of 
the closed loop other than A 1 ,A a , the photon prop¬ 
agator must be explicitly transverse, 

til) A 'V AlA2 til, U) -&)**> AlM til, U) = 0 - ( 10 7 ) 

The simplest form which satisfies these require¬ 
ments is 


d a 1 a 2 u u y 


Sa x c 


til) Al til)c 

■ l, • s. 


gC *2~ 


la ' £2 


(108) 


where \ and \ 2 are arbitrary points which are held fixed when doing the virtual integrations over £ x and 
£ 2 . Because of gauge invariance, closed-fermion-loop expressions have no dependence on l x and \ 2 after 
one sums over all orderings, with respect to other photons which are present, of the emission and ab¬ 
sorption of the virtual photon propagated by Eq. (108). The simplest way to verify this statement, and to 
check the correctness of Eq. (108) to begin with, is to transform Eq. (108) back to x space. We find 


%i+*%2+*J Al tii)D Al A2tiit ^J^tiz) ^*Mi^i)^Mim 2 (^u ifi 2 tii) y 


(109) 


with the effective #-8pace propagator given by 

f til “ %l) p ji&l ““ *2) p z (#2 “ ” ^ 2 ) 

A MlM2< a l* ~ {X X -X 2 Y +2 (X 1 -X 1 ) 2 (x 1 -X 2 f +2 (x 2 -X x f(x 2 -X 2 f 


m 2 


til ~ “ ^ 2)^2 

2 (^-Xiftiz-x^) 2 


(110a) 
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(x 1 — x^j* 9 (^ 2 ) 


m 2 






[ lnfai -x*Y\ 


8 (%), 


ln(* 5 


3( Xl 


)u ^(^)l 


[|ln(3t: 1 -^ 1 ) 2 ln(^-^) 2 ]. (110b) 


In Eq. (110a) we give the form of the ff-space prop¬ 
agator which emerges directly from the transfor¬ 
mation; in this equation x Xf x x , x 2i \ denote, ^re¬ 
spectively, the ff-space images of % 2 , £ 2 * In 
Eq. (110b) we see that A' is equivalent, up to 
gauge terms, to the usual Feynman propagator, 
and in particular that all the dependence on 
is contained in the gauge terms. This verifies 
that Eq, (108) is a valid expression for the photon 
propagator, and that £ x and \ 2 drop out of gauge 
invariant quantities, such as closed fermion loops. 
[The derivation of Eq. (110) from the conformal- 


covariant expression of Eq. (108) indicates that 
the effect of the ff-space gauge terms is to render 
A' covariant under #-space conformal transfor¬ 
mations, provided that the points x X9 x^ are con¬ 
formally transformed along with x x and ^. 1S ] 
Finally, calculation of the Jacobian of the trans¬ 
formation of Eq. (81) shows that 

j d*x = J dS e t + -*, ( 111 ) 

where denotes an integration over the hyper- 
sphere +| 2 2 +l 3 2 + + £<? = £ 8 2 , with held 
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fixed. Comparing with Eq. (109), we see that 

= / 

( 112 ) 

indicating that the Feynman rule for virtual inte¬ 
grations is simply 

virtual integration over £: J dS^ . (113) 

This completes our specification of the 0(5,1)- 
covariant Feynman rules for calculating closed- 
loop quantities. 

C. Projection onto the 5-Dimensional 
Unit Hypersphere 

We complete our discussion of the 0(5, l)-co- 
variant formalism by showing that it is related, 


by a simple projective transformation, to the 5- 
dimensional Feynman rules of Sec. II. The trans¬ 
formation is generated by exploiting the fact that 
in an w-virtual photon process, the fixed points \ 
in each of the n photon propagators can be chosen 
independently , provided that over-all Bose sym¬ 
metry is maintained. Since closed-fermion-loop 
amplitudes are independent of all of the propagator 
fixed points, they will be unchanged if we integrate 
all of the fixed points over their respective hyper- 
spheres • • ■ +£ 5 2 = I 0 2 . The effect of this in¬ 
tegration is to replace the photon propagator of 
Eq. (108) by the averaged propagator 

/ dSi dSiD U lt y 

y--. ai4) 

JdS- h dS i2 

The integrations in Eq. (114) are readily evalu¬ 
ated, giving 


y- Si . 


(yT 



+ terms proportional to (^) A or (£ 2 ).^ ! (115) 


r 


the terms proportional to U x ) Al or (£ a )^ are un " 
interesting because they make a vanishing contri¬ 
bution by virtue of the constraint equation, Eq. 

(83). The key feature of Eq. (115) is that the quan¬ 
tities in brackets, 


& AjC 


^A l8 (^)c 

<y 8 ’ 



(116) 


(ii) The five-dimensional current J a (v) is related 
to the 6-dimensional current J A (£) by 

= (ns) 

which is just the projection generated by the brack¬ 
ets of Eq. (116). 

We proceed now to combine Eqs. (115), (117), 
and (118). Using 


both vanish when C =6, so the sum in Eq. (115) ex¬ 
tends only over C = 1,..., 5. This suggests pro¬ 
jecting onto a 5-dimensional space, as follows: 

(i) The 5-dimensional coordinate 7j a is related to 
the 6-dimensional coordinate Z A by 

t)„ = K a-1,...,5. (117a) 

The light-cone restriction on £ implies that 


jds^jaa^, (H9) 

we get 

/ v y 

= j da„da „ (Vi ). 

( 120 ) 


n 2 = l, (117b) 

and scalar products in 6-space may be written in 
5-space as follows: 

£i ’ i 2 =-i(ii)eU 2 )e(v i -V 2 y • (H7c) 


which reproduces the 5-dimensional Feynman rule 
for photon propagation. To study the effect of the 
projection operation of Eq. (118) on the 0(5,1)- 
covariant expression for a closed fermion loop in 
Eq. (98), we consider first the projection of the 
vertex r A (|). We find 
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U '[ */ - »l.A A ]T a ({) - «e 3 D3 •{,£.- T)A] 

= ? 0 4 |> ’V-PvPa-TlM 
= -£ 6 4 (a • TJ + 1 )a r (a • rj — 1) , 

( 121 ) 

where we have introduced matrices a a defined by 
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a. = -PePa • (122) 

Since the propagator (^ * £ 2 )~ 2 can be rewritten as 


(Si• Uf = («!).*«.).*(%-Hi ) 4 ■ (123) 

we see that the projection of Eq. (118) transforms 
Eq. (98) into 


j 


4 2n 2 tr 8 

permutations 

of 






i 


(124) 


which apart from normalization constants is iden¬ 
tical with Eq. (44). So we have verified that the 
projective transformation generated by using the 
averaged propagator of Eq. (114) just gives the 5- 
dimensional Feynman rules for the photon propa¬ 
gator, the electron propagator, and the electron- 
photon vertex. 

To conclude, we show that Eq. (118) and the 
formal properties of the 6-dimensional current 
J A U) imply the corresponding formal properties 
of the 5-dimensional current J a (Tj). We begin with 
the constraint equation, £ • J A (£) = 0, which can be 
rewritten as 

0=«.J.(0-fcr.(0 

= «,ti.[J.({) - J.({)] 

= | 6 -W(t,), (125) 

giving the 5-dimensional constraint equation, Eq. 
(7). Next we consider the 6-dimensional version 
of current conservation, 

> (126) 
and use Eq. (87), with M(£) = £ 6 ~ L J 6 ti), to write 

(127) 

Since the sum on B in Eq. (127) extends only over 
B = 1,..., 5, and since we are interested only in 
values of the free index A = 1,..., 5, no derivatives 
appear. Hence, on multiplying through by 
£ 0 S and using the fact that 

^W b ~* b W' =r, °*v b - r,b H l =L ‘‘ b, (128) 

Eq, (127) becomes the 5-dimensional current con¬ 
servation equation 

JwJ.foW.fo). (129) 


IV. DISCUSSION 

In this section we very briefly discuss possible 
generalizations and applications of the 5-dimen¬ 
sional formalism of Sec. 11. First, we note that 
although we have worked with a Euclidean x-space 
metric throughout, it should be straightforward to 
generalize the 5-dimensional rules to the usual 
Minkowski case. The hypersphere will then be¬ 
come the hyperbolic domain 

Vi 2 + V2 2 + V3 Z -Vi 2 + i?5 2 = 1 j (130) 

which is a (4,1) de Sitter space of unit radius. 14 A 
further generalization would consist of giving the 
electron a mass m and, since the distance scale 
now acquires a meaning, calling the radius of the 
de Sitter space R , so that Eq. (130) becomes 

Tix 3 +T, 3 2 +T,3 2 - (131) 

As Dirac 5 has shown, an appropriate wave equa¬ 
tion describing a massive electron in a de Sitter 
space of radius R is 



(132) 

which is a simple generalization of Eq. (40). The 
electron propagator corresponding to Eq. (132) 
will of course differ from the massless propaga¬ 
tor of Table I, but the electron-photon vertex 
and the photon propagator will be unchanged. The 
massive 5-dimensional formalism is not exactly 
equivalent to ordinary massive electrodynamics 
in Minkowski space-time, but as Dirac 5 has shown, 
in any finite neighborhood of rf 5 ~R, Eq. (132) re¬ 
duces to the usual x-space Dirac equation in the 
limit R *». It is only in the completely massless 
case that the 5-dimensional and x-space formal¬ 
isms have the same physical content. 

It should be emphasized that while our electron 
wave equation is identical (in the case m = 0) to 
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Dirac’s, our treatment of the Maxwell equations 
is substantially different. Unlike our expressions 
for the electromagnetic field strengths, which in¬ 
volve a/a rj a only through the angular momentum 
operator L abi Dirac’s expressions 5 involve a /Brj a 
by itself. Hence, in order to avoid going off the 
hypersurface of constant rf, Dirac finds it nec¬ 
essary to introduce homogeneity constraints on 
the electromagnetic potential, of the type encoun¬ 
tered in the 0(5, l)-covariant formalism. In our 
formulation of the 5-dimensional theory, such 


constraints are unnecessary, and an examination 
of the 5-dimensional Feynman rules of Eq. (9) 
shows, in fact, that they are not homogeneous in 
the coordinates. The absence of homogeneity re¬ 
quirements permits eigenfunction expansions of 
the field operators, and should therefore make 
possible a canonical quantization of the 5-dimen¬ 
sional formalism. 16 The first step in canonical 
quantization would be to write down an appropriate 
Lagrangian density; it is readily seen that a vari¬ 
ation of 


£= -A( F «*:) a+ X j*V« # ~ ieA o^~ r io{^' - ieA M)j +2-imfljx (133) 


gives the correct equations of motion. It should 
then be possible to devise a canonical quantization 
procedure which reproduces the Feynman rules of 
Table I from the Lagrangian of Eq. (133), A re¬ 
lated question is that of developing the connection 
between our 5-dimensional formalism and the 
quantization of electrodynamics by ordering with 
respect to x 2 which has recently been developed by 
Del Giudice, Fubini, and Jackiw. 16 

This concludes our discussion of possible ave¬ 
nues for generalization of our results. 17 Let us 
next briefly consider possible calculational ad¬ 
vantages of the 5-dimensional formalism for mass¬ 
less electrodynamics. The key point to notice is 
that whereas the wave operator in Euclidean % 
space is n 2 , with a continuum spectrum -p 2 
- -(momentum) 2 , the wave operator on the hyper- 
Sphere is L 2 -4, with discrete spectrum -2 fyi +1) 
x(w+2). This difference in spectra has two impor¬ 
tant consequences. First, the fact that the spec¬ 
trum of n 2 contains 0 leads to the occurrence of 
infrared divergences in x-space calculations of 
propagators and vertex parts. These divergences 
are known to cancel, however, in closed fermion 
vacuum polarization loops, 18 and this is reflected 
in our ability to map vacuum polarization calcula¬ 
tions onto the unit hypersphere, where the spec¬ 
trum of the wave operator does not contain 0. In 
other words, closed-fermion-loqp calculations on 
the unit hypersphere are manifestly infrared- 
finite. Second, it is difficult to see how to intro¬ 


duce approximations in massless electrodynamics 
when calculating in Euclidean x space, since there 
is no natural scale for selecting one region of p 2 
as being more important than another. [We have 
particularly in mind the calculation of the function 
defined in Sec. I, where no natural scale 
for making approximations is provided by exter¬ 
nal momenta. ] The situation is different on the 
hypersphere, where unity is a natural scale for 
measuring the spectrum -2(w + l)(w+2), and where 
the semiclassical region of large quantum num¬ 
bers, n» 1, provides a natural domain for making 
approximations. The development of techniques 
for making such semiclassical approximations on 
the hypersphere is an important problem, which, 
hopefully, may shed light on the nature of the elu¬ 
sive function F^(a). 

Added Note 

We briefly discuss here two additional topics 
connected with the 5-dimensional formalism: (a) 
the photon propagator in the 5-dimensional analog 
of the Landau gauge, and (b) the hyperspherical 
harmonic expansion of the 5-dimensional electron 
propagator. 

a. 5-Dimensional Landau Gauge. The x-space 
photon propagator in the generalized Landau gauge 
is obtained by adding to the Feynman propagator 
an appropriate multiple of the gauge term 


j 


-■ 9 _ 9 __ 2 I* *>*-.1 

9 <* 2 v 2 ( 1 ^ J ’ 


(Al) 


adjusted so as to eliminate the logarithmic divergence in the electron wave function renormalization Z z . 
Since Eq. (Al) transforms covariantly under inverse radius transformations in x space, we expect that it 
can be directly transcribed into the 5-dimensional formalism, and indeed a simple calculation shows that 
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^ [> M2 2 


(■^i •^2)^ij(^i 

(x^-X 2 ) 2 


Jaytoy) Ja 2 ^ 2 ) (J?I _ V J2 


6 “l«2 2 


fai -V 2 ) ai (V, ~Vl) 

toi -v 2 y 


]• (A2) 


Referring now to Eq. (56b) in the text, we note that 
the gradient terms which guarantee the coordinate- 
inversion covariance of A MiM2 (x i ^ 2 ) behave at worst 
as (#, -x^)" 1 as ^ ~*x 2i and hence do not contribute 
to the logarithmically divergent part of Z 2 . In 
other words, Eq. (56b) is an inversion-covariant 
form of the Feynman gauge photon propagator. 
Similarly, corresponding to the usual translation- 
invariant Landau-gauge photon propagator 

'Va + A h _ o 

(x,-^) 2 (x, -x 2 f L M * M 2 (x,-^) 2 


(A3) 

there is an inversion-covariant Landau gauge pho¬ 
ton propagator 


MiM2 




A _o *2“* 2 ) 

(X I -X 2 ) 2 plM2 (Xj - x,) 2 


Si]. 


(A4) 

Using Eq. (A2) to transcribe Eq. (A4) into the 5- 
dimensional formalism, we find that the 5-dimen¬ 
sional Landau gauge photon propagator is given by 

6 »,»2 A r (Vy.-V2) <h (Vy.-V 2 )*, ' 

(Vr-Vj (Vy-VsT L “ iaa " (Vy-Vtf J ’ 


(A5) 


with A the same constant as appears in the #-space 
version in Eq. (A3). 

b. Electron Propagator Expansion. We consider 
the 5-dimensional electron propagator, and look for 
a hyperspherical harmonic expansion of the form 


i 


^ Ua ' 1, 1 £ l ( °y % * 1 ' 1 ■= i 2 (a • n, -1)(«• v 2 +1) t D n s I'.Ol.ia 


= T^a (a’^-lXa^JJs + DS ( 2 «+3)D„C„ 3/2 (j?i • j? 2 ) . (A6 ) 

1071 n = o 

Using the orthonormality of the Gegenbauer polynomials Cj /2 (/i), we find that the expansion coefficients D n 
are formally given by the logarithmically divergent expression 


(2n+3)D„= -N* dy.{ 1 - fi z ) , 


(A7) 


with 

JV„ = (AS) 

We proceed by explicitly separating out the logarithmic divergence, writing 

(2n+3)D n =A n I + (2n + 3)B n , (A9) 

with 

A„ = -iV n _I C J /2 (l) = - j (2n+3), (A10) 

and with J? n given by the convergent integral 

(2n + 3)B n = -AT,' 1 dfx(l -M 2 ) ^ ■ (AH) 
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Substituting Eq. (A9) into Eq. (A6), we see that the contribution of the logarithmically divergent part is 

m 

(a • Tty - l)(a • J ?2 +1) S (2n + 3)C fa • V 2 )l = ^ (a • V r - D(a * % + D 6 s(Vi -Vi)* > (A12) 

which vanishes since (a • rj l -l)(a •y l + 1) = 0. Thus, D n is effectively given by the integral in Eq. (All), 
which can be evaluated by generating function methods. Up to an w-independent piece [which, as we have 
seen, makes a vanishing contribution to Eq. (A6)], we find 

d - =1+ I + - + 5ti- (A13) 
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We discuss single-fermion-loop vacuum-polarizalion processes in massless quanium electrodynamics in 
the one-phoion-mode approximation, in which the fermion self-interacts (to all orders in perturbation 
theory) by the exchange of virtual photons in a single virtual-photon eigenmode. The isolation of one 
photon mode is made possible by using the 0(5)-covariant formulation of massless QED introduced in 
two earlier papers, in which the photon wave operator has a discrete, rather than a continuous, 
spectrum. The amplitude integral formalism introduced previously expresses the one-mode 
radiative-corrected vacuum polarization in terms of the uncorrected vacuum amplitude in (he presence 
of a one-mode external field. By exploiting the residual SO(3) x 0(2) symmetry of the one-mode 
external-field problem, which permits separation of variables, we reduce the external-field problem to a 
set of two coupled ordinary first-order differential equations. We show that when the two independent 
solutions to these equations are suitably standardized, their Wronskian gives (up to a constant factor) 
the external-field-problem Fredholm determinant. We study the distribution of zeros and asymptotic 
behavior of the Fredholm determinant, relate these properties to the coupling-constant analyticity of the 
one-mode vacuum polarization, and conclude by giving a brief list of unresolved questions. 


1. INTRODUCTION 

We begin in this paper the analysis of a simple, 
nonperturbative approximation to single-fermion- 
loop vacuum-polarization processes in massless 
quantum electrodynamics. In our approximation, 
the virtual fermion in the vacuum -polarization 
loop self-interacts to all orders of perturbation 
theory only by the exchange of virtual photons in 
a single virtual-photon eigenmode. The isolation 
of one photon mode is made possible by using the 
O(5)~covariant formulation of massless QED in¬ 
troduced in two earlier papers, 1 * 2 in which the 
photon wave operator has a discrete, rather than 
a continuous, spectrum. Specifically, our approx¬ 
imation is obtained by replacing the full effective 
photon propagator 

\ V"* i) Tnmfr (7)2) /1 i\ 

(w+ i )(w + 2 ) (1 " 1J 

by the simple, factorizable form 

D ib (v„V2) = iY[Uv t )Y[Uv2 ), ( 1 - 2 ) 

which results when the sum in Eq. (1.1) is trun¬ 
cated to contain only one of the 10 modes in the 
smallest (w = 1) photon representation of 0(5). 
Specifically, the one mode which we retain has 
the form 

(v^t)-v 2 -v^v v,), (1.3) 

where u, and v 2 are arbitrary, orthogonal five- 
dimensional unit vectors, 

iv Wa^O. (!-4) 

10 


and where?] is the five-dimensional coordinate. 

As was shown in Ref. 2, the radiative-corrected 
single-fermion-loop vacuum functional in the one¬ 
mode approximation (denoted by W^A']) is given 
by the amplitude integral formula 



*W to '[(a+a')Y%] , (1.5) 

where W (o) [A] is the single-fermion-loop vacuum 
functional in the presence of an external electro¬ 
magnetic potential A, with no internal-virtual- 
photon radiative corrections (and with the depen¬ 
dence on the electric charge e eliminated by a re¬ 
scaling of the electromagnetic potential). For¬ 
mally, W (o) [A] is given by the expression 

W l0) [A\ =|Tr In h T , 

(1.6) 

/i r = 2- L # S-?a # Tja*A, 

with the anticommuting matrices a and the 0(5) 
angular momentum and spin L and S defined as in 
Ref. 2. If we introduce the eigenvalues of h T 
(which, as we shall see, occur in quadruples /i, 

/i, - /i, - ii) and define the external-field-problem 
Fredholm determinant 

4m=( n mY'\ u.d 

V ail eigenvalues / 

then W {o) can be written as 

W (o) [A]=21nA[A]. (1.8) 

As is evident from Eqs. (1-5)—(1.8) and as was 
developed in detail in Ref. 2, the analyticity prop¬ 
erties of Wy as a function of coupling e 2 are deter- 
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mined by the asymptotic behavior of 

for large external-field amplitude a, or, what is 

essentially equivalent, by the distribution of zeros 

of the Fredholm determinant A in the complex a 

plane. 

Let us now spell out more specifically the con¬ 
nection between the e 2 analyticity of W x and the 
a dependence of W (o) . In order to make Eq. (1.5) 
unambiguous, we must specify the integration 
contour to be used in evaluating the a integral. In 
Ref. 2 we argued that this contour should be taken 
to be along the real a axis, or possibly (and very 
conjecturally) along the imaginary a axis. Equa¬ 
tion (1.5) with real integration contour will be well 
defined if A has no zeros (and hence W (o) no sin¬ 
gularities) for a real. If is asymptotically 
weaker than an increasing Gaussian in a as a 
becomes infinite along the real axis, then Eq. (1.5) 
defines an analytic function of e 2 in the right-hand 
e 2 half plane. If, moreover, A has no singularities 
in the wedge-shaped sectors |Rea|> |lma| and the 
vacuum amplitude W {o) is asymptotically weaker 
than a Gaussian in these sectors, the integration 
contour can be freely deformed within these sec¬ 
tors from its original position along the real axis, 
implying that is an analytic function of e 2 in the 
entire e 2 plane, apart from a branch cut along the 
negative real axis from e 2 -0 to e 2 - Thus, 
for real integration contour the questions at stake 
are: 

(i) Is A zero-free for a real? 

(ii) Is W to) asymptotically weaker than a Gaus¬ 
sian as a -±<® along the real axis? 

(iii) Is A zero-free in the sectors |Rea|> |Ima|? 

(iv) Is asymptotically weaker than a Gaus¬ 
sian as within the sectors? 

In the following sections we present analytic argu¬ 
ments which answer questions (i), (ii), and (iv) in 
the affirmative, and we present numerical results 
(but no proofs) which also suggest an affirmative 
answer for question (iii). Next let us consider the 
speculative possibility of an imaginary integration 
contour. Such a contour is allowed only if two 
conditions are satisfied: A must have no zeros 
for purely imaginary a, and must vanish as a 
decreasing Gaussian (or faster) aso-« along the 
imaginary axis. As shown in Ref. 2, if W (o) oscil¬ 
lates along the imaginary axis with a decreasing 
Gaussian envelope, then the imaginary contour 
yields a strong-coupling electrodynamics in which 
W x exists for large enough e 2 and can develop an 
infinite-order zero as e 2 approaches a positive e 0 2 
from above. Thus, the questions at issue for a 
possible imaginary integration contour are: 

(v) Is A zero-free for a imaginary? 

(vi) What is the asymptotic behavior of W fo) as 
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| a | - <*> along the imaginary axis? 

The analytic arguments which follow answer ques¬ 
tion (v) affirmatively. With respect to question 
(vi) we can only give limited numerical results, 
these show no signs of decreasing asymptotic be¬ 
havior, but, because the asymptotic region may 
not have been reached, do not conclusively resolve 
question (vi). 

The material which follows is organized so that 
a knowledge of the 0(5) formalism is needed only 
to read Sec. H, in which we consider the wave 
equation determining the eigenvalues p of h T . 

[2 -L ' S -iaa-rja- ^ , (1.9) 

and show that separation of variables with respect 
to the S0(3)><0(2) subgroup of 0(5) reduces Eq. 

(1.9) to a pair of coupled ordinary first-order dif¬ 
ferential equations within each separable subspace. 
In the remaining sections, which can be read in¬ 
dependently of Sec. II, we study the properties of 
this differential-equation system. In Sec. Ill 

we recapitulate the results of Sec. II and argue 
directly from the differential equations that A has 
no zeros for a in strips containing the real and 
imaginary axes. In Sec- IV we construct the 
Green’s function of the one-dimensional system, 
and use it to establish a connection between the 
Wronskian of the two independent solutions of the 
differential equations (suitably standardized) and 
the Fredholm determinant A. In Sec. V we use 
this connection, combined with WKB estimates, to 
determine the order of growth of A for large |o|. 

In Sec- VI we construct series solutions for the 
two independent solutions of the differential equa¬ 
tion, and use them to study A(a) numerically. 
Finally, in Sec. VII we briefly summarize the 
many remaining unresolved questions. In Appendix 
A we explicitly calculate the Green's function in 
the free case, and in Appendix B we give the de¬ 
tails of the WKB calculation used in Sec. V. 

II. REDUCTION OF THE ONE-MODE PROBLEM 

In this section we carry out the separation of 
variables which reduces the partial differential 
equation (1.9) to a pair of coupled ordinary first- 
order differential equations. In Sec. II A we de¬ 
termine the conserved quantum numbers of Eq. 

(1.9) , and show that the eigenvalue problem diag¬ 
onalizes with respect to an SO(3)xO(2) subgroup 
of 0(5). In Sec. IIB we introduce a representation 
of the 0(5) generators which facilitates reduction 
of the eigenvalue problem with respect to the con¬ 
served subgroup. The reduction itself is carried 
Gut in Sec. IIC. In Sec. IID, we perform a check 
by solving the free (a = 0) case and verifying the 
eigenvalue degeneracies found in Ref. 2. We also 
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work out the boundary conditions appropriate to 
the separated equations in both the free and the 
interacting cases. Finally, in Sec. HE we make a 
transformation which simplifies the equations in 
the interacting case, and construct the external 
field problem Fredholm determinant introduced 
in Sec. I. 

A. Conserved quantum numbers 

To analyze the conserved quantum numbers of 
Eq. (1.9) we choose axes in the five-dimensional 
space so that the 1 and 2 axes lie, respectively, 
along v v and v 2 . The Hamiltonian in Eq. (1.9) then 
takes the form 

h T = h^ + V, 

h% ) =2-L-S, K=iXa-T)(a 1 T) 2 - a 2 r),), (2.1) 

X= -aUS/lCTT 2 ) 1 ' 2 . 

Introducing the 0(5) generators 


there are also two discrete invariances of h T . 
Defining a coordinate inversion generator P, 


P V P- L =-T1, P a =l, 

(2.8) 

we see immediately that 


[P,h T \=0. 

(2.9) 


Finally, letting a e be the a matrix which anticom¬ 
mutes with a lt ...» a 5 , we have 

l<*e,*r]=0 . (2.10) 

This last invariance permits us to split the eight- 
component spinor eigenvalue problem of Eq. (1.9) 
into two identical decoupled four-component prob¬ 
lems. Diagonalizing the four-component spinor 
with respect to the conserved quantum numbers, 
we write 

+ J35 2 + =-)() + 1 )4>mu . 


J<lb~ La* + S 


ab T '-'ab 

a 


” 'Ha a „ “ *7& a „ 
&Vb &Va 


+ i[Oa, a,] 


( 2 . 2 ) 




( 2 . 11 ) 


we obviously have 

[</„„,/j ( °> 1 = 0 (2.3) 

since the free Hamiltonian is rotationally in¬ 
variant. Furthermore, since 

[L a ft, TJ C ] — T} a 6^ c “ T}^6 ac , 

(2.4) 

we find, as expected, that a * rj is also rotationally 
invariant, 

[Jab, a-Tj] =0- (2.5) 

Hence the generators J tb which commute with h T 
will be just the ones which commute with the factor 
- & 2 V 1 in the potential term. From Eq. (2.4) 
we find trivially that 




As we will see in detail below, the separation 
constants take the values 

. _ j. 3 

J ~~ 2 > 2, • • •» 


m- -j, -j+ 1, . . . ,j . 

t _ i 3 

5 “ ± 2 * ± 2 * •••, 


( 2 . 12 ) 


€ = ±1. 


Our task in the succeeding sections will be to find 
the form taken by the eigenvalue problem of Eq. 
(1.9) when restricted to the subspace of Eq. (2.11). 


[J 3 4, “l»)2 “ <*2*1 ll = [J 33 , “ <VJ J 

= [ J 45. “l^2 “ <VJl] 

= 0, (2.6) 

indicating that k T is invariant under the SO(3) sub¬ 
group generated by J 34 , J 35 , and */ 45 . In addition, 
we have 

[J l2 , ajT) 2 - a 2 T),] = - a 2 r) 2 - 0 , 1 ), + a 2 r| 2 

= 0, (2.7) 

so that k T is also invariant under the 0(2) sub¬ 
group generated by J l2 . The other generators J ab 
do not commute with h T . In addition to the SO(3) 
*0(2) invariance group which we have just found, 


B. Explicit representation of the 0(5) generators 

We introduce now an explicit representation of 
the 0(5) generators which facilitates the reduction 
of the eigenvalue problem with respect to the 
SO(3)xO(2) subgroup. We begin with the spin 
operator S „ 6 = i[ a., a 6 ]. Letting o li2 . 3 , t 1i2#3 , and 
Pi. 2.3 b e three commuting sets of 2X2 Pauli spin 
matrices, we represent the 8 X 8 matrices a,, ... , 
a e in the form 


0t 4 — Pi UgTj , Q! s — P2^3^*2 » 

so that the spin matrices become 


(2.13) 
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^12 “2^3* ^14 = 2 ^Pl ®2 » 

*^24 = zWl & l t *^53 = 2*Pl » 

5 j 5 = “ 2^2 ^2 » *^25 " 2 ^2^1 » 

*^34 " 2^P2 > *^13 = ~ 2 ^P3^2 » 

*^23 = i^Ps^l » ^4 5 " 2^P3 • 


M, = -sin0, —- -cot0,cos0,;^- , 

a d 

M 2 = cos(p l ~ - cote,sin— , 

9 9 

N, = -sin0 2 — -cot0 2 cos<p 2 — , 


(2.14) 


Since the Hamiltonian h T is even in the a matrices 
a,, ... , a 5 , it is a unit matrix in the space of the 
r Pauli matrices. As noted above, this immediate¬ 
ly reduces Eq. (1.9) to two identical decoupled 
four-component eigenvalue problems. 

To represent the orbital angular momentum L ab} 

we parametrize the coordinates tj,.tj 5 in the 

form 

ij, = sin 0 , cos 0 ,, tj 2 = sin 0 , sinp,, 

173 = 0050,00502, I 7 4 = cos 0 ,sin 0 2 cos 0 2 , 
TJs^cosOjSinOaSin^a , (2.15) 


d d 

N 2 = cos 0 2 — -cot0 2 sin0 2 


d<p 2 ’ 


3 9<p 2 ' 


P, = cos0 2 cos0 2 —— csc0 2 sin <£ 2 

9 On 


9 


P 2 = cos0 2 sin <j> 2 —- + esc0 2 cos <j> 2 

do n 


902 ' 
d 

d<P 2 ’ 


(2.17) 



These satisfy the commutation relations and iden¬ 
tities 


0 ^ 0 , ^ |tt , 0 * 0 , ^ 2n , 

0 ^ 0 2 ^ tt , 0 * <p 2 ^ 2 tt 

corresponding to an 0(2) (angle 0,) and an S0(3) 
(angles 0 2 , 0 2 ) combined with mixing angle 0,. In 
terms of these angular parameters, the coordinate 
inversion operation is 


[M, ,Mj] = -M k , [Mj, Nj] = [M t , Pj\ = 0, j 
[ JV,. JVjl = — JV* , cyclic 

[P i ,P,]=N k , [1 V l ,P i ] = -P*, J 

( 2 . 18 ) 

^ 2 = ^ 2 = sin0 2 9e7( Sln02 ae7) + Sin 2 0 2 3^7 ' 


( 0i-* e i. 0 1 -<P 1 + 7T. 

rj-rj «=> < (2.15') 

( 0 2 -7T-0 2 , <p 2 — <P 2 +tt . 


In terms of the auxiliary operators, the orbital 
angular momentum operators take the form 


The hyperspherical surface element becomes 


L l2 - W 3 » 


•^53 ~ N,, 


8 ? 7 i 017 , 877 , 817 , " 
7,1 80, 80, 8 0 2 80 2 


T?2 


dfir, = det 


dM0i<*M<f>2 



d JL 

802 


^34 ~ ^2 , L 45 - N 3 , 

= -sin0 2 cos0 2 M 2 +tan0, cos0,P,, 

L ls = -sin0 2 sin0 2 M 2 + tan0 L cos0 l P 2 , 

(2.19) 

L 13 = -cos0 2 M 2 + tan0,cos0 1 P 3 , 

L 24 = sin0 2 cos0 2 M, + tan0 l sin0 1 P l , 


= cos 2 0 , sin 0 1 d 0 ,(d 0 ,)(sin 0 2 d 0 2 d 0 2 ), (2.16) 

which, not surprisingly, has a mixing-angle fac¬ 
tor, an 0(2) factor (d0,), and an S0(3) *actor 
(sin0 2 d0 2 d0 2 ). By dint of considerable algebra 
one can express the orbital angular momenta in 
terms of derivatives with respect to the angles of 
Eq. (2.15). To write the results in a compact 
form, we introduce auxiliary operators^, N Jt 
P jt )- 1, ... , 3, as follows: 


L 25 = sin0 2 sin0 2 Af, + tan0, sin 0 ,P 2 , 

L 23 = cos0 2 M, + tan0, sin0,P 3 , 

and by using Eqs. (2.17) and (2.18) it is straight¬ 
forward to verify that the expressions in Eq. (2.19) 
satisfy the 0(5) commutation relations 

[ ^a&j L cd J - 6 a c L* db — 6 ad L cb + 6 bc L ad — L ac . 


( 2 . 20 ) 
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Using Eqs. (2,14) and (2.19), it is a simple matter to express the Hamiltonian h T and the conserved gen¬ 
erators J 12t J 53 ,... in terms of the angular parameters. We find 

fir - 2 “ t[M 3 a 3 +N l p l + N 2 p 2 + N 3 p 3 + + M 2 o 2 )( p x sin0 2 cos<p 2 +p 2 sin0 2 sin <£ 2 + p 3 cos0 2 ) 


+ i tanfl^gfa, cos<£, +a 2 sin<^ 1 )(P l p 1 + P z p z + .P 3 p 3 )] , 

V=X sinfl^ag sin^ - cos 0^0 t cos<f> l + a 2 sin^Jf p x sin0 2 cos <£ 2 + p 2 sin0 2 sin <£ 2 + p 3 cos0 2 )] , 


( 2 . 21 ) 


and 

C / 3 = — i J l2 - — i M 2 +2^31 
T 1 = - i t / 53 = - * N t + 5 p,, 

( 2 . 22 ) 

T z — ~ i t / 34 - i N 2 + 2 p 2 , 

^3 ~ ~~ i ^45 - i + 2 p 3 . 


C. Reduction of the eigenvalue problem 

The first step in the reduction of the eigenvalue 
problem with respect to the SO(3)*0(2) subgroup 
is to find the eigenvalues and eigenfunctions of the 
conserved generators in Eq. (2.22). This is, of 
course, just a standard angular momentum prob¬ 
lem. For the 0(2) subgroup we find two eigen¬ 
functions with opposite inversion parity for each 
eigenvalue 5 of U 3 , 


U 3 w 4 = , 




y 





(2.23) 




0 

1 


o 


f 


=j(j +!)«., T 3 v t = mv t , 

Pv t = (- , 

(2.25) 


(; - rn + DP7.-.W* (cos0 2 )e l(m_I/2 '* 2 
P?+i /2 (cos8 s )e l(m * l/2),l> 2 

(j + m)P 7_-‘/ 2 2 (cose 2 )e' ( "- 1/2) ^ “ 

- iT. 1 // (cos0 2 )e i( m+1/2 ^ 


with PfU) the usual associated Legendre poly- 
nomial. The allowed values of j,m are the usual 
ones for spin-£ coupled to an orbital angular mo¬ 
mentum, 

1 a 

J 2 ) 3 ? • • • » 

(2.26) 

m- -j, -j + 1, ... ,j , 

and the subscript p indicates that the spinors are 
acted on by the Pauli matrices pj. In terms of 
the 0(2) and S0(3) eigenfunctions which we have 
just found, the general decomposition of is 

4'j nft =i4+(fl 1 )w+M + +C+(6 l )v_u_ , 

4>m£-€ =A_(0 1 )v + m_ + C_(0 1 )v_M + , (2.27) 

The next step is to substitute Eq. (2.27) into 
Eq. (1.9), using the expression of Eq. (2.21) for h T . 
To find the action of the various terms of h T on 
w* and v ± we use the following identities, which 
may be verified by straightforward calculation: 


The subscript a on the spinors Indicates that they 
are acted on by the Pauli matrices Oj. Because 
the orbital angular momentum -iM 3 must have 
integral eigenvalues, the eigenvalues of t / 3 must 
be half-integral; hence the allowed values of £ are 


<r 3 u i =±u i , 


(o l cos(p l +o 2 sin^> l )u i: =w T , 
(2 -iM 3 a 3 )u ± =(! ±€)w* , 


- i(M l o l +M 2 o 2 ) u ± = ± w* 


+ d*£) cote, 


(2.28) 


l = ±i, ±1 . (2.24) 

For the SO(3) subgroup we again find two eigen¬ 
functions with opposite inversion parity for each 
pair of T eigenvalues j, m, 


( p l sin 0 2 cos <£ 2 + p 2 sin 0 2 sin <p 2 + p 3 cos 0 2 )v* = v ,, 
- 2 N* pv + = -(7 + l)v + , - zN*pv_ = (j - \)v_ , 

P-pv + =(j + |)v. , P *pv_ = - (j-l)v + . 

Hence we get 
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h T il>j m fr = (% + Z)A+v + u+ +(l - (j + i) A + v + w + +(j-i)C + v_u„ 


lde * 


+ (s -£)cot0* 


A + vju~ - 


L de l 


+ (i + Z)cotO l 


C+v+u+ 


-tan0*O' + |).A + ?;_w_ -tan0*(j -|)C + v + w + + Xsin 2 0 1 A + ?; + M + - Xsin 2 0* C + v_u, 
- X sin0, cosfl^ + v.w. - Xsin0* cos0* C + v + w+ 

- /M + v + w + + pC+v_w_ » 

Ar*jw-« = (i -£M_?; + w_ + (i + £)C_?;_w + -0 + i)-A-V* M - + 0 " 


(2.29) 




+ (5 + ?)C0tfl 1 


A_v.w+ + 


lde, 


+ (i- £)cot0* 


C_l> + W. 


+ tan0,O' +fM_v_w + +tan0* (j - |)C.v + m. -Xsin 2 0*A_i; + w_ + X sin 2 0,C_?;_w + 

- X sin0* cos0*A_y_w + - X sind I cosd 1 C.i j +w_ 

= 

= pj4 _V+W_ + liC_V_U + . 

Equating coefficients of like terms then gives us the following two sets of coupled first-order differential 
equations for j 4 4 (0*) and C 4 (0,): 


U -jM* - 


lde, 


+ (| + £) cote, + (j - |)tan0* C + + Xsin0 l (A + sin0* - C + cos0*) = pA + , 


(j + l-«)C + + 


-U+;)A„ + 


L dd t 

d 

lde, 


+ (2 - 0cote* -(j + I)tan0* A + -Xsin0x(C + sine* +A + cose*) = pC + ; 


+ (=? - |) cote* + ( j -1) tan6* C.-Xsine* (sine* A_ + cose*C J = , 


(2.30a) 


(2.30b) 


(; + 1 + £)C_ - 




+ (^ + £) cote* - (j + |)tane* A . + X sine* (sine *C_ - cosfi^AJ = pC_ . 


These two sets of equations can be further re¬ 
duced to just one set of coupled differential equa¬ 
tions by exploiting the fact that 


a % r\h r - -hrcrr] . 


(2.31) 


Since a*7] has odd inversion parity, Eq. (2.31) 
tells us that if ipjmt-e is an eigenfunction of h T with 
eigenvalue p, then a t T]ip jni ^ € is an eigenfunction of 
h T with eigenvalue -p, quantum numbers j, m, £ 
unaltered, but (reversed) inversion parity +€. 
Specifically, writing 3 


a * = [(<*! cos^ +a 2 sin0j)sin & l +<J 5 {p : sine 2 cos<p 2 +p 2 sine 2 sin<p 2 + p 3 cose 2 )cos0*) 

x [A_(0 1 )v + m_ +C_(0 1 )v_m + ) 

= A+(0 1 )v+u+ + C+(6 1 )v_u_ , (2.32) 


we find from the relations of Eq. (2.28) that 

A 

A+ - A sin0, + C_ cos0, . 

(2.33) 

C + = -i4_ cos0* + C_ sin0*. 

From the differential equations [Eqs. (2.306)] sat¬ 
isfied by A_ and C_, we find that A+ and C + satis¬ 
fy the coupled differential equations 


(€ -j)A+ - + (i+ £)cote* + (j -|)tan0* 


C + 


U + 1-?)C* + 


+ Xsin0 x (A + sine* -C + cos0x)- - pA + , 

(2.34) 

+ (i - £) cote* - (j +1) tane*] A+ 


lde * 


A A A 

-Xsin0* (C+ sine* +A + cose*) = - pC + . 
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As expected, these are identical to Eqs. (2.30a), 
apart from the reversal in sign of the eigenvalue. 
Thus, we need only study the one set of equations 
in Eq. (2.30a). 

To find the measure with respect to which two 
eigensolutions of Eqs. (2.30a) with different eigen¬ 
values p, p' are orthogonal, we start from the 
hyper spherical orthonormality condition 

fctojL ««* = 0, 11*11’. (2.35) 

Using the expression for dfl,, in Eq. (2.16), and the 
fact that 

utu_ - u\(o l co3<p l +a 2 sin<f> 1 ) 2 u + 


The differential equations which we must study 
thus are 


(£ - 


^ + (i + £) cotO + (j - £) tan0 | c 


(j + 1 - £)c + 


d (i 
d8 +(5 


+ X sin 6(a sin0 - c cos0) = \m , 

(2.39) 

5) cot0 - (j + f) tan0 ] a 


-Xsin0(c sin0 + acos0) = pc , 




(2.36) 


vlv, = vl(p, sin0 2 cos<^ 2 + p 2 sin0 2 sin<J> s 


+ p 3 cos0 2 ) 2 v 4 


with the measure for orthogonality 



cos 2 0sin0d0(a*a' + c*c') = 0 , 



p*p' . (2.40) 


Eq. (2.35) reduces to 

X V 2 

cos a 0 1 sin0, d0 1 (AlAi + C*C;) = 0 , p * p' 

(2.37) 

which identifies the measure for Eqs. (2.30a). 

Now that the eigenvalue problem has been re¬ 
duced to a single set of coupled first-order differ¬ 
ential equations, the subscripts used in the above 
analysis are no longer needed. To expedite the 
subsequent discussion, let us change notation as 
follows: 

0,-0, 

A + (0J-a(0), (2.38) 

C + (0J-c(0). 


D. Solution of the free (X=0) case 
and check on eigenvalue counting 

Let us now check the reduction leading to Eq. 
(2.39) by solving the differential equations in the 
case of vanishing interaction and comparing the 
energy spectrum with the free-particle spectrum 
calculated in Ref. 2. WhenX^O, the differential 
equations simplify to 


U-j)a- 


L ^+(i+£) cot 0+ (j -i)tan0 


c=pa, 


(j +l-(j)c + 


LdB 


+ (i-£)cot0-(j +|)tan0 


(2.41) 
a = pc . 


Changing the independent variable to w = cos 2 0 and 
eliminating either c or a, we find that a satisfies 
a second-order differential equation of standard 
Riemann type, 4 


j 


d z a + / 3 _1 | 1 \da + 

du 2 \2 u + u-l)du + 

(2.42) 


(J+*)(J+i) 1 U-i)‘ 

4 u 2 4 (w-1) 2 


(j +i)(j + i) + (£-i) 2 +2 +p(l-p) 1 


w(w- 1) J 


a- 0, 


and c satisfies a similar equation obtained from 
Eq. (2.42) by the replacements j—j -1, £-£+1. 
The characteristic exponents of Eq. (2.42) at the 
regular singular points at u = 0 and u = 1 are given 


in Table 1. Equation (2.42) can be solved in terms 
of Jacobi polynomials, giving the following four 
series of eigenfunctions and eigenvalues. 
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a=/(cos 0 ) > + l/2 (sin 0) £ ~ 1/2 

x p^ + 1 * £ * l/ 2 ) (l- 2 cos 2 0 ), 

c = ( cos 0 )'- 1 / 2 (sin 0 ) £ + 1/2 
x p(J, 6 - 1 / 2 ) (1 . 2 cos 2 0). 

Series 1. (2.43) 

li=2n+2+j + Z, n = 0, 1 , 2 , ... 
f= -(w + £ + i)/(w + j +1), 


For each eigenfunction with eigenvalue /i and in¬ 
version parity e obtained from Eqs, (2.43) and 
(2.44), there is another eigenfunction with eigen¬ 
value -/i and opposite inversion parity obtained 
by inverting the transformation of Eq. (2.33) to 
give 

=asin 0 -ccos 0 , ^ 4 - 

C_ - a cosfi + c sinfi . 

Hence the positive eigenvalues of h T are 


Series 2. 

p. = - (2n +1 + j + £), n - 0, 1, 2, . , . 

/=1- 

a=/(cos0) j+l ' 2 (sin0) 1/2 - £ 
x P c n >+ 1,1/2 * e) (l-2 cos 2 0), 
c-(cost?)'* 1 ' 2 (sin0)* l/2 * £ 
x P c n > ;7 l/2 * £) (l-2 cos 2 0). 

Series 3. (2.44) 

=2n+3 + j-£, n = -l,0,l,... 

/mi¬ 
series 4. 

M = -(2n + 2+ j-£), « = 0,1, 2, ... 

/=(»+;-«+!)/(«+ 1). 

These solutions can be verified by direct sub¬ 
stitution into Eq. (2.41), using the following four 
identities satisfied by the Jacobi polynomial 
P<-„ a - B) (x ) 5 : 

(l-x)J^ P £"■ B) (x) = aP<?- B >(x)-(n + B+ ^(x), 

(1 +x)-£p<?- B) (x) = (» + 0)P<“ + B - l >(x)-/3P J" 8) (x), 

(2.45) 

/3(1-x)P ( “' b) (x)-(m +a)(l +x)P ( “- I - 0+l) (x) 

= -2(n + l)P<“-‘- B -)(x), 

2 ic P “ (x) = ( ” + " + ^ + 2 ^ 0 W • 

Let us now count the total degeneracy with which 
the eigenvalue p. =k + 2 occurs. Remembering 
that we have reduced our problem to a four- 
component spinor, the expected degeneracy of the 
eigenvalue p. = k + 2 is 

deg(fe+2) = dim(fe +i i) 

= f(* + l)(*+2)(*+3), 

* = 0,1,2 . (2.46) 


2n + 2+i+l?l l twice each 
2n + 1 +j + | £ \) 

n- 0, 1, 2, .. ., j - 2,2,. 

^2 = » j , I £ I " 2 » 2 » 


(2.48) 


and the degeneracy of the eigenvalue k +2 is 


deg(& + 2 ) = 2 y] ( 2 j + 1 ) 

n.TTtl 

2 n +)+ | ? |s* 

+ 2 X) (2j +1) . (2.49) 

n»>.l £ I 

2rt4->f 1 cl = A 4-1 

The right-hand side of Eq. (2.49) is obviously a 
cubic polynomial in k , which by direct enumer¬ 
ation of the two sums, takes the values 4, 16, 40, 80 
for * = 0,1, 2, 3, respectively. Hence it is equal to 
\{k + l)(k+2)(k + 3), and the eigenvalue-counting 
checks. In group-theoretic language, what we 
have done is to exhibit the decomposition of the 
(* +i, |) representation of 0(5) in terms of states 
labeled by the quantum numbers of the SO(3)xO(2) 
subgroup. 

From Eqs. (2.43) and (2.44), we see that in the 
free case the two-component wave function 


*- 


a 

c 


satisfies the finiteness boundary condition 


(2.50a) 


TABLE I. Characteristic exponents of the differential 
equations for a and c at u = cos 2 0 = O,l. [See the dis¬ 
cussion following Eq. (2.50).] 


Singular point: a-0, 6~iir 

Singular point: 

: u = 1 , 0^0 

a~u° a - (cos 0 ) 2 ° a 

a ~ (1 — «) x a 

= (sin 0 ) 2x a 

C ~U ° c - (C08d) 2ac 

c ~ (1 —U ) X c 

= (sin0) 2x c 

Characteristic exponents 

Characteristic exponents 

Solution 1 Solution 2 

Solution 1 

Solution 2 

° a 4(j+4> -it?+3) 

X a 

1 

■UJl 

—|N 

1 

°C -i0’ + 4) 

Xc i(«+i) 

“s(£ + §) 
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</>~finite at 0 = 0, 0 = ^tt> 


(2.50b) 


and an examination of the characteristic exponents 
in Table I shows that Eq. (2.50b) is equivalent to 
the square-integrability boundary condition 


n r/2 

J cos 2 0sin0d0(|a| 2 + |c| 2 )<°° . 


(2.50c) 


Since the interaction term in Eq. (2.39) is non¬ 
singular at 0 = 0, 0= 2^1 the characteristic ex¬ 
ponents of the differential equation system at 0-0, 
0 = \ tt are the same in the interacting case as in 
the noninteracting case. Hence the boundary con¬ 
dition in Eq. (2.50), which we inferred from the 
free solution, is appropriate to the interacting 
case as well. 

E. Reduction of the interacting case and construction 
of the Fredholm determinant 

It is convenient, for the work which follows, to 
reduce the coupled differential equations of Eq. 
(2.39) to a somewhat simpler form. We work with 
the two-component spinor notation of Eq. (2.50a), 
and write Eq. (2.39) in the matrix form 


Hip - flip . 


(2.51) 


Introducing Pauli matrices t,, t 2 , t 3 which act on 
the spinor ip, it is easy to see that H may be 
written as 

H - (j -5 — Asin 2 0 +£)t 3 

-[£ cot0 + (j +-|)tan0 + Asin0 cos0]t, 


+1 cot0 -tanflj t 2 . 


(2.52) 


We now make a similarity transformation on Eqs. 
(2.51) and (2.52), writing 


ip=Sip R , 

H^SHxS- 1 , 

S= (cos|0 -^T 2 sin|0)[(sin0)^ 2 cos0]' 1 
The transformed eigenvalue problem is 

Hr*Pr ~ V-^R * 

Hr = -[(jfsinfl)" 1 +Asin0]T, 

- (j +i)(COS0)* 1 T 3 -^ T 2 . 

The measure for orthogonality is now 


(2.53) 


(2.54) 


r w/2 

dQ $RipR “0, 
J o 


(2.55) 


and the boundary condition is 

ip R ~ (sinfl^^osOx finite at 0 =0, 0=i7r, 


or equivalently 


J ^n/2 

d&\ipi 

0 


< OO 


(2.56b) 


To construct the external-field-problem Fredholm 
determinant, we display the parameter dependence 
of the eigenvalue m by writing 




(2.57) 


so that the Fredholm determinant within the sep¬ 
arable subspace takes the form 


Aj^(X)- A A p.^(A) 

all eigenvalues b J 
in subspace 

=det [H r ] . 


(2.58) 


Remembering that for each eigenvalue ji £ >(*) 
there is an eigenvalue -ji £ >(A) coming from eigen¬ 
functions with opposite inversion parity [see the 
discussion following Eq. (2.31)], and that there is 
an additional duplication of eigenvalues when we 
reconstruct back to eight-component spinors, we 
find that 

n m - IT n 

all eigenvalues 1/ 2,3/2. . . . £ = ±l/2,±3/2.... 


x n [M t( wr n • 


(2.59) 


all eigenvalues 
in subspacc 


Thus, comparing with Eq. (1.7), we see that the 
full external-field-problem Fredholm determinant 
is given by 

aw= n n • 

>=1/2,3/2.. . . C=±l/2,±3/2,. . . 

(2.60) 

One further transformation of this formula proves 
to be useful. From Eq. (2.54), we see that if 


HRip - M £ >(A)i/> , 


then 


Hr £ —£ T i^ ^uW r i 1 P • 

X--X 


( 2 . 61 ) 


(2.62) 


Since t, 2 = 1, we conclude that the sets of numbers 
{-ji u (A)},{ji_ u (-A)} are identical. Hence 

IT M-£>W" H (—)M£>(“^)» (2.63) 


all eigenvalues 
in subspace 


all eigenvalues 
in subspace 


permitting us to eliminate the negative -4 factors 
in Eq. (2.60). Dividing out a[ 0] to eliminate an 
irrelevant (and infinite) constant factor, we get 
finally 

A[A] _ y-r TT l~ A £i (A)a (—A) 1 2i * 1 

A 0] >=1/ 2.3/2,. . . £ = l/2,3/2. Ag^(0) 


(2.56a) 


(2.64) 
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Equation (2.64) is still a formal expression, in that 
renormalizations have not yet been made. In Sec. 

V below we discuss the modification of Eq. (2.64) 
which is made necessary by renormalization sub¬ 
tractions, and which guarantees convergence of 
the infinite product. 


111. ZERO-FREE STRIPS 

For the benefit of the reader who has skipped 
Sec. II, we briefly recapitulate the principal re¬ 
sults derived there. In terms of the effective ex¬ 
ternal-field amplitude 

‘HiST 

we found that the external-field problem could be 
reduced to the two-component eigenvalue problem 
(T 1( 2 g = Pauli matrices) 

Hip = , (3 2) 

H = -[£ (sinfl)* 1 + Xsin0]r l 

- (j +4)(cos0)' 1 t3 —i t^ , 


with the measure for orthogonality 

f dOip^ip' =0, (3*3) 

do 

and the boundary condition 

ip~ (sine^cosG x finite at 6 =0, %tt . (3.4) 

Defining the Fredholm determinant corresponding 
to Eq. (3.2) by 

Agj(^)" II (3-5) 

all eigenvalues 

we found that the full external-field-problem 
Fredholm determinant introduced in Eq. (1.7) is 
given (up to renormalization subtractions) by 


A 

[dJ 

- 17 

IT 

^a e ,(x)a £ ,(-x) 1 

A 

[0] 

JL 1 

>=1/2.3/2,. , . 

JL X 

C*l/2,3/2,. . . 

L A U (0 f 


(3.6) 

The reminder of this paper is devoted to a study 
of the mathematical properties of Eqs. (3.2)—(3.6). 

We begin by showing that A e> (X) cannot vanish in 
strips in the X plane containing the real and imag¬ 
inary axes. From Eq. (3.5) we see that zeros of 
a £> (X) occur at values of X where Eq. (3.2) has a 
vanishing eigenvalue, that is, where 

Htp = 0 (3.7) 

for nonvanishing, normalizable ip. To get our first 
restriction on the locations of zeros, we multiply 
Eq. (3.7) by ip' r T l and integrate, giving 


r n /2 

- [Ksinfl)* 1 +Xsin0]i/> ~ipd6 +iR x =0 , 

J ° (3.8) 

r n /2 

+ 1) (cos 6)- l ip'Tjpd6 

d0 

+ L ^ i^Ze) 4,(16 ’ 

Using the boundary condition of Eq. (3.4) to inte¬ 
grate by parts, we readily see that R, is pure 
real. Hence taking the real part of Eq. (3.8) gives 
the relation 


-ReX L n/Z (sine)‘ 1 ip' r ipd6 


4 


S ± 2 . 


f* 2 si ndip^ipdO 


1 . 


(3.9) 


We learn from this relation that a[A] has no zeros 
for X in the strip |ReX| and in particular no 
zeros on the imaginary axis. To get a second re¬ 
striction on the locations of zeros, we multiply 
Eq. (3.7) by ip t T s and integrate, giving 


r /2 

i I \sinO ip~T 2 ipd6 

Je\ 


J mfz 

(cos 6)~ l ip^ipd8 + iR 2 = 0 , 
o 


(3.10) 


r T/2 y-TT/2 / J \ 

#2 = -^ (sin0)* 1 ^ t T 2 ^rf0 - J ip t T l Jq ) ipd8 

Again, the boundary condition of Eq. (3.4) implies 
that R 2 is real, so taking the real part of Eq. 

(3.10) gives the second relation 


rr /2 


imx (cos ey l ^ii,de 


+ r> 


... ‘'O 

r “ ir/ 2 , 


J* ainQ ip*T 2 ipdQ 


(3.11) 


Since t 2 has eigenvalues ±1, we have the inequality 
\^T 2 ip\ and so Eq. (3.11) implies the in¬ 

equality 

tt/2 

[cosv)'*ip 'ipav 

>1. (3.12) 


iimx| > f^'^cosey^ipde 


d + 2 


f n 2 sin0 ipfipdO 


Thus A [A] can have no zeros for X in the strip 
|lmX| < 1, and in particular no zeros on the real 
axis. Combining the restrictions of Eqs. (3.9) and 
(3.12), we get the regions in the X plane where A £ ,(X) 
is allowed to have zeros, as illustrated in Fig. 1, 
Note that the absolute value sign in Eq. (3.12) can¬ 
not be removed. In fact, since the Hamiltonian H 
is Hermitian for real X, a u (X) is a real analytic 
function of X and satisfies the reflectien principle 

A £J (X)*=A £J (X*) . (3.13) 

Hence for each zero X of A U (X), there is a corre¬ 
sponding zero at the complex-conjugate point x*. 
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IV. WRONSKIAN FORMULA FOR THE 
FREDHOLM DETERMINANT 

We proceed next to derive a connection between 
the Fredholm determinant in each separable sub¬ 
space and the Wronskian of two suitably standard¬ 
ized independent solutions of Eq. (3.7). In Sec. IVA 
we construct the Green’s function for if, introduce 
the standard solutions, and discuss their analyti- 
city and rate of growth in X. In Sec. IVB we prove 
the connection between the Wronskian and the 
Fredholm determinant. 

A. Green's function and standard solutions 

Let H =H(6) be the Hamiltonian of Eq. (3.2), and 
let S-H -1 be the Green’s function satisfying 

ff(0i)S(e 1 ,0 2 )-6(e 1 -e 2 )i, 

O<0 lf 0^77 , (4.1) 


dw _/. d \ /. d \ T 

d$ ^ 2 ( ITa d0^7 ( IT2 d0^ 2 j^ 

= -^ 2 T {[£ (sinej-’ + Xsin^Tj +(j +i)(cos0)~ 1 T 3 }^ 1 


with 1 the 2x2 unit matrix. To construct an ex¬ 
plicit expression for S, we introduce the solutions 
ip x , ip 2 of Eq. (3.7) which are regular at 0 =0, 0 =£tt, 
respectively: 

Hip x =Hip 2 = 0 , 

^ ^ ~ (sin0) I/2 x finite at 0 =0 , (4.2) 

ip 2 = - COS0 x finite at 0 . 

We also need the Wronskian of the two solutions, 
defined by (the superscript T denotes transpose) 

M = zJpi = (0 )c, (0) - a, (0 )c 2 (0) . (4.3) 

Since 


ipRltfainS)" 1 +Xsin0]r 1 + (j + s)(cos 0 )" 1 t 3 } t i/j 1 = 0 , (4.4) 


the Wronskian is 0-independent. Applying the 
method of variation of parameters,® we then find 
the following expression for S: 


2 

*2 


S(0 6 )-w~ 1 X 1 * &\ < Q 

To verify Eq. (4.5), we note that 

H(e l )S(e l ,e 2 )=o, e x <e 2 , e x >e 2 ; 

dB 1 H(f) 1 )S{e !, 0 2 ) —► -iTgW 

€ -*0 

X ^2(®2)^ 1 (^ 2 ) 
-ipyie^ipzie^} 


1 


V e 


-1 


0 o — € 


(4.5) 


(4.6) 


° - 1 ) «r> ( ° W 

10/ \-w 0 


= 1 , 

as required. In Appendix A, as an illustration of 
this construction, we give a formula for S in the 
noninteracting (X = 0) case. 

Up to this point the normalization of ip x and ip z 
has not been specified, and it is obviously im¬ 
material for the construction of Eq. (4.5). How¬ 
ever, for future use we now standardize the nor¬ 
malization by requiring that 

^ as 0-0 , 

as 6 


conditions which, as we shall see explicitly be¬ 
low, can be satisfied by taking the leading terms 
in the series developments of ip x (ip 2 ) about 0=0 
(0 to be X-independent constants. 7 Equation 
(4.7) uniquely specifies the X dependence of ip lt ip 2 , 



♦ 

Im \ 



Re\—£ 


Im\= j+1/2 


Re\— 


Im \*-( j+1/2) 


FIG. 1. Regions in which A C> (X) can have zeros ac¬ 
cording to the inequalities of Eqs. (3.9) and (3.12). We 
assume 4>0. 


(4.7) 
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and w y leaving arbitrary only a A-independent 
normalization factor. It is now possible, by 
straightforward majorization arguments, to prove 
the following result: The standardized solutions 
ib, P are entire functions of X, bounded for large X 
by e c ’ x \ with c an appropriate constant. 

B. Proof of the connection 

To connect the Fredholm determinant A g> (A) 
with the Wronskian, we start from the formal re¬ 
lation 8 

InA %J (A) = T r InH 

= Tr lnj-^fsinfi)" 1 +Xsin0]T 1 

-U +j)(COS0)- I T 3 -Z Jg t 2 | , 

(4.8) 


from which we get by differentiation 


A £ j (X) 


-i ^A ej (X)_ 


d\ 


= Tr 


gm • 


(4.9) 


Substituting Eq. (4.5) for S^H" 1 and evaluating 
the trace, we find 

,-i dA,,(X) _ 
dX 




-fl /2 QfJ 

=w W'J detf — h. 


(4.10) 


I 


ir/2 




. 6 . 


° A 6,-0 Jo, 

e 2 i t/2 1 


We next show that the numerator on the right-hand 
side of Eq. (4.10) is just equal to dw (\)/d\ when 
ip x and ip 2 are taken to be the standard solutions. 

To see this, we start from Eq. (4.3) for w, which 
yields 


dw (X) _ 8^ T 

dX ax ax 


(4.11) 


Letting 0 t and using Eq. (4.7), only the second 
term on the right-hand side of Eq. (4.11) survives, 
giving 


dw (X) r ._ 3 ip. 


dx 


~lp2^2 


3 A 


e-jr /2 


(4.12) 


To proceed we consider the integral appearing in 
the numerator of Eq. (4.10), 

r it/ 2 i it afJ 

I lim I de $TT to ■ < 4 ' 13 ) 

Jq 9 A 0i-O J&, dX 

e 2 -jr/2 

By differentiating the equation - 0 with respect 
to X we get 


3X n 3X 


(4.14) 


and substituting this into Eq. (4.13), using the ex¬ 
plicit form of H and integrating by parts, we find 


= lim - f 2 d0ip%\-i ■^•T 2 -U(sin0)" l +XsinejT 1 -(j + i)(cos0)" 1 T 3 |- 

6j —0 Jdi I ) dX 


e 2 -ir/2 


1 0 0 

2 - f 2 d0(Hip 2 ) T <p l 

6, Je, 

a * — j 4 x 


e 2 — tt/3 

= iih T T 
^2 T 2 ax 

dw(X) 


0—tt/2 


(4.15) 


giving the desired result. Substituting Eq. (4.15) 
into Eq. (4.10) we get, finally, 

A {J (X)-> , (4.16) 

which on integration gives the connection between 
the Fredholm determinant and the Wronskian, 


that A £> (A) is also entire, as expected for a 
Fredholm determinant. Obviously, A g> (X)will 
also have exponentially bounded growth at infinity; 
the precise asymptotic form of A U (A) will be 
given below. Equation (4.17) will be of great utility 
in the subsequent sections, where it will allow us 
to study A £> (A) by applying WKB and series ex¬ 
pansion methods to the solutions of Eq. (3.7). 


A^fX) _ w (X) 17 ) V. ORDER OF GROWTH OF A t /(X) AND A[v4 | 

A u (0) to (0) ’ 

In this section we give more precise results 
Since ip lf are entire functions of a, we conclude concerning the large-x asymptotic behavior of 
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A £> (X) and of the full external-field-problem 
Fredholm determinant A[A]. In Sec. VA we pre¬ 
sent a WKB formula (derived in Appendix B) giving 
the asymptotic behavior of A £> (X). Using this 
formula, we determine the asymptotic distribution 
of zeros of A u (x). In Sec. VB we discuss the re¬ 
normalization subtractions needed to make the 
infinite product for A [A] convergent. Using our 
knowledge of the distribution of zeros of A, com¬ 
bined with results from the theory of entire func¬ 
tions, we determine the order of growth of the re¬ 
normalized determinant A [A] for large external- 
field amplitude X. Combining this estimate with 
the absence of zeros in a strip containing the real 
axis, we show that the real amplitude integral 
contour discussed in Sec. I yields a function of e 2 
analytic in the right-hand e 2 half plane. 


A. Asymptotic behavior of Ae/(X) 


As we have Been above, A £> (X) is given by the 
Wronskian of two suitably standardized indepen¬ 
dent solutions of the differential equation Hip- 0. 

In the limit when |x| is large, or more specifically, 
when the inequalities 



(5.1) 


are satisfied, we can apply WKB methods to cal¬ 
culate approximate solutions of the differential 
equations, and hence to get the asymptotic form 
of A U (X). The calculation, whichisoutlinedinAp- 
pendix B, gives the result (valid for £ >0) 


~ r(2(j +1)) r(s) + 2()+1/2) ()+1/2) 

A„( 0 ) r(M) ro + i) r(4 + i) 

ej«l 


X 


[e x + e - x (-ir « +1 2 - 2 ' {+1/2) (7 + *)r(« + c+1/2 >] 


(5.2) 


showing that the entire function A gJ (X) is of expo¬ 
nential type. One special case of Eq. (5.2) is 
worth noting. When; —-j, Eq. (5.2) reduces to 


A^(X) 


A„(0) 


• 1/2 €,«1 

€ 2 «! 


(5.3) 


we will show in Sec. VIA below that this is an 
exact, and not just an asymptotic, result. From 
Eq. (5.2), we can calculate the asymptotic distribu¬ 
tion of zeros of A £> (x) by solving the equation 

0 = e x + e- x (-D >+6+1 2- 2(6+l/2) (; +i) 

xr(£ + i)X~ (£+l/a) , (5.4a) 

which we rewrite in the form 


e 2X (-X ) c i = e c 2 

Ci=t + 2, 

c 2 - -2(4 + 2 )ln 2 + ln(j + £) 
+ lnr(£ + 5 ) - iir(j + f) 


(5.4b) 


Neglecting terms which vanish for large X, the 
solution is 

X ximi + ic 2 - l^lnt-Trwi) 

+ ^ ln (^p) + 0(£J) t 

/f+~\ ^ 5 ' 5 ^ 

ReX»£te+i»n(S_j2 j + ott,;) , 

ImX * Tin + 0(£,j) . 

In the region of validity of Eq. (5.5), where \n\ 


»£, we see that ReX is asymptotically negative, 
as required by the inequality of Eq. (3.9). The 
occurrence of zeros in complex-conjugate pairs 
is also apparent from Eq. (5.5). 

For application in Sec. VB, it is convenient to 
give the zeros of a £> (X) an index k which arranges 
them in order of increasing magnitude: 

X £> ^general zero of a £> (X) , 

(5.6) 

|Xi u |<|x!V|x!>|<— . 

For large k the index defined this way can be iden¬ 
tified (up to a factor of two, since the zeros occur 
in complex-conjugate pairs) with the positive in¬ 
teger \n\ appearing in Eq. (5.5). Since the effective 
expansion parameters in the WKB procedure are 
thus 


t I j j 

w\~a 


(5.7) 


we expect the following bounds on |X ft e> | to hold 
uniformly in £ and ; : 


Ai * (77V +4+i)AJ} a ) i/a SA2 ’ 

(5.8a) 

(; a + « 2 ) ,/2 « |X^|«A J (; a + ^) 1/2 ) (5.8b) 


for suitable constants A 1<2>3 and C. [Equation 
(5.8b) also incorporates the lower bounds of Eqs. 
(3.9) and (3.12).] We have not constructed a proof 
of Eq. (5.8), so these inequalities should be 
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considered a conjecture, suggested by the WKB 
analysis, on which some of the arguments of Sec. 

V B are based. 

B. Order of growth of A\A \ 

We are now ready to examine the asymptotic be¬ 
havior of the full external-field-problem Fredholm 
determinant A [A], given by the product formula 
Eq. (3.6). First we must deal with the question of 
renormalization subtractions alluded to above. By 


L, ADLER 10 

dividing out A u (0) 2 in Eq. (3.6), we have eliminated 
the most divergent vacuum diagram illustrated in 
Fig. 2(a). However, the second-order diagram 
shown in Fig. 2(b) is also divergent, and must be 
eliminated by a further subtraction. To do this, 
we write the small-X expansion 

A t At!w ( ~ x) =1+A{/2+o( ^ ) ’ (5-9) 

and then define the renormalized Fredholm deter¬ 
minant a [A] by writing 


A[A]=e°' x > IT 

>»!/ 2,V2,. 


n 

C = 1/2,V2,. 


L a u (0) 2 


2 j+ 1 


(5.10) 


In this expression 
q(x) = q 0 + q 2 x 2 


(5.11) 


is a polynomial which expresses the fact that the 
renormalization counterterms always have an un¬ 
determined finite part. To see that Eq. (5,10) is 
the correct recipe, we note that the renormalized 
vacuum amplitude, which according to Eq. (1.8) is 
proportional to 


ln£[A] = Q(X) +1 iia[A] - lnA[0] 





A u (0) 
giving 


xyj exp (xF. 


(5.13) 


A ti (A.)A Ej (-X) 

A»(0) 2 


n 


i * 2 1 

l 'Wf\ 


= 1 -* 2 E £T7)5 +°(* 4 ) • (5.14) 


From Eq. (5.14) we identify A gj as 


(5.12) 

now receives contributions only from the conver¬ 
gent vacuum diagrams illustrated in Fig. 3. 

Let us next rewrite Eq. (5.10) in an alternative 
useful form. Since a £> (X) is an entire function of 
exponential type, we can use the Hadamard factor¬ 
ization theorem 9 to write it as an infinite product 
in terms of its zeros , 


ja * (5.15) 

Let us define an additional constant B tJ by 

firry (5.16) 

ft ft 

and combine Eqs. (5.13)-(5.16) to rewrite Eq. 
(5.10) as 


A[A]=e°(M e -Bx 4 /2 i>(x)j 


PM- n 

j= 1/2.3/2. 




2>+l 


- n 

all zeios 
X y oi 

*= E E (2) + 1)B u . 

> = 1/2.3/2,, . . C 8 l/ 2,3/ 2, . . . 




(5.17) 


The constant B is the contribution of the fourth- 
order graph which appears as the first term in 
the series of Fig. 3, and hence is finite. The sec¬ 
ond expression for P(\) in Eq. (5.17) has the form 
called a canonical product in the theory of entire 
functions 9 ; Eq. (5.17) thus expresses &[A] as a 
canonical product multiplied by the exponential of 
a fourth-degree polynomial in X. 


o o 

(a) (b) 

FIG. 2. (a) Divergent vacuum diagram which is re¬ 
moved by division by A £> (0) 2 in Eq. (3.6). (b) Divergent 
vacuum diagram which is removed by the factor 
exp(-^4 u X 2 ) in Eq. (5.10). 
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Let us now introduce some further concepts from 
the theory of entire functions. 9 Let /(x) be an en¬ 
tire function of the complex variable X. Its maxi¬ 
mum modulus M(r) and minimum modulus m(p) 
are defined by 

M(r)= max |/(re l0 )| , 

0^8=52* 

(5.18) 

m(?)= min |/(re i0 )| . 
ose=s2ir 

The order p of /(X) is defined to be 

.. lnlnAf(r) /r . . 

p = limsup—- : (5.19a) 

r ^co lnr 

if / is of order p it is asymptotically bounded by 

|/(X)|«Ae fl l x l p (5.19b) 

for suitable positive constants A and B. Finally, 
let{r„~|xj} be the sequence of moduli of the zeros 
K of /(X), arranged in increasing order. The 
smallest number o for which 


T m — 5 <°°, for all a> a (5.20) 

U= 1 

is called the exponent of convergence of the se¬ 
quence. According to the theory of entire func¬ 
tions, the order of an entire function is closely 
related to the exponent of convergence of its zeros. 

To determine the order of &[A], we wish then 
to calculate the exponent of convergence of the 
zeros X v appearing in Eq. (5.17). Remembering 
that all zeros X^ occur with multiplicity 2 j +1, 
we consider the sum 


s a = E 

j= i/2,a/2,. 


(2 j+D E E 

c* i/2,3/2,... ~ 


1 


(5.21) 


In estimating the convergence properties of S w it 
obviously suffices to replace the sums in Eq. 
(5.21) by integrals. We first show that Eq. (5.21) 
is convergent for a>4. Using the lower bounds 
obtained from Eq. (5.8), 


nkA l * |X 2 W | , k>k 0 

u 2 +e) U2 ^ ix»i, k<k 0 

we get the estimate 


(5.22) 



c(f+e ) 1/2 r dk 

0' 2 + l 2 )“ /2 + h u *,^ /2 


C' 

= (^2 + ^zya-i)/2 » (5.23) 


O • o-" 

FIG. 3. Convergent diagrams which contribute to 
Eq. (5.10). 



4C'2“ -4 

(a -3)(a -4) <0 ° 


» 


(5.24) 


as claimed. Next we show that Eq. (5.21) diverges 
when a-*4. Since (lnx)/x ^ 1/e for x ^ 1, the upper 
bounds in Eq. (5.8) take the form 


\f'l «A 2 7T 




k >k 0 





S 


2\l/ 2 


k <k Q 


(5.25) 


giving, by a procedure identical to that in Eqs. 
(5.23) and (5.24), the estimate 


C" 

s “ s (TIT) ’ 


C">0 


(5.26) 


We conclude that S u diverges for a = 4, and that 
the exponent of convergence of the zeros of 
£[A] is o =4. 

From the fact that o =4 we can immediately con¬ 
clude that the order of the canonical product P(X) 
is 4, and hence that the order of &[A] is less than 
or equal to 4. 9 If the order of &[A] were actually 
less than 4, then the sum in Eq. (5.21) would con¬ 
verge 9 for exponents a smaller than 4, which we 
have seen is not the case. So we conclude that the 
order of &[A] is precisely 4. 

Let us now use these results to determine the 
convergence properties of the amplitude integral 
when taken along the real contour. Since a 6> (X) 
cannot change sign on the real axis, all of the 
factors in Eq. (5.10), and hence &[A] itself, are 
positive for X real, and so lnA[Aj is real. Since 
the maximum modulus of A [A] is bounded as in 
Eq. (5.19b) with p =4, we have 

ln£[A]<R|x| 4 (5.27) 


for an appropriate positive constant B . In order 
to restrict ln£[A] from below, it is necessary to 
have a lower bound on the minimum modulus of 
£[A]. We get this by using the following theo¬ 
rem 9 : “Let P(x) be a canonical product of order 
p. About each zero X v (|xj >1) we draw a circle of 
radius l/|xj a , a>p. Then in the region outside 
these excluded circles, |P(X)| > expt-r***) for e > 0 
and for r > r 0 (e, a).” To apply this theorem, we 
note that the sum of the radii of all the circles is 
just S m and can be made smaller than 1 by choos¬ 
ing a large enough. Since &[A] has no zeros in 
the strip |lmX| <1, the entire real axis then lies 


so that 


434 


Adventures in Theoretical Physics 


2414 


STEPHEN L. ADLER 


10 


in the region outside the excluded circles, and so 
we learn 

ln£[A]> -|x| 4+E , |x| >r 0 (5.28) 

for r 0 appropriately large. Taking Eqs. (5.27) and 
(5.28) together, we see that |ln£[A]| is poly¬ 
nomial-bounded on the X-real axis. The Gaussian 
factor in Eq. (1.5) then guarantees that the ampli¬ 
tude integral converges when taken along the real 
axis, provided that Ree 2 >0, and thus defines a 
function of e 2 analytic in the right-hand e 2 half 
plane. Note that this conclusion does not depend 
on the fact that S[A] is of order 4, but only re¬ 
quires the weaker statement that the order of 
&[A] is finite, which is known to be true 2 indepen¬ 
dent of the validity of the inequalities in Eq. (5.8). 


fli = (tanj 6) i f(x), 

c t =tan0(tani 0) t g(x ), (6.3) 

x = l- — . 

COS0 

In terms of the new variables the coupled equa¬ 
tions become 


df X/ / . 1X A 

Tx + oTxr (] " )8=Q ’ 


x(2 - x) 


d s 1 , 2 1 

L dx (l-x) 2 J + ^ 


(6.4) 

+ 1 -x)g+()+ 2)/=0. 


We now look for a power-series solution in the 
form 


VI. NUMERICAL RESULTS 

We turn next to numerical studies of A u (x) and 
A[Aj. In Sec. VIA we derive power-series expan¬ 
sions for the standardized solutions ^ and ^ 2 . The 
circles of convergence of the two series which we 
obtain overlap, allowing one to compute the Wron- 
skian, and hence A c> (x), by picking 6 to have any 
value in the overlap region. In Sec. VIB we nu¬ 
merically study the location of low-lying zeros of 
A U (X), and find that there are no zeros in the sec¬ 
tors | Rex | > | ImX |. Consequences of this fact for 
the coupling-constant analyticity properties of W x 
are discussed. Finally, in Sec. VIC we give nu¬ 
merical results for the behavior of the vacuum 
amplitude as X increases along the imaginary axis. 


/ = £*"/., g='£2’fgn- (6.5) 

n =o n= o 

We find that Eqs. (6.4) are satisfied if we take 

fn=gn=0 (n< 0), /„=- 2({ + i), g 0 = (i+i), 

fn+ 1 = ~ 

+ U + *)(gn-2g n . l +gn.2)\ gj 

*-> = 2^ 24 + 3 [(5w+4 ^ + 3 + 2x) ^ 

- (4w + 24 - 1 + X)g n , l + (n - l)^ rt . 2 

- U + *)(/,+ , - 2 f„ +/„_,)], n> 0 . 


A. Power-series solutions 

Substituting 



into Eq. (3.7) and writing out the coupled differ¬ 
ential equations for the two components, we get 

— - [£(sin0) -1 + Xsin0]a + (j + 2 )(cos 0 ) _l c =0, 


(2) Solution ip 2 regular around G = ^ir. In this 
case we make the substitution 

“■“(lTrfSfl) [h(y)+co&el(y)\, 

c ‘ = Vr+slnfl ) L*Cy) - c °te/Cy)J, (6.7) 


sin0 ' 

The coupled differential equations now become 


( 6 . 2 ) 


^■+[£(sin0)“ 1 + Xsin0]c + (j + i)(cos0) -1 a=0 . 

To construct power-series solutions regular 
around 0=0 and 0 = z7T we make the following 
changes of variable, motivated by the form of the 
noninteracting (X=0) solutions presented in Ap¬ 
pendix A. 

(1) Solution ip x regular around 0 = 0. We substi¬ 
tute 


dy K (1 - y? 


y(2-y)^ + [2(j + i)-y]i + [t 


1 = 0 


(1 - y ) : 


( 6 . 8 ) 

h = 0 . 


Assuming power-series solutions in the form 


CO 


h= Y, h > 


n= o 


n-A 


we find the solutions 


(6.9) 
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h n = t n = 0 (n<0), h 0 = - 2(j + 1), i 0 =5 + X, 

= ^71 [2«* n - (n - D/in-, + (« + \)l„ + ^„_ a - 2/..,)], 
in+1 = 2 n + 2j + 4 f (5 ” +4 ^ + 5) l ” ~ < 4n + 2 l) *«-!+(« " 1) i B . a " (X+ + S(2fe„ -/Jn.jj, 


( 6 . 10 ) 


A number of observations about the above solu¬ 
tions are now in order. First, we note that since 


Eq. (6.16) corresponds to singular points in the 
x variable at 


»£a = SSa = 3 Al 

BX BX BX 



(6.11) x = l± (l + J)‘ l/2 , 


(6.18) 


and since l in Eq. (6.7) appears multiplied by the 
factor cot0, which vanishes at & = i tt, the standard¬ 
ization conditions of Eq. (4.7) are satisfied. Sec¬ 
ond, we consider the greatly simplified form of 
the above equations when j j. Working directly 
from Eq. (6.2) we find in this special limit the de¬ 
coupled equations 


— - [ ^ (sinfl) -1 + X sine] a = 0, 
^ + [£(sin0) -1 + x sinfl] c =0, 


( 6 . 12 ) 


which can be immediately integrated, giving 


fl l = -2(tanie) t (£ + 5)e x(1 - cos ^, 

c i ~0, 

a a = -(tanid)«e- XTO#e . 
c 2 ~ - (tan^e)"* e Xco * 9 . 


(6.13) 


Hence the Wronskian is 

w(\)=a 2 c 1 -a l c 2 

= -2({+i)c\ (6.14) 

giving for the j — - 2 limit of the Fredholm deter¬ 
minant the result 


which did not appear in the x form of the equation. 
Hence the singularities in Eq. (6.16) must be re¬ 
movable, and a direct calculation shows this to be 
the case. We conclude, then, that the power- 
series solutions for tl> L and have the following 
regions of convergence: 

^converges for |x|< l«l^cos0> 2*^*0 <s G<±tt, 

(6.19) 

if 2 converges for |>|< 1«“*1 ^sinfl > 6 tt . 

Thus, in the angular range -J-7r< 0 <■£ ?r both power 
series are convergent, and so we can calculate 
the Wronskian from Eq. (4.3) by taking 6 to be 
any value in this interval. Since the Wronskian 
is 0-independent, a powerful check on both the 
programming and the absence of serious round¬ 
off and truncation errors is obtained by calculat¬ 
ing W for two different values of Q in the allowed 
range and then checking that the same answer is 
obtained. In practice, using double precision on 
an IBM 360/91, we found we were able to explore 
the region £^80, j^80, |x|^20 in good detail, 
but for | XI values between 20 and 24, serious 
roundoff errors started to set in. 


B. Low-Lying zeros of At/(X) 


=e * 

A e-./ a (0) ’ 


(6.15) 


as was stated in Sec. VA above. 

Finally, we discuss the convergence properties of 
the power-series solutions. Rewriting Eq. (6.4) 
as a single second-order differential equation we 
find singular points at x = l, 2, and 0 ®. Rewriting 
Eq. (6.8) as a single second-order equation we 
find singular points at y- 1, 2, and °°, and addi¬ 
tionally at 


y i± 



1/2 


(6.16) 


Since x and y are related by 

1 1 1 

(I.*)* + (i_ } ,) , - i > 


(6.17) 


Numerical results for the low-lying zeros of 
A u (x) in the upper half plane are given in Tables 
II and III. In Table II we give the locations of the 
lowest zero (the zero of smallest magnitude |X|) 
for a range of values of £ and j. In Table III we 
give the locations of the lowest four zeros for 
£ = j ~ i. For all of the zeros tabulated, the ratio 
lImX|/lReXl is larger than 1. As 5 increases for 
fixed j , the ratio appears to be approaching 1 
from above; as j increases for fixed £, the ratio 
grows, as might be expected from the inequality 
of Eq. (3.12). For a given £,j, the successive 
higher zeros move up in the imaginary direction 
with a spacing ~tt between the imaginary parts, 
as is expected from the WKB estimate of Eq. (5.5). 
The pattern of the numerical results strongly 
suggests that [Imxl/|Rex|>l for all zeros of 
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A ?> (x). if this property were true, the zero-free 
regions of A [a] would be as indicated in Fig. 4, 
and a contour of integration in Eq. (1.5) initially 
along the real axis could be freely deformed to 
the positions indicated as “# 1” and 2.” The 
first (second) contour allows analytic continuation 
of into the entire upper (lower) e 2 half plane. 
Hence, for the distribution of zeros of £[A] shown 
in Fig. 4 one gets a radiative-corrected vacuum 
amplitude which is analytic in the entire e 2 
plane except for a branch cut running along the 
negative real axis from 0 to -*>. 

C. Behzvior of vzcuurn zmplitude for X imaginzry 

As we have stressed repeatedly above, the pos¬ 
sibility of taking the contour in Eq. (1.5) to lie 
along the imaginary axis can be realized only if 
W® decreases as a Gaussian (or faster) as X be¬ 
comes infinite along the imaginary axis. Actually, 
when subtractions are taken into account, the 
relevant question becomes whether (d/d\ 2 ) 2 \n&[A\ 
decreases along the imaginary axis. The differ¬ 
entiations just eliminate the arbitrary subtraction 
polynomial Q(X) which appears in Eq. (5.10); this 
polynomial is not relevant to the physics, and 
specifically is not present if we consider (in the 
one-mode approximation for virtual photons) the 
set of single-fermion-loop vacuum polarization 


TABLE II. Lowest-lying zero with ImX>0 for various 
i,j values. 


3 

* 

HeX ( 

ImX, 

IlmXil/lRfiXil 

1 

2 

1 

7 

-1.67 

7.12 

4.26 

1 

2 

3. 

o 

-3.47 

7.36 

2.12 

1 

T 

7 

2 

-6.46 

8.50 

1,32 

1 

2 

_y_ 

2 

-9.87 

12.57 

1.27 

1 

7 

15 

2 

-13.25 

16.65 

1,26 

l 

7 

1 

7 

-1.67 

7.12 

4.26 

3 

2 

1 

7 

-1.43 

8.93 

6.24 

5 

2 

1 

2 

-1.14 

7.73 

6.78 

T 

2 

1 

2 

-1.10 

9.71 

8.83 

9 

7 

1 

7 

-1.08 

11.70 

10.83 

2 

1 

2 

-1.17 

16.60 

14.19 

11 

2 

1 

2 

-1.14 

18.59 

16.31 

1 

7 

1 

7 

-1.67 

7.12 

4.26 

3 

2 

3 

2 

-2.94 

6.27 

2.13 

T 

7 

7 

2 

-6.13 

10.98 

1.79 

11 

2 

11 

T 

-9.86 

18.67 

1.89 


diagrams shown in Fig. 5. In order to obtain good 
convergence of the sum over separation parame¬ 
ters £, j , we found it necessary to differentiate 
once more with respect to X 2 . Multiplying (for con¬ 
venience) by X 2 , we get, finally, as the quantity 
being studied 

Results for W® versus - ix are shown in Fig. 6. 

In calculating the points for this curve, we 
summed on £ from i to 2\ and on j from i to 39 z; 
doubling both summation ranges for a subset of 
the points produced a 6% change for - iX = 1 and 
negligible (<1%) change for - ?'X^5. In fact, near¬ 
ly all of the sum for - iX ^5 came from A £ /s 
with £ = 2 , most likely a result of the fact that 
this is the value of £ which gives zeros of ly¬ 
ing closest to the imaginary axis (see Table II). 

The curve plotted shows no sign of a rapid de¬ 
crease, but unfortunately the distortions in both 
the envelope of the oscillations and the wave form 
suggest that the asymptotic region has not been 
reached, and so the results are inconclusive. We 
did not attempt to extend the computations further, 
because of the roundoff error problem mentioned 
above. 

VII. OPEN QUESTIONS 

We conclude by giving a brief recapitulation of 
the remaining unresolved questions. Within the 
framework of the one-mode approximation dis¬ 
cussed at great length above, some key problems 
are: 

(i) determining the asymptotic behavior of W W (X) 
along the imaginary axis (ruling out a Gaussian 
decrease would rule out the imaginary contour 
possibility and hence, as discussed in Ref. 2, 
would rule out the possibility of obtaining a cou¬ 
pling-constant eigenvalue when only a finite num¬ 
ber of photon modes are included), 

(ii) proving (or disproving) the distribution of 
zeros illustrated in Fig. 4, 

(iii) if Fig. 4 is correct, finding a simple for¬ 
mula or interpretation for the discontinuity of W x 


TABLE m. First four zeros for £=/-r‘ with ImA>0. 
(For each there is a corresponding complex-conjugate 
zero in the lower half plane.) 


Zero number k 

lie\ 

ImA* 

IlmAft l/jReA*] 

ImA* - lTnA*_ r 

i 

-1.67 

7.12 

1.26 



-l.BG 

10.23 

5.50 

3.11 

r-> 

o 

-1.99 

13.99 

6.73 

3.16 

4 

1 

to 

0 

k-t 

o 

16.52 

7.87 

3.13 
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across its cut in the e 2 plane, and 
(iv) finding a compact expression for A w (x) in 
which the parameter 6 in the Wronskian has been 
explicitly eliminated. 

Going beyond the one-mode problem to the case 
when a finite number of photon modes are present, 
one can ask whether the zero-free regions shown 
in Fig. 4 persist. 10 * 11 If so, then the real contour 
would give cut-plane analyticity in e 2 for any finite 
number of modes, and the important (and un¬ 
doubtedly difficult) question of what happens when 
the limit to an infinite number of modes is taken 
would be brought to the fore. 
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APPENDIX A: FREE GREEN’S FUNCTION 

We give here a closed-form expression for the 
Green’s function of Eq. (4.5) in the free (A-0) 
case. The result is most compactly expressed 
in terms of the Jacobi functions 



FIG. 4. Conjectured zero-free regions suggested by 
the numerical results of Sec. VI B and Tables II and III. 
The dashed lines show permissible deformations of the 
real-axis contour of integration in Eq. (1.5). 


P («.b>/^ = ry+a + i) 
y K} r> + i)r(a + i) 

XF(- v, v + a + fi + 1; a + 1; i - (Al) 

where F(a r b; c; z) is the usual hypergeometric 
function. [The ordinary Jacobi polynomials cor¬ 
respond to the case where u in Eq. (Al) is a non¬ 
negative integer. We will also use the case where 
v is a non-negative half-integer.] We find (for 
£> 0 ) 




c[=-i tane(tanle)«(^); 


a 0 

^2 “ l -o } y 


(A2) 


2 ' 1 +sin6 


cosfl V +l/2 / 1+ sin£ ^] p ( jl . j . n / 1 


£ dQ 


sin 6} 9 


o - / cosg_ y +1/2 ( 1 _sme ±\ n.M) /J_\ 
° 2 \l + 3 ine/ \ i d$! X \sinfl/ ' 

The Wronskian of the two solutions is easily cal¬ 
culated by taking either the limit 6*0 or the limit 
7T, giving 

w 0 =aZc°-a° c° 

- -r(j + g + i) (A3 

r(i)ru + i)r(j + |) • ' 

The free Green’s function then immediately fol¬ 
lows from the recipe of Eq. (4.5), 


+ ^CEK + 

+ + 

+ 

FIG. 5. Single-fermion-loop vacuum-polarization dia¬ 
grams. This set of diagrams is finite for 7] 1 * 7] 2 , and 
requires no subtractions. However, if we contract with 
r&fiji) yfiiiVi) and integrate over ij t and t) 2 , the short- 
distance singularity as ijj_ — t) 2 leads to a divergence, 
corresponding to the^, counterterm in Eq. (5.10) and 
the finite remainder Q s \ 2 in Eq. (5.11). This divergence 
is of no physical significance, and so we differentiate 
to eliminate Q 2 . 
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*i<*. 

*i >e 2 . 

(A4) 


a 

€ 


c 
-€ 


-x 


(B2) 


We note finally that the solutions ip°, ip% in Eq. (A2) 
differ by constant factors from the X = 0 limit of 
the power-series solutions for ip 2 given in Sec. 
VI. 


APPENDIX B: WKB EXPRESSION FOR A*/(\) 


We derive in this Appendix the WKB asymptotic 
approximation for A E> (x) quoted in Sec. VA. Our 
starting point is the set of coupled differential 
equations for the components a, c of ip* 


da 
d 6 

dc 

d6 


~[£(sin0) 1 + Xsin0]a + (j + i)(cos0)~ 1 c =0, 


+ [^(sinfl)" 1 + Xsin0] c + (} +i)(cos0)" 1 a =0 . 


(Bl) 


These equations are evidently invariant under the 
interchange 


allowing us to obtain equations satisfied by c by a 
simple substitution once we have found the cor¬ 
responding equations satisfied by a. Eliminating 
c and defining a new variable x = cos0, we find the 
following second-order differential equation sat¬ 
isfied by a (a r =da/dx, etc.) 

a” + Pa' + Qa = 0 , 


1 - 2* 2 
%(!-**) ’ 



-2jx rJi±i21 

l-x 2 LxMl-x 2 ) 


+ (1-X 2 ) 2 


+ X 2 


5 Xd-2* 2 ) 

+ xd-x 2 ) 2 + xd-x 2 ) ■ 


(B3) 


Noting that P is unchanged by the substitution of 
Eq. (B2), we introduce new dependent variables 
b and d by writing 



FIG. 6. Results for versus - ?X. The dots denote computed points. Maximum and minimum points denoted by 
X were determined by a polynomial interpolation procedure from the neighboring computed points. 
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a =6 exp 


hr 


Pdu 


= bx~ ll2 ( l-x 2 )' 1 ' 4 , 

r f x 

c=dex pj -jJ Pdu 

= d*- 1/2 (l-*T 1/4 . 


These satisfy the differential equations 

b"+k t 2 b = 0, d"+k 2 d= 0, 

k b + k d ~ t 2 > 


l+2x 2 2£X 

4x 2 (l~x 2 ) 2 " 1-x 2 


U + lf + ? 2 + * 

x 2 (l-x 2 ) (1-x 2 ) 2 


(B4) 


(B5) 


= 4 \ti-2x 2 ) 

12 x(l -x 2 ) 2 xd-x 2 ) ■ 

It is also useful to have the first-order differen¬ 
tial equations coupling b and d, which from Eqs. 
(Bl) and (B4) are found to be 


2 x(l-xr) 1-x 2 


xd-x 2 ) 1 ' 2 

U+i) . 


(B6) 


,, J 1 2^-1 _j_ 1 (j+i) n 

L 2 xd-* 2 ) " 1-x 2 J " xd-x 2 ) 1 ' 2 ° • 

Finally, in terms of b and dthe Wronskian is 
given by 


w =a 2 c I ~a 1 c 2 


x(\—x 2 yi 2 


(B7) 


We now proceed to construct approximate, WKB 
solutions to the above equations when | X | is treated 
as a large parameter. We begin with the equation 
for b. We have 


fe h 2 = -X 2 +X 


-U , 1-2* 2 

1-x 2 xfl-x 2 ) 


+ 0 ( 1 ), 


(B8) 


and hence 


R s «1 


(B9) 


for all x except very near the end points at x = 0,l. 
Near the end points we find 


R ~ 


2X 2 X 


x»0 


(BIO) 


R~ 


2£+l 
4 (1-x) 2 |X 


x = 1 


and so except in the intervals 


r*r 


1—x — 


TxT 


(B11) 


we can use a WKB solution for b and d. Applying 
the standard lowest-order WKB recipe 12 to the 
second-order differential equations for b and d , 
and then imposing the linear equations in Eqs. (B6) 
which relate b and d t we find the WKB-region 
solutions 


& WKB = 4" e> “x- 1/2 d+x) (t - 1/2l/ ^ 1 -x)- <t + 1/2l/2 +Be- X ‘x 1/2 (i+x)- (t -‘ /2l/2 d-x/ t + 1/2y2 , 

A 

d WKB = e' x ‘x' 1/2 (1 +x)- (c+ 1/2l/2 (l-x)f E - 1/2,/2 + l^e^x 1 ' 2 (1 +x/ t+ " 2l/2 (l-x)- a ' 1/2l/2 . 

^X J + g 


(B12) 


In the end-point regions x~0,1 we must join Eq. (B12) on to more accurate approximate solutions. In 
the vicinity of the end points we find 


= -j<y l J + ill_( X + 4 )* + l -j^ + D + ff^ + CMx 2 ), H 0 = -X+ 2«, x = 0 


, 2 (^ ~ 2 ) (^ + j) £X + 2 X-4^ + 4(^-2 ) U + 2 )+j j( j + 1) 

(1-x) 2 " 1-x 


(B13) 


-f[;(;+i)-^]-[^+^U-f)] 2 -iU-f) 2 -i+^(i-^) + o((i-x) 2 ), 

tfi = --r;(; + i) + £-iU + !■)(€-*-)+ *=*■ 


Forx=j/|x|, the x -2 , x“\ x° terms near x = 0 
are of order |x| 2 , whereas the term U 0 x is of 
order j£ t down by a factor (j/lx|)(£/|x|) from 
the leading terms. Similarly, for l-x=£/|x|, the 
(l-x)~ 2 , (1-x)" 1 , (l-x)° terms near x = l are of 
order |x| 2 with the term H t ( 1-x) of order £ 2 , 


down by a factor of (£/| x| ) 2 from the leading 
terms. The terms Ofx 2 ) and 0((l-x) 2 ) can be 
shown to be as small as the linear terms which 
we have just evaluated. Hence we identify 


e, = «/|x|, e 2 =j/\\\ 


(B14) 
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as the effective smallness parameters in the WKB 
solution, and proceed to solve the differential 
equations at the endpoints neglecting the linear 
and higher terms in x and 1-x in Eqs. (B13). Both 
at x = 0 and x~l, the differential equations can 
then be reduced to Whittaker’s equation 

g + (B15) 

with the regular solution 


b=e'‘ /2 z U2 * l ‘m + ti-K,l+2^,z), (B16) 


where 4> is the confluent hypergeometric function 


$(a, c; z) -1 



z_ a(g + 1 ) z 2 
1 ! + c(c + 1 ) 2 ! 




= e'$(c-a, c; -z). (B17) 

Carrying out the solutions explicitly, we find to 
the required accuracy the following end-point 
solutions: 


8* 0 , x» 1 : 

6 J (z)=exp{-[x+i(^-l)]z}z <E+l/2,/2 * 




j 



-u+j) 

2 *' 2 3 4 U + i) 


exp{-[x+iU+f)].z}.z (E + 3/2l/2 *U + i £ +■§; 2 [x+*U +|)]z), 


6~{ir, x~0: 

b 2 (x) = e" (X+ E)I x 1 * *$(j +i 2(j + 1 ); 2 (X + «)*) , 
d 2 (x)=e' (X+E 1 , x> + 1 $(j+i 2 (j + l); 2 (X + «)x). 


z = 1 -x; 


(B18) 


Joining the WKB-region solution onto the asymp¬ 
totic form 13 of the x« 0 end-point solution, we de¬ 
termine the constants A, B in Eq. (B12) to be 


i - 

This is most easily done by a comparison with 
the explicit free solutions given in Appendix A. 
Writing 


2A _ r(2(j +1)) 
j+r r(j+l) (2X) 

„ r(2(j + p) /-i \ j * U2 

r (j+i) \2x ) 


(B19) 


This permits us to extend the solution ip 2 to the 
region near 0 , x~ 1 , which is the asymptotic 
region for the x* 1 end-point solution tpi* Substi¬ 
tuting the WKB extension of ip 2 and the asymptotic 
expansion of ij) l into Eq. (B7), we get for the 
Wronskian 


u>(\) = - r - ^ [i 2 (E -*' 2) ' 2 (2X)' 0 + " 21 

+ 2 ) 

x[e x +e“ x (-l) >+ £+ » 2 “ 2<e+ 1/2) 

x U+i)ru+|)x- (E+1/2 >] . (B 20 ) 



(B21) 


and letting 0 — 0 , 5 tt to determine K j, K 2 , respec¬ 
tively, we find from Eqs. (B4) and (B18) that 


= 2 (E -^ 2 )/ 2 ru+i)r(j+l) 
r(j + « + i) 

= 2 Jtl/ 2 r(j + pr (4 + i) 
ru+«+i) 


(B22) 


Combining with Eq. (A3) we then get 


w (0) = -K l K 2 


r(j+ 4 +i) 

r(i)r(4 + i)r(j+|) • 


(B23) 


To complete the calculation, we must determine 
the value w>( 0 ) corresponding to the normalization 
of the solutions i/j lf tp 2 used in the above analysis. 


Dividing Eq. (B20) by Eq. (B23) to get w(X)/w( 0), 
and then using Eq. (4.17), gives the final WKB 
formula quoted in Eq. (5.2) of the text. 


l S. L. Adler, Phys. Rev. D 6, 3445 (1972); 7, 3821(E) 
(1973). 

2 S. L. Adler, Phys. Rev. D JB, 2400 (1973). 

3 We can omit the matrix t 2 in Eq. (2.32) because the 
spinors which appear have already been reduced to 
four-component form. 

4 The re is, of course, a third regular singular point at 


m — For a discussion of the Riemann equation and 
its solution see G. Birkhoff and G. C. Rota, Ordinary 
Differential Equations (Blaisdell-Ginn, Waltham, 

Mass., 1969), p. 272 ff. 

5 These may be derived from the identities on pp. 274—276 
of Y. L. Luke, The Special Functions and Their Approx- 
imations (Academic, New York, 1969), Vol. 1. 
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6 See f for example, G. Birkhoff and G. C. Rota, Ordinary 
Differential Equations (Ref. 4), p. 47. 

T It is always possible to find solutions satisfying the 
standardization conditions because, as stressed in 
Sec. HD, the boundary conditions at 0 = 0, are X- 
independent. 

3 Let tr denote the Pauli matrix trace; then Tr denotes 
the complete trace TeA = / 0 T T,2 d0 (0|trA 1. 

9 A. S. B. Holland, Introduction to the Theory of Entire 
Functions (Academic, New York, 1973). See especially 
Sec. 1.4, Chap. 4, and Sec. 6.2. 

10 S. Coleman (unpublished) has conjectured this to be the 
case. Coleman argues that at the 45° sector boundaries 
in Fig. 4, Re (A 2 ) changes sign from positive to negative, 


corresponding to a transition from “magnetic-field - 
like” to “electric-field-like” behavior of the external- 
field problem, and suggesting very different analyticity 
properties on the two sides of the boundary. 

U A. S. Wightman (unpublished) has proved, in the Minkow¬ 
ski metric case, that the Fredholm determinant can 
have no zeros for arbitrary purely real external fields. 

l2 See, for example L. 1. Schiff, Quantum Mechanics 
(McGraw-Hill, New York, 1968), third edition, pp. 270- 
271. 

13 Higher Transcendental Functions (Bateman Manuscript 
Project), edited by A. Erdelyi (McGraw-Hill, New 
York, 1953), Vol. 1, p. 278. 


Erratum: Massless electrodynamics in the one-photon-mode approximation 

[Phys. Rev. D 10, 2399 (1974)] 

Stephen L. Adler 


In Sec. VIB of this paper, numerical evidence 
was given suggesting that the zeros of a Cj (a) obey 
the condition |lmx|/|Rex|>l, which would imply 
cut-plane analyticity for the radiative-corrected 
vacuum amplitude W t . Recently, Chernin and Wu 1 
have shown that this conjecture is false by deriving 
the following approximate large- £ expression for 
the zeros of A t i/2 (^)* 


2 [F(0 o )+wJTf]-iln^- - | lncos0 o =0 , 




F(9) = £ In tan£0 - X cos# , 



For £ =-^ this formula gives the following pre¬ 
dicted zeros: 


n=2: X = -11.63 +i 9.58 , 
w = 3: x =-12,52 + i 13.14 , (2) 

n =4: x = -13.23 + t 16.57 . 


The n =4 zero is the one given in Table n of Sec. 

VIB; a reexamination of the computer output 
which I used in preparing Table n indicates that 
the lower zeros were missed by careless reading 
of the output (the programming itself was correct), 
and are indeed given quite accurately by the 
Chemin-Wu formula. For example, the program 
used to get Table II gives X =-11.66 + 49.56 for the 
location of the n =2 zero for j=i, |=-^. For 
fixed w, the Chernin-Wu formula shows that 
-x/| 1 as | — », and so in fact there are zeros 

with arbitrarily small |Imx|/|Rex|, and hence 
no zero-free angular sectors for the Fredholm 
determinant, which is proportional to 

n[A C ,u)A £ ,(-*)r' +i . 

s. c 

The zeros “near” the real axis still lie outside the 
zero-free strip containing the real axis which was 
established in Sec. IV. 


l D. Chernin and T. T. Wu (unpublished). 
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(Received 26 October 1971) 

Contributions to the absorptive K L -~ 2ju amplitude coming from intermediate 3?r states are 
estimated on the basis of recent soft-pion results for the process 3?r— 2y. These contribu¬ 
tions turn out to be far too small, by 4 orders of magnitude, to resolve the K L -~2fA puzzle. 


All theoretical resolutions so far proposed for 
the K l — 2p. puzzle 1 are forced to call upon can¬ 
cellation effects which have to be regarded as 
accidental at the present level of understanding. 
Apart from this, the various schemes differ widely 
with respect to introduction of qualitatively new 
physics. 2 The most conservative approach is one 
which dismisses the possibility that CP violation 
or new kinds of particles or interactions play an 
important role in the puzzle. Instead, the burden 
is placed on 3jt intermediate states, which are 
supposed to provide terms which largely cancel 
the contribution from the 2y state in the unitarity 
equation for the absorptive K h - 2p amplitude. 

The strain on credulity here lies in the magnitude 
required of the 3tt contribution, a magnitude which 
has to be appreciably larger than first rough esti¬ 
mates would suggest. 3 In the present note we add 
our contribution to this strain, in the form of an 
estimate of 3tt contributions based on soft-pion 
considerations. 

In order to assess the 3 tt effects in a frame¬ 
work which ignores CP violation and accepts 
standard photon-lepton electrodynamics, one re¬ 
quires information on the amplitudes for 3 tt— 2 y 
and 3 tt— 2p, In our conventional framework, 
the latter is fully specified if the former is known 
for virtual as well as real photons. All the re¬ 
maining ingredients of a unitarity analysis based 
on 2y and 3 tt intermediate states are well enough 
known: the 2y— 2p amplitude from standard elec¬ 
trodynamic theory, the k l — 2y and K h — 3tt ampli¬ 
tudes (or rather, their moduli) from experiment. 
Throughout the unitarity discussion we ignore all 
other intermediate states. To lowest order in the 
fine-structure constant the 2vy and, strictly 
speaking, also the Ziry intermediate states ought 
to be considered. However, the former has been 
shown to be unimportant, 4 and the latter can rea¬ 
sonably be expected, on phase-space considera¬ 
tions alone, to be even more negligible. We shall 

5 


have a brief comment on this later on. 

At theoretical issue then are the amplitudes for 
3 tt°, 7 r + 7r“7r° — two real or virtual photons. These 
objects are of course interesting in their own 
right, even apart from their role in the K L — 2p 
puzzle. In particular, the application of soft-pion 
considerations has been discussed by Aviv, Hari 
Dass, and Sawyer 6 ; and the subject has since been 
taken up by other authors. 0-10 Interesting issues 
concerning current algebra, partial conservation 
of axial-vector current (PCAC), and Ward-identity 
anomalies arise here. Especially relevant for our 
present purposes is the idea, proposed by Aviv 
and Sawyer, 11 that the soft-pion approximation 
might provide a reasonable basis for estimating 
contributions from the Ztt states in the unitarity 
analysis of K h - 2p. decay. It must be said at once 
that, kinematically, the pions in K h - Ztt decay 
cannot all three be so very soft, unless one re¬ 
gards the K -meson mass to be “small” on a had¬ 
ronic scale. With appropriate reservations on 
this score, one may nevertheless hope that the 
soft-pion methods provide more reliable estimates 
than can be gained from purely dimensional and 
phase-space arguments. 

The Aviv-Sawyer analysis 11 of K L — 2p. decay 
was based on the Ztt— 2y amplitudes of Refs. 5 and 
6 . We believe that these amplitude results are in 
error and that the correct soft-pion expressions 
are as in Ref. 8. We have therefore repeated the 
analysis. Despite these corrections, we find with 
Aviv and Sawyer that the Ztt states play a negligible 
role in the absorptive amplitude for K h — 2 jll decay. 
The “naive” unitarity bound, based solely on the 
2y intermediate state, is corrected at most (de¬ 
pending on phases) by a factor of order 10“ 4 in the 
decay rate. A brief account follows. 

The k l — 2/jl amplitude has the structure 

Amp(K L -2n)=gu(p)r s v(p), (1) 

where p and p denote the \x~ and /x + momenta. The 

770 


Copyright© 1972 by the American Physical Society. Reprinted with permission. 
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decay rale is given by 

r^-2^ = 1^|if I 2 , (2) 

where 

y = (1 - 4m 2 /M 2 ) 1 ' 2 

is the muon velocity, with m the /x mass, and M 
the K mass. The object is to estimate the absorp¬ 
tive amplitude Im^, on the basis of unitarity con¬ 
siderations, in order to set a lower bound for 
K l - 2/x decay. To get at the unitarity contribution 
from the intermediate 2y state we have to consider 
K h — 2y decay, whose amplitude has the structure 

Amp(Kx - 2y) = Ge (J ^ 0 e[, 1) fe[, 1) € < p 2) fe < 0 2) , (3) 

where fe (i) and e c,) are the momentum and polariza¬ 
tion vectors of the ith photon. The decay rate is 

r ^- 2 y)= f£ |g|2 - (4) 

The contribution to Im^ coming from the 2y state 
is given by 

Im if I 2 r = ln (lT7;) ReG ’ (5) 

Now the modulus | G | is known from empirical 
information on the K L — 2y decay rate. If unitarity 
contributions coming from 3 tt states are systemat¬ 
ically ignored for both K h — 2y and K h -> 2p. decay, 
then Im^= Im^| 2r , ReG = | G |, and one finds the 
“naive” unitarity bound 

T(K L -2v.)/TUt L )Z 6X10- 9 . (6) 

Our task here is to compute the direct 3 tt contri¬ 
butions to Im£, and also their contributions to 
ImG. For these purposes we require the ampli¬ 
tudes for 3 tt°, 7r*7r"7r°-two real or virtual photons. 
We adopt, but do not reproduce here, the soft-pion 
expressions of Ref. 8. These expressions contain 
three parameters, of which two are well estab¬ 
lished experimentally; the constant which 
describes 7[°-2y decay; and/, the PCAC constant. 
The remaining parameter, x, measures the iso- 
tensor component of the “a term” in the current- 
algebra treatment of tt-tt scattering. One usually 
supposes, as we shall do here, that x=0, Unless 
x is unbelievably large, of order 10 3 -10 4 , this 
neglect will not qualitatively alter our conclusion 
that the 3?r states do not resolve the K L — 2/x puz¬ 
zle. Indeed, given the formulas of Ref. 8, and 
with a little thought about the structure of the uni¬ 
tarity equations and the size of phase space for 
three pions, one can readily arrive at this qualita¬ 
tive conclusion from rough dimensional arguments. 
Nevertheless, since we have in fact carried out 
the numerical work in detail, and because a cer¬ 


tain delicacy appears in the details, we shall 
comment here on a few technical points. For the 
unitarity calculations we require not only the 3 tt 
— 2y and 3 tt — 2/x amplitudes, but also the full com¬ 
plex amplitudes for K L - 3tt. The latter are known 
from experiment only in modulus. However, we 
can get upper bounds on the 3 tt contributions by 
replacing all amplitudes in the unitarity equations 
with their moduli. It is these upper bounds that 
we shall report. The computations for ImG are 
now completely straightforward. For the 3/ and 
it'it'v 0 contributions we find 

ImG| 3ff o^ 3xlO" 5 |G|, 

, ' , , (7) 

ImG2x 10 |G|. 

It is evident that the 3 tt effects here are totally 
negligible. 

Computation of the direct 3 tt contributions to 
Im£, the absorptive K L — 2p. amplitude, is some¬ 
what less straightforward. The formulas of Ref. 

8 are supposed to apply (in the soft-pion limit) for 
virtual as well as real photons, and they therefore 
provide a basis for computation of the 3 tt— 2p. am¬ 
plitude. On inspection of the formulas for 3 tt — two 
real or virtual photons one observes two kinds of 
terms: those which describe emission of a photon 
by an external pion (bremsstrahlung terms) and 
those which do not. Correlation of these descrip¬ 
tive expressions with explicit terms in the formu¬ 
las should be evident and is left to the reader. 

The 3/- 2y amplitude is purely of the nonbrems- 
strahlung type, whereas the ttV"/ amplitude has 
both kinds of terms. Computation of the brems¬ 
strahlung-term contributions to 3 tt — 2fi presents 
no difficulties, although it is tedious. The calcula¬ 
tion here has a structure of the kind associated with 
a one-loop box diagram and was carried out nu¬ 
merically. For practical purposes we found it 
convenient to use dispersion-relation methods, 
taking the invariant squared mass of the 3 tt system 
as the dispersion variable. One encounters no 
anomalous thresholds here, thanks to the mass¬ 
lessness of the physical photons in the intermediate 
state 3 tt— 2y — 2fx. For the nonbremsstrahlung 
terms, the calculation of the 3 tt— 2/x amplitude has 
a structure of the kind associated with a one-loop 
triangle diagram. But here one encounters a loga¬ 
rithmically divergent integral. This comes about 
because the corresponding amplitudes for 3 tt — 
two virtual photons do not have any damping as 
the virtual-photon masses become very large. 

The soft-pion approximation is unsatisfactory in 
this regard. However, since the divergence is 
only logarithmic, we do not think it misleading to 
employ a cutoff. We again employ dispersion-re¬ 
lation methods. The dispersion integral is loga- 
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rithmically divergent and we simply cut it off, at 
an invariant squared mass taken rather arbitrarily 
to be 1 GeV*. 

Once the 3 tt- 2/x amplitudes have been estimated, 
computation of the Zir contributions to the absorp¬ 
tive K l — 2/x amplitude is now a simple matter. 

We present the results in the form of comparison 
of the 3;r and 2y contributions to Im#, 

Impair" s 3xlO' 5 Im^| ar , 

( 8 ) 

3xlO- 5 Im^| ar 

In summary, the 3tt states, at least when treated 
in the soft-pion approximation, do nothing to re¬ 
solve the K l — 2/x puzzle, 12 

Finally, we comment briefly on the inter¬ 

mediate state, which is the remaining intermediate 
state which can contribute at this order in a . Al¬ 


though the decay K h — ttV-ttV has not been ob¬ 
served, to leading order in the photon momentum 
(the bremsstrahlung approximation) the amplitude 
for this process is related by gauge invariance to 
the amplitude for K L — ttVV. The current-alge¬ 
bra coupling 6 of a photon to ttV'tt 0 can then be used 
to compute — mV"* An estimate of the 

relevant integrations indicates a contribution to 
Im^ of essentially the same size as that coming 
from the 3tt intermediate state. So the 37iy con¬ 
tribution is also at least four orders of magni¬ 
tude too small to resolve the K h - 2/x puzzle. 
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National Accelerator Laboratory, where part of 
this work was done. 
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We give a simple phenomenological analysis of hadronic and electronic vacuum-polarization effects. We 
argue that the derivative of the hadronic vacuum polarization, evaluated in the spacelike region, 
provides a useful meeting ground for comparing e + e ~ —»hadron annihilation data (assumed to arise 
from one-photon annihilation) with the predictions of parton models and of asymptotically free field 
theories. Using dispersion relations to connect the annihilation and spacelike regions, we discuss the 
implications in the spacelike region of a constant e*e~ annihilation cross section. In particular! we 
show that a flat cross section between t = 25 and r =81 (GeV/c) 2 would provide strong evidence 
against a precociously asymptotic “color” triplet model for hadrons. We then turn to a consideration of 
the apparent discrepancy between observed and calculated muonic-atom x-ray transition energies. 

Specifically, we analyze the hypothesis of attributing this discrepancy to a deviation of the asymptotic 
electronic vacuum polarization from its expected value, a possibility which is compatible with all 
current high-precision tests of quantum electrodynamics. Under the additional technical assumption that 
the postulated discrepancy in the electronic vacuum-polarization spectral function increases 
monotonically with t, the hypothesis predicts a decrease in the expected value of the 
muon-magnetic-moment anomaly a M = j(g M — 2) of at least —0.96 X 10 -7 , which should be 
detectable in the next round of g ^ — 2 experiments and which is substantially larger than likely 
uncertainties in the hadronic contribution to a M . By contrast, postulating a weakly coupled scalar boson 
<f> to explain the muonic-atom discrepancy would imply a (very small) increase in the expected value of 
a M . Both the vacuum-polarization and scalar-boson hypotheses (for M $ ^ l MeV) predict a reduction 
of order 0.027 eV in the 2p 1/2 — 2s 1/2 transition energy in [ 4 He, ju.]*, an effect which may be 
observable. 


I. INTRODUCTION 

A number of recent experiments have brought 
aspects of vacuum-polarization phenomena to the 
fore. Most prominent are the measurements by 
the Cambridge Electron Accelerator (CEA) and 
the Stanford Linear Accelerator Center-Lawrence 
Berkeley Laboratory (SLAC-LBL) groups of an 
unexpectedly large cross section for —had¬ 
rons, 1 which gives the absorptive part of the had¬ 
ronic vacuum polarization. In another area of 
physics, measurements of muonic-atom x-ray 
transition energies, undertaken to probe the as¬ 
ymptotic form of the electronic vacuum polariza¬ 
tion, appear to show a persistent deviation from 
theoretical expectations. 2 Forthcoming high- 
precision measurements of the muon-magnetic- 
moment anomaly g ^ — 2 will provide an even more 
sensitive probe of the asymptotic electronic vacu¬ 
um polarization, and of the hadronic vacuum po¬ 
larization as well. We present in this paper sim¬ 
ple phenomenological arguments which bear on 
the interpretation of both the annihilation and the 
muonic experiments. Although fundamentally 
different physical issues are at stake in the two 
classes of experiments, common elements of 

10 


formalism make it natural to consider them to¬ 
gether. In Sec. II we use dispersion relations to 
determine what the timelike-region e*e~ annihila¬ 
tion data say about the possibility of precocious 
asymptotic scaling in the spacelike region of the 
hadronic vacuum polarization (assuming that the 
observed data do indeed result from one-photon 
annihilation). In Sec. m we analyze the muonic 
experiments, with the aim of distinguishing be¬ 
tween the possibilities that the muonic-atom 
x-ray discrepancies may arise from a discrepancy 
in the asymptotic electronic vacuum polarization, 
or from the existence of a weakly coupled light 
scalar boson. Some technical details are given 
in the appendixes. 

11. ELECTRON-POSITRON ANNIHILATION 
AND PRECOCIOUS SPACELIKE SCALING 

The experimental data for electron-positron 
annihilation into hadrons are conveniently ex¬ 
pressed in terms of the ratio R(t) t defined as 

^ o^hadro^ 
a(e e - MM ; t) 

with 

3714 
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446 


Adventures in Theoretical Physics 


10 SOME SIMPLE VA C UU M - P OL A RIZ A T ION PHENOMENOLOGY:... 3715 


<7(eV- mV; f) = (l + 


m 



m 


1/2 


4;ra : 
37“ 


47TQ! 2 87xl0' 33 cm 2 

~ 3* = t [in (GeV/c) 2 ] 


and with t the virtual-photon four-momentum 
squared. In Fig. 1 we have plotted (versus E = t ,/2 ) 
a smooth interpolation through all available exper¬ 
imental data for R in the continuum region (ex¬ 
cluding the p, co, and <p vector-meson contribu¬ 
tions). The CEA and SLAC-LBL data points are 
indicated, 1 while the portion of the curve below 
t = 2.5 is taken from the “eyeball” fit given by 
Silvestrini. 3 When replotted versus t> the data 
for R{t) rise approximately linearly, indicating 
a roughly constant hadronic annihilation cross 
section of 2lxl0~ 33 cm 2 . Assuming that single¬ 
photon annihilation is indeed being measured, this 
behavior strongly contradicts the asymptotic be¬ 
havior expected on the basis of parton or of as¬ 
ymptotically free-field-theory models of the 
hadrons, which predict 

R~C , o (3) 

with the constants C tabulated in Table I. How¬ 
ever, it can always be argued that while pre¬ 
cocious asymptotic behavior is expected from the 
SLAC scaling results in the spacelike region, the 
annihilation reaction involves the timelike region, 
in which asymptotic predictions may be approached 
much more slowly. This objection naturally raises 
the question of determining what the annihilation 
data tell us about behavior in the spacelike region. 

To answer this question we consider the renor¬ 
malized hadronic vacuum-polarization tensor 
(*«? 2 ) 

^v(q) = (q v q v -tg vv )n'~ H '>(t), 
which obeys the dispersion relation 

n ( ' H) (t) = tC du ( 1 A) Imn( " > («) 


(4) 

(5) 


and which is related to the electron-positron an¬ 
nihilation cross section into hadrons by 

v(e + e ~ — hadrons; u) 

= ^ImII Cff >(M) x (known constants). (6) 


Rather than using Eq. (5) directly, we consider 
its first derivative 



(1/tt) Imn fff) (w) 

{u-t? 



which on substituting Eq. (6) and using Eqs. (1) 
and (2) can be rewritten as 



FIG. 1. “Eyeball” fit to the continuum e*e~ annihilation 
data. The p, co, and 0 vector-meson contributions are 
not included. 

^-n H (t)- f du t ^~2 x (known constants). 
dt J 4mir 2 {u-t) 

( 8 ) 


Restricting ourselves to the spacelike region 
t = -Sy s >0 and rescaling to remove the constant 
factors, we obtain from Eq. (8) our basic relation 



du R ( u) 

2 (S + M ) 2 




t 


x (known constants). (9) 

— s 


The quantity T(-s) has two desirable properties 
which make it suitable for studying the implica¬ 
tions of the annihilation reaction for spacelike- 
region behavior: 

(i) The integrand in Eq. (9) is positive definite, 
and so omitting the high-energy tail of the integral 
makes an error of known sign. Specifically, if 
experimental data on R are available only up to a 
maximum momentum transfer squared t Ci and if 
we define T obs (-s) by 


obs 


(-s) = /‘ C duR(u) 


TT 


2 (S + u) 


( 10 ) 


TABLE I. Values of C In different models. 


Model 

C 

Simple quark triplet 

2 

T 

Color quark triplet 

2 

Color quark quartet 

10 

3 

Han-Nambu triplet 

4 

Han-Nambu quartet 

6 
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then we have 


T obs (-s)*T(-s), 0*s<~. (11) 

(ii) It is the quantity T(-s) for which parton 
models and asymptotically free field theories most 
directly make predictions 4 ; the asymptotic pre¬ 
dictions fori? are always obtained from the pre¬ 
diction for T(-s) by a dispersion-relation argu¬ 
ment, which is bypassed if we use T(-s) as the 
primary phenomenological object. In a model in 
which R asymptotically approaches C, we have 

T Ih (-s)~C/s, s-°°. (12) 


In asymptotically free field theories, the leading 
logarithmic correction to Eq. (12) is also de¬ 
termined. Specifically, in the SU(3)® Su(3)' 
“color” triplet model of the hadrons, one has 4 


Tto(-s) 


2/ 4 1 

s \ + 9 ln(s /s 0 ) + 


(13) 


with s 0 an arbitrary momentum scale which, in 
the numerical work, we will take as 2 (GeV/c) 2 . 

Before proceeding to numerical applications, 
let us briefly discuss the question of subtractions. 
Clearly, if the one-photon annihilation cross sec¬ 
tion were to remain constant as t **>, we would 
have /?(«) as m — °° and the integral in Eq. (9) 
would need an additional subtraction to be well 
defined . 5 However, such behavior of R would in 
itself contradict Eq. (3) for all values of C , and 
hence would rule out all versions of the parton 
model or of asymptotically free field theories. 

On the other hand, if Eq. (3) is true for any finite 
C, then the integral in Eq. (9) converges as it 
stands and provides a suitable medium for com¬ 
paring the annihilation data with theoretical ex¬ 
pectations in the spacelike region. Note that a 
constant subtraction term in Eq. (5), which would 
be present if we renormalize at a point other than 
£ = 0 , would not contribute to the t derivative in 
Eq. (7); hence the renormalization prescription 
is not a possible source of ambiguity. 

We turn now to the numerical results. In Fig, 2 
we plot T ob 8 (-s) [in units where unity = (1 GeV/c) 2 ], 
as obtained from all experimental data up to 
£ c = 25 (GeV/c ) 2 according to the formula 6 

7 1 ob S (-5) = T w ^(-s)+T p (-s) + T conl( 1 ) (-s) , 


9tt My r(V-eV) 


'V + <l>(-s)=— Y* v v 
a 2 , (s + 


(xJ. <f> 


(s +M V 2 ) 


2\2 > 


(14) 




r 25 dt 



-s in (GeV/c) 2 


FIG. 2. The function T obs {—s) as obtained from all 
experimental data up to t c = 25 (Gev/c) 2 , in units where 
unity = (1 GeV/c) 2 . 


The vector-meson parameters appearing in Eq. 
(14) are given in Appendix A, while R(t) is the 
continuum contribution to R graphed in Fig. 1. 

In Fig. 3 we plot a family of curves, obtained by 
assuming that for 25 « t c the annihilation cross 
section < 7 (e*e~ — hadrons; t) remains constant at 
2lxlo " 33 cm 2 . That is, we take 



-s in (GeV/c) z 

FIG. 3. Ratios of T obs (-s) to r lh (-s), with T lh (-s) 
the color-triplet prediction of Eq. (13). The t c -25 
curve uses the presently known data} the curves for 
higher tc assume a constant hadronic annihilation cross 
section of 21 x lO" 33 cm 2 above 25 (GeV/c) 2 . 
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T obs (-s) = T^(-s) + T p (-s) 


iCont(2) 


+ T cont(l)(_ s ) +T conl(2)(_ s ) ; 




25. 


(15) 


= 0.24lln| —4^ -s (<c ~ 25) 


s +25/ (s + £ c )(s + 25) J 


Rather than plotting T obs (-s) we have plotted the 
comparison ratio T obs (-s)/T th (-s), with T lh (-s) 
the “color" triplet prediction of Eq. (13). The 
/ c = 25 curve is just the curve of Fig. 2 divided by 
Eq. (13); since this curve lies below 1, the ex¬ 
isting annihilation data do not yet challenge the 
“color" triplet model in the spacelike region. 
(However, since the t c ~25 curve lies well above 
the existing data already definitively rule out 
a precociously asymptotic simple quark-triplet 
model.) Evidently, the curves in Fig. 3 rise 
rapidly with t c and show that if the annihilation 
cross section should remain constant at roughly 
2lxl0" 33 cm 2 in the region 25 ^ f ^ 81, which will 
be accessible at SPEAR II, a precociously asymp¬ 
totic “color" triplet model would be ruled out in 
the spacelike region. 

To explore the consequences of an annihilation 
cross section which remains flat up to large t Ct 
we ignore the vector-meson contributions to T obs 
and approximate by taking R(t) *0, 

t< 2; R(£)«0.24£, 2^£«25, giving the simple 
analytic expression 

T obs (-s) = f^ ^77p x 0-24 1 


= °- 24 H 7 ^l - 


(tn ” 2) 


(s + t c )(s +2)J 


*0.24 


In 


s +t c 

. 5 



Hence, 


Tobs (*~^ ) 

l/s 


R(t c )f(z), 


R(t G ) ~ 0.24^ c r 

f(z) = | ln(l + z) - ^ = t c /s . 

1 T <i 


(16) 


(17) 


A simple maximization shows that f(z) attains a 
maximum of 0.22 at z M ‘ x = s u /t c = 0.46, and falls 
to half maximum at z L ~ 1 =s L /t c = 0.053 and z v ~ l 
~Su/t c - 3.22. That is, T obs (-s)/s" 1 reaches a 
maximum value 

[T ob s(-s)A" 1 ] max =0.22x0.24^ c 

= 0.053 t c , (18a) 


and lies above half this value in the wide range 

0.053^s « 3.22/ c . (18b 

To give a concrete illustration, if o(e + e~ 

— hadrons; t) should remain constant up to the 
maximum t c of 900 obtainable in a 15 GeV/c on 
15 GeV/c storage ring, the maximum of T ob ,(-s)/ 
s' 1 would be 0.053x900 = 48. This would exclude 
by a factor of 2 parton or asymptotically free 
models with C « 24, thus covering just about every 
model which has been seriously proposed. 


III. MUONIC-ATOM X-RAY DISCREPANCY ANDtf a -2 

Recent studies of the transition energies between 
large circular orbits in muonic atoms have shown 
persistent discrepancies between theory and ex¬ 
periment. Because the muonic orbits in question 
lie well outside the nucleus and well inside the 
innermost K -shell electrons, one believes that 
nuclear size and electron screening corrections 
can be reliably estimated. In particular, the 
disputed nuclear-size corrections to the vacuum- 
polarization potential have been reevaluated re¬ 
cently by three independent groups, 7 in good agree¬ 
ment with one another. A survey of all known 
theoretical corrections has been given by Watson 
and Sundarasen 2 (see also Rafelski et al , a ), with 
the conclusion that all important effects within 
the standard electrodynamic theory have been 
correctly taken into account. On the experimental 
side, independent measurements by the groups 
of Dixit et aL 9 and of Walter et at. 10 agree on x-ray 
transition energies which deviate by 2 standard 
deviations from the theoretical predictions, as 
summarized in Table II. While it may still turn 
out that systematic experimental errors or errors 
or omissions in the theoretical calculations ac¬ 
count for the discrepancy, we will assume this 
not to be the case. Rather, we will treat the dis¬ 
crepancy as a real effect, to be explained by mod¬ 
ifications in the conventional theory. 

The unique aspect of the muonic-atom transition 
energies is that, because the muonic orbits lie 
well inside the electron Compton wavelength, they 
receive a large contribution from the electronic 
vacuum-polarization potential and (unlike the 
more accurate Lamb-shift experiments) they 
probe the asymptotic structure of this potential. 
Motivated by this observation, our principal focus 
will be to explore the possibility that the observed 
x-ray energy discrepancy arises from a nonper- 
turbative deviation of the electronic vacuum po¬ 
larization from its expected value. Such an effect 
is qualitatively expected (but with unknown quan¬ 
titative form) if recent speculations that the fine- 
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TABLE II. Muonic atom 

x-ray discrepancies. 


Element 

Z E 

Transition 

-i,_i 

—6E y ~ Average 

Ey (th) — Ey (expt) discrepancy -6Ey 

(eV) (eV) 

Reduced average discrepancy 

-6E 

iF ' 

0 y 4.35 eV xZ 2 [l/(n - l) 2 - l/n 2 ] 

20 Ca 

* d 3/2~* 2 Pin 
^5/2 P 3/2 

7±19 1 

11 ± 1 7 j 

9 ± 13 

(37.2 ± 54)x 10” 3 

22 Ti 

3 ^3/2 _ * 2 / > l/2 
^5/2 P 3/2 

-3±l9\ 

l0±18j 

3.5 ± 13 

(12.0± 45)x 10” 3 

26 Fe 

3 ^3 /2~* 2 Pl/2 
3 d5/2 _ * 2 i , 3/2 

21±201 

1G±17 i 

15.5± 13 

(37.9 ± 32) x 10” 3 

33 Sr 

4 /5/2“* >3< f3/2 

4 /t/2 _ * 3 ^5/2 

11 ±201 

0 ±18 J 

5.5 ±13 

(18.0±43)x 10~ 3 

47 Ag 

4 /5/2"* 3 d3/2 

4 /T/2 _ * 3 d5/2 

27 ± 201 

19±20J 

23 ±14 

(49.3 ± 30) x 10" 3 

43 Cd 

4 /5/2 _ * 3 d3/2 

Vt/2“** 3 ^5/2 

13 ± 19 \ 

7 ± 17 ) 

10± 13 

(20.5±27)x 10" 3 

50 s11 

4 /5/2"* 3 d3/2 

4 /7/2"* 3 d5/2 

21 ± 21 1 

2 5 ± 19 / 

23 ± 14 

(43.5±26)x 10" 3 

56 

%/2~*%n 

4 /7/2"* 3 d5/2 
^7/2 "* 4 / 5/2 

Si/2 fa 2 

55±23l 

76 ± 20) 

12±17l 

3 ± 16 / 

65.5± 15 

7.5 ±12 

(98.8± 23) x 10" 3 

(24.4± 39)x 10" 3 

30 

*$1/2“**/ 5/2 
Si/2~* f 7/2 

52 ±241 

38 ±25 / 

45 ± 17 

(71.8± 27)x 10" 3 

ii T1 

Vt/2 _ * 4 /5/2 

b Si/2~**fl/2 

31 ± 24 1 

40±24 J 

35.5 ±17 

(55.3 ± 26) X 10* 3 

32 Fb 

V7/2 4 /5/2 

5 ^9/2"* 4 /t/2 

52 ±21 1 

45 ± 18 / 

48.5 ± 14 

(73.7 ±2l)x 10“ 3 


structure constant a is electrodynamically de¬ 
termined prove to be correct. 11 We will also 
briefly consider an alternative explanation which 
has been advanced to explain the x-ray discrepan¬ 
cy, the possible existence of a weakly coupled 
light scalar boson. 12 

To calculate the effects of a possible discrepan¬ 
cy in the electronic vacuum polarization we start 
from the Uehling potential written in spectral 
form, 


V(r) = 




p.W. 




Wj 1 ' 2 


(19) 


This potential contributes to muonic-atom ener¬ 
gies through the diagram shown in Fig. 4(a). 

Since Eq. (20) is a small perturbation and since 
the muon orbits of interest are appreciably larger 
in radius than the muon Compton wavelength, in 
evaluating matrix elements of 6 V(r) we make the 
approximation of using nonrelativistic hydrogenic 
muon wave functions. [The same approximation 
applied to Eq. (19) yields the Uehling energy shifts 
for all of the levels in Table n to an accuracy of 
about 5%. 14 ] Thus we take 


n-lW 




-(Z/na 0 )r 






( 21 ) 


If we now assume that the spectral function p e [t] 
is changed by nonperturbative effects 13 to p e [t ] 

+ 6p[f], then V is replaced by V+&V , with 


6F(r) = 





giving for the change in transition energy pro¬ 
duced by 6 V(r) t 


6E 7 ^6E n ^6E n ^ 1 

= fr’drlR^^rf -fl n _ ln _ 2 M 2 ]6F(r). (22) 

' 0 


( 20 ) 
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Substituting Eq. (20) into Eq. (22), evaluating the 
r integral, and using a 2 /(37ra 0 ) =4.35 eV, we find 

_ 

* ' 4.35 evxz 2 [l/(n - l) 2 - l/« 2 ] 


-r 

J 4m 


2 jfyWp[t], 




(n -1) 5 


1/2 


1 + 


4*n „ 2 


n - 1 
Za 


-2(n-l) 


n 


1/2 


1 + 


n 


m 


Za 


-2 n 


/y[0]= 1 • 


(23) 



Finally, for convenience in doing the numerical 
work we make the change of variable 

* = 4m.V, (24) 

giving the formulas 

6 E y = f dw f y (w)&p(w ), 

J o 

f y (w) =/ r [4meV], (25) 

6 p(w) ~ 6p[4m e 2 e“']. 

Evidently, in the nonrelativistic approximation 
which we are using, the shifts in the transition 
energy 6 E y are j independent, and hence the two 
transitions for each w, l measure the same weight¬ 
ed integral of 6p(w>). Thus, for purposes of com¬ 
parison with Eq. (25) we average the two discrep¬ 
ancy values for each w, /, as shown in the fourth 
column of Table II. 15 The “reduced discrepan¬ 
cies” 6 E y introduced in Eq. (23) are tabulated in 
the final column of Table II. 

Before proceeding further with our discussion 
of the muonic x-ray discrepancy, let us turn to 
consider another electrodynamic measurement 
which is sensitive to the asymptotic electronic 
vacuum polarization, the muon g^ — 2 experiment. 
Here the conjectured deviation in the electronic 
vacuum-polarization spectral function contributes 
through the diagram of Fig. 4(b). Introducing the 
standard definition 

2 ) (26) 

and using well-known formulas 16 for the photon 
spectral-function contribution to a, we find that 
changing the electron vacuum-polarization spec¬ 
tral function induces a - 2 discrepancy 


FIG. 4. (a) Diagram by which a vacuum-polarization 
modification (denoted by the shaded blob) contributes to 
p- and e-atomic energy levels, (b) Diagram by which a 
vacuum-polarization modification contributes to g^ — 2 
and g e -2. (c) Diagram by which a scalar-meson con¬ 
tributes to p-atomic energy levels, (d) Diagram by which 
a scalar meson contributes to g^ —2. 

6a » = wf im t T* f ' [t]6plt] ’ 

0 

?■ (2 ’> 

Using a 2 /(3^ 2 ) = 1.80x10"® and making the change 
of variable of Eq. (24), we get the convenient 
formula 

= 1.80xl0"®{ f dwf a (w)&p(w ), 

J o 

/ fl W=/ fl [4m 0 V]. (28) 

The result of carrying out the integrations in the 
expression for f a (t) is given in Appendix B. 

Let us now return to our analysis of the muonic 
x-ray discrepancy. The kernels f y (w) for four 
representative transitions are plotted in Fig. 5. 

Our numerical evaluation shows that the six tran¬ 
sitions listed in Table m have weight functions 
f y which are nearly identical (their spread around 
curve b in Fig. 5 is less than one third of the 
spacing between curve b and curve a)\ averaging 
the weight functions for these transitions gives 
the function f y plotted in Fig. 6. Substituting the 
average of the reduced discrepancies for these 
six transitions into Eq. (25), we find 

(54.5±10)xl0" 3 = average of six (-6E y ) 

00 

dwf y (w)6p(w) , 



(29) 
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indicating that the sign of the discrepancy corre¬ 
sponds to a reduction in the electronic vacuum- 
polarization spectral function from its usual value 
of Eq. (19). Referring back to Fig. 6, we note 
that the function f a is always greater than/ r - 
Hence if we assume that 6 p(w) is always of nega¬ 
tive sign in the region where f a and / are non¬ 
zero [as might reasonably be expected if we are 
just entering a new region of physics where the 
discrepancy 6 p(w) is turning on], we learn that 

f dwf a (w)6p(w) « -(54.5 ± 10)x 10" 3 . (30) 

J o 

Comparing Eq. (30) with Eq. (28) we then get an 
inequality for the g - 2 discrepancy, 

/'6a^-l,80xi0~ 6 i(54.5±10)xl0~ 3 
5p«0=> = -(0.49 ± 0.09) X10“ 7 , 

\§a u /a u ^ -42±8 ppm. (31) 

A stronger prediction follows if, in addition to 
our assumption on the sign of 6p, we assume that 
the magnitude of 6p increases monotonically with 
t (again as might reasonably be expected for an 
effect just turning on). Then defining 



FIG. 5. Kernels f y for some representative transitions. 


TABLE m. Transitions with nearly identical f y . 


Element 

Transition 

Reduced average 

zE 

”l -l 

discrepancy ~6E y 

4T A £ 

4 /- 3 <i 

(49.3 ± 30) x 10“ 3 

4s Cd 

4 /- 3 <i 

(20,5 ±27) x 10” 3 

50^ n 

4 /- 3 <i 

(43.5±26)xl(T 3 

80^g 

h g- l f 

(71.8 ±27) x 10“ 3 

31T1 

5 ^- 4 / 

(55.3 ± 26) x 10“ 3 

32 

V- 4 / 

(73.7 ±21) x 10“ 3 

Weighted 

average of six 


reduced discrepancies a : 

(54.5 ± 10) x 10” 3 


a We have treated the errors as if they were purely 
statistical and have quoted the rms error for the average. 


f y (w)= 0, w< 0, (32) 

we find that we can represent f a (w) as a super¬ 
position of displaced curves f yt 

f a (w) = 1.016f y (w) 

rio .2 

+ I dw'c(w')f y (w -w ; ) , (33) 

J o 

with the positive weight function c plotted in Fig. 
6 . Multiplying by 6p(w) and integrating we get 



dw f a (w)6p(w) 



dw f y (w)6p(w) 


+ 



dw' c(w') 


x J dwf y (w-w')&p(w ). 

** o 

(34) 

But using Eq. (32) and the assumed monotonicity 



w 

FIG. 6. Plots of the kernels f y and f a [see the dis¬ 
cussion which follows Eqs. (28) and (29) of the text]. 
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of 6 p, we get 



w')6p(w)- I dwfJw)6p(w +w') 
J o 


* f dwf y (w)&p(w) , (35) 
J o 


and so we learn 



dwf a (w)6p(w )« 


/ 10.2 

dw* ctyu') 


x 



dw f y (w)6p(w) 


= 2.00x J dw f y (w)6p(w ). 

(36) 

Thus adding the assumption of monotonicity dou¬ 
bles the prediction of Eq. (30), giving 

6 p « 0 

I spl ♦ 

Equation (37) is the principal result of our analy¬ 
sis. 

Two remarks about Eq. (37) are in order. First, 
the discrepancy in a u predicted in Eq. (37) is 
compatible, within errors, with the present agree¬ 
ment of experiment with the conventional electro¬ 
dynamic prediction for a^, 17 


-(0.98±0.18)xi0 

V 

^ l&a^/a^ -84 ± 16 ppm . 


-7 


(37) 


gion. To get a crude estimate, let us make the 
(hopefully extreme) assumption thatR(f) rises 
linearly up to t- (460) 2 , where the one-photon an¬ 
nihilation cross section violates the J- 1 unitarity 
limit , 19 and cut off the integral at this point. This 
procedure suggests a bound on the high-energy 
hadronic contribution to a u of 15xl0' 9 . (Again 
see Appendix B.) 

(iii) Unified gauge theories of the weak and 
electromagnetic interactions which do not have 
charged heavy leptons typically give contributions 20 
to a in the range from a few to ten parts in 10 ‘ 9 . 
Specifically, the Weinberg-Salam SU(2)® U(l) 
model predicts a contribution to a u of less than 
9xl0' 9 . Thus, from (i), (ii), and (iii) we con¬ 
clude that the sum of unknown contributions to 
is likely to be no bigger than * 35 x 10 ' 9 , and 
hence should not mask the effect predicted in Eq, 
(37). 

Although we have shown that the inequality of 
Eq. (37) does not contradict the current — 2 ex¬ 
periment, we must still verify that it is possible 
to find specific functional forms 6 p(w) which fit 
the muonic x-ray discrepancy without seriously 
violating any of the conventional tests of QED, 
including the very high precision g e - 2 and Lamb- 
shift experiments . 21 A postulated vacuum-polari¬ 
zation discrepancy contributes to g e - 2 through 
the diagram of Fig. 4(b), giving a formula identical 
to Eq. (27) apart from the replacement of m u in 
f Q [t] by m e . The smallness of m e then permits use 
of the large-/ asymptotic expression \f Q ~m e 2 /($t ), 
giving the simple expression 


a u (expt) - a u (conventional Q ED) = (2.5 ± 3.1) x 10 " 7 

(38) 

However, it should be readily observable in the 
next g u - 2 experiment, where it is anticipated 17 
that the current experimental error of ± 3 ,lx 10" 7 
(= ±270 ppm in 6 aja^) will be reduced by a factor 
of 20. Second, the predicted effect is substantially 
larger than the likely remaining uncertain con¬ 
tributions to a Specifically, these are the fol¬ 
lowing. 

(i) The 8 th-order electrodynamic contribution 
to a u , which has been variously estimated 16 as 
6 x l 0 " 9 - 7 x 10 " 9 , with an uncertainty of perhaps a 
few parts in 10 “ 9 . 

(ii) The uncertainty in the hadronic contribution 
to a u . Including the p, co, and <£ resonances and 
integrating the e*e~ annihilation continuum up to 
t c - 25 gives a known hadronic contribution of 

71 x 10' 9 with an estimated uncertainty of ± 7 x 10" 9 
(see Appendix B). The unknown contribution of the 
e*e~ annihilation continuum beyond / c =25 will of 
course depend on the behavior of R(/) in that re¬ 


6 a e ~ 0 . 60 xi 0 - e f 

*'4 m e 2 * * 

= 0.15x 10 ” 6 f dwe“"6pfr). (39) 

•'o 

Comparing Eq. (39) with the current difference 
between experiment and theory for a e , 22 

^(experiment) - ^(conventional QED) 

= (5.6± 4.4)x 10 " 9 , (40) 


we get the restriction 



dw e~ w 6p(w) = (37±29)x 10“ 3 . 


(41) 


Next we consider the Lamb shift, which receives 
contributions from a vacuum-polarization dis¬ 
crepancy via the diagram of Fig. 4(a). Working 
again in the nonrelativistic hydrogenic approxima¬ 
tion, we find for the change in the 2s-2/> Lamb- 
transition energy 
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TABLE V. Sample functional forms giving statistically satisfactory fits. 


(1) Fits minimizing x 

2 

2 


Functional form 6p(w ) 

V 2 

X 2 

10 t <5« m 

(MHz) a 

-0.0530(40- 3) 

12.1 

-1.9 

0.08 

-0.0n(~^- J’ 2 Q(w- 3) 

12.1 

-2.1 

0,08 

-0.1Yo(40-2) 

12.9 

-2.4 

0.08 


w 


Functional form 6p(u>) 


(2) Fits minimizing x 2 i 


X 2 ! 


6£(ff) (MHz) a 


—0.0660(40 - 3) 

— 2 \°> 2 

•0,088f —- e(u>~ 3) 

w / 

- 0.21 Jo(w-2) 


7.9 

-2.3 

0.10 

7.9 

-2.6 

0.10 

8.1 

-3.1 

0.11 


(3) Reduced discrepancies predicted by the fit -0,071 [(4o-3)/4o]° ,2 0(4o-3) 


Z 20 

Transition n-+n- 1 3-*2 

22 

3 — 2 

26 

3—2 

38 

4-*3 

47 

4-* 3 

48 

4 — 3 

50 

4-* 3 

56 

4 — 3 

56 

5 — 4 

80 

5—4 

81 

5 — 4 

82 

5 — 4 

-10 3 x6£ r (expt) 37.2 

12.0 

37.9 

18.0 

49.3 

20.5 

43.5 

98.8 

24.4 

71.8 

55.3 

73.7 

±54 

±45 

±32 

±43 

±30 

±27 

±26 

±23 

±39 

±27 

±26 

±21 

—10 3 xdE y (fit) 40.6 

46.2 

57.3 

30.5 

42.5 

43.8 

46.5 

54.2 

21.6 

40.0 

40.8 

41.6 


a See the comment in Ref. 25. 


shift data, while the slow growth above the turn¬ 
on is needed in order not to violate the current 
limits on deviations in g u - 2. 

(iii) All of the good fits satisfy 5p% -0.03 for 
large w. This is a general feature for any mono¬ 
tonic form 6p which is small in the Lamb-shift 
region w ^ 2, since (using the fact that J y -0 for 
we have 

-54.5xl0" 3 = f dwj y (w)bp(w) 

“'o 1 

« j dwj y (w)&p(w) 

*■'2 


is in the Lamb shift in muonic helium. 26 Applying 
Eq. (42) to this system (and noting that the 2 p 
level here lies above the 2s level), we find 


6£([ 4 He,p] + )s6£ 2p _* 2S 


6ir 


r dt _mo_ 

+t 1 '*a a /Z? 


= h a ^f;f 0 dw fHMmw), 


[1 +(rn e /m ll a)e w/2 f’ K °- 13 • 

(51) 


^ 6p(9) f dwf y (w) = 1.6x 6p(9), 

J 2 

(49) 

that is, 

-0.034 s 6p(9). (50) 

Possible implications of Eq. (50) for QED tests 
involving timelike photon vertices will be dis¬ 
cussed elsewhere. 25 

One additional place where a vacuum-polariza¬ 
tion discrepancy should produce interesting effects 


TABLE VI. Results of step-function fits. 


Functional form 6p 

X 2 2 

10 7 6a y 

6£(ff) (MHz) a 

—O.OO40(«j — 0.5) 

38 

-0.23 

0.08 

—0.0140(40 —1.5) 

21 

-0.69 

0.10 

—0.0320(40 — 2.5) 

14 

-1.3 

0.09 

-0.0720(40-3.5) 

12 

-2.3 

0.07 

—0.160(40 — 4.5) 

13 

-4.0 

0.06 

—0.370(40 — 5.5) 

22 

-6.7 

0.05 

-O.390(io-6.5) 

43 

-5.0 

0.02 


a See the comment in Ref. 25. 
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Numerical evaluation of Eq. (51) shows that f He (w)/ 
0.13 lies within 20% of f y (w) in the range 0 * 6 

where neither is vanishingly small. Hence inde¬ 
pendent of the detailed form of 6p, we find the 
prediction 

6£([ 4 He, nf)~ |-a^ x (-54.5 x 10' 3 )x0.13 

07T Ytl ^ 

* -0.027 eV, (52) 

which may be an observable effect. 

At this point let us conclude our examination of 
vacuum-polarization effects and turn to an alterna¬ 
tive explanation for the muonic x-ray discrepancy, 
the possible existence 12 of a weakly coupled sca¬ 
lar, isoscalar boson <p. Interest in this explana¬ 
tion has been stimulated by the fact that such 
particles (with undetermined mass) are called 
for in unified gauge theories of the weak and elec¬ 
tromagnetic interactions. Letting g and 
denote the <f>-muon and the (^-nucleon scalar 
couplings, and the <£ mass, the potential pro¬ 
duced by <j> exchange between a muon and a nu¬ 
cleus of nucleon number A [Fig. 4(c)] is the simple 
Yukawa form 27 



\jg^)NN » 6 N$ 

4ir r 


(53) 


Since a repulsive potential is required to remove 
the x-ray discrepancy, fitting Eq. (53) to the x-ray 
data will necessarily give g^g^j} < 0. As shown 
in Appendix C, this sign for the product of cou¬ 
plings is not possible in the simplest forms of 
gauge models, in which there is only one physical 
scalar meson and in which the chiral SU(3)® SU(3) 
symmetry-breaking term in the strong-interaction 
Lagrangian transforms as pure (3,3)© (3,3). 
Nonetheless, let us proceed in a purely phenome¬ 
nological fashion and make a quantitative fit of 
Eq. (53) to the x-ray data. Replacing 6F(r) in 
Eq. (22) by V 0 (r), we find 

6F 

6E r*W2Z) = 2 - 82xl °V0,p£W r [^/] > 


(54) 

with 6 Ey the “reduced discrepancy 1 ' appropriate 
to a potential which couples to A rather than to 
Z . The experimentally measured values of 6 E y 
are tabulated in Table VTI. Since in all gauge 
models the </>-electron coupling is expected to 
be of order (m 9 /m ^g^, the <£ will have a neg¬ 
ligible effect on the electron g e - 2 and Lamb- 
shift measurements. So in fitting Eq. (54) to the 
data we minimize x 2 i defined in Eq. (48), giving 
the results shown in Table VUI, in good agree¬ 
ment with the results quoted by Sundaresan and 
Watson. 28 


TABLE VII. Reduced discrepancies for scalar-meson 
calculation. 


Element 

zE 

Transition 

-i 

Reduced average 
discrepancy -6E y 

20 Ca 

3 <*-V 

(37.2± 54) x 10“ 3 

22 Ti 

3 rf-^ 

(ll .0±41) x 10" 3 

26 Fe 


(35.1 ± 30) xl0“ 3 

3&Sr 

4 /- 3 rf 

(15.5 ± 37) x 10“ 3 

iAS 

4 /- 3 d 

(42.9 ±26) x 10“ 3 

4s Cd 

4 /- 3 d 

(17.5± 23) x 10“ 3 

50 Sn 

4 /- 3 d 

(36.6 ±22) xlO” 3 

56^ 

4 /- 3 rf 

(81.0± 19) x 10" 3 


5 s- 4 / 

(20.0 ±32) x 10“ 3 

solfe 

5 s- 4 / 

(57.0 ± 21) x 10" 3 

»iTl 

5 ^-* 4 / 

(43.9 ± 21) x 10" 3 

S2 Pb 

V- 4 / 

(58.5 ±17) x 10“ 3 


Since a light scalar boson, as well as a vacuum- 
polarization anomaly, can satisfactorily fit the 
x-ray discrepancy, let us examine ways of distin¬ 
guishing between the two possible explanations. 
First we consider the muonic-helium Lamb shift. 
Since fnt(w)*f y (w) for 0^w>^6, a scalar boson 
in the mass range from 1 to 22 MeV predicts an 
effect within about 20% of -0.027 eV, while for 
scalar bosons lighter than 1 MeV (corresponding 
to w <0), the muonic-helium Lamb shift decreases 
as 


< 56 » 

where M $ is in MeV. Hence the muonic-helium 
experiment could only distinguish between a very 
light scalar boson 29 and the joint possibilities of 
a heavier scalar boson or a vacuum-polarization 
effect. On the other hand, the muonic vertex 
correction involving scalar-meson exchange 
makes a small positive 20 contribution to a UJ as 
distinct from the sizable negative contribution 
predicted by a vacuum-polarization anomaly. So 
the next generation of g p - 2 experiments should 
unambiguously distinguish between the vacuum- 


TABLE VIII. Results of scalar-meson fits. 


M ^ (MeV) 

v 2 

A i 


0.5 

8.1 

—1.3 x 10“ T 

1 

7.9 

-1.4 x 10“ T 

4 

6.8 

-2.0 xlO" 7 

8 

6.1 

—3.8 x 10“ T 

12 

6.5 

-6.9xl0" 7 

16 

7.5 

-1.2x10"® 

22 

10.1 

—2.5x10”® 
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polarization and scalar-meson explanations for the tion of the total for large -s) to an accuracy of 
muonic-atom x-ray discrepancy. about 15%. 
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APPENDIX A: VECTOR-MESON PARAMETERS 

For the co and <f> vector-meson parameters we 
take 30 


= 784 MeV, r(co - e + e') =0.76 keV, 
= 1019 MeV, r(<£-eV) = 1.36 keV. 


(Al) 


For F w (t) we use the Gounaris-Sakurai formula 31 
with an co —2 tt interference term, 30 

p m _ M P 2 (i + &r p /Mp) 

1 " M 2 - t + H(t) - iMpT 9 (k/k rfMj'l~t 


+ Ae 


i a 


M 


OL 


H(t ) = 


_ r pM 2 


k 3 
R p 


r w ’ 

ty 2 \h{t)-h(M p 2 )] 

+ k 2 k'(M 2 )(M 2 -t)}, 


V TT yfl \ 2m ff 


h'(M 2 )* 


1 


m n 


In 


f Mp +2k p 


p 2rn 2 m p 3 k p *“\ 2m 

k = (kt-m v 2 f 2 , k p = (iMp 2 -m i, 2 ) 1 ' 2 , 

6 B ll2 (u - ee) T v 
A ccM^ 2 ’ 2 B 

<*,-(.-W, 


(A2) 


with the following values for the parameters 30 : 
5=0.6, a =86°, 


M p = 775 MeV, 

r p = 149 MeV , 
m* =140 MeV, 


T w = 9.2 MeV, 

(A3) 

B 1/2 (co -ee) = 0.906 x 10’ 2 , 
B 1/2 (co-2tt)=0.19. 


As discussed in Appendix B, approximating the 
small-* region in this fashion as a sum of co, <p, 
and p contributions should yield the small-* con¬ 
tribution to T obs (-s) (which is only a small frac- 


APPENDIX B: FORMULAS FOR/ a [*J AND THE 
HADRONIC CONTRIBUTION TO -2 

The function /„[*] appearing in Eq. (27) has been 
evaluated by Brodsky and de Rafael, 32 who find 

0«f «4m fJ 2 , r = */4 m / , 

K (*) = \ - 4r - 4r(l - 2r) ln(4r) 

/ T \ 1/2 (l - t\ 1/2 

- 2(1 - 8r + 8T 2 ) — j arctan^——j ; 

. > 4 2 l-(l-4mp 2 A ) 1/2 

*" 4m " ’ *" l+(l-4 m 2 /t) lli! ’ 


1^2 


K{t)^x 2 (Z-x 2 ) + (l + xf(\ + x 2 )^^^^ 

X 

^ + X x 2 lnx. (Bl) 


1-x 


Corresponding to the division of T obs (-s) into four 
pieces in Eq. (15), we write the hadronic contribu¬ 
tion to a u as 

i r 

a v “ I dto(e*e~~ hadrons; t)K(t) 


= a“ +rb +a p l +a™ l{1) +a™' {2) . (B2) 

Working in the same narrow-resonance approxi¬ 
mation as in the text, we find for ajf" 0 the ex¬ 
pression 8 


«r*- Z \k(m y 2 ) 


r(v-gV) 

My 


(B3) 


while a p p is given by the integral 





3/2 


(B4) 


Substituting the parameters from Appendix A and 
evaluating Eqs. (B3) and (B4) numerically gives 


a p * 0 = 9.1 x 10 ” 9 , a J = 45xlO“ 9 , 


a 


tot small I 


r54xl0’ 9 . 


(B5) 


A more elaborate evaluation of the small-* con¬ 
tribution has been given by Bramon, Etim, and 
Greco, 33 who sum the contributions of the various 
important hadronic states directly from Eq. (B2), 
giving 


a g* 1 smaU 1 «(61 ± 7) x 10 " 9 , (B6) 

indicating that our method of treating the small-* 
region is good to about 15%. To evaluate a“ nt(1) 
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and a£ ont(2) we approximate K(t) by its asymptotic 
form 

(B7) 

giving [in units where unity = (1 GeV/c) 2 ] 


a”" t(1) =6.7xl0- 9 F* 

J 0.39 * 


a”" t(2) =6.7xio- 9 

J 25 * 


(B8) 


Evaluating a° ont(1) numerically using the data 
plotted in Fig. 1 gives 

a“ nt(l >=9.6±2, (B9) 


with the error a rough guess. Thus the total 
known hadronic contribution to a u is 

(61±7)xi0" 9 +(9.6±2)xl0" 9 = (71±7)xi0" 9 . 


(BIO) 


Estimating the unmeasured contribution by as¬ 
suming a linearly rising R(t) up to t c = (2.230) 2 , 
we get 

a co n i(2) =6i 7xl0- 9 x0.24xin(^^) 


= 15x 10 -9 , (Bll) 

as stated in the text. 

APPENDIX C: SIGN OF THE SCALAR-MESON EXCHANGE 
POTENTIAL IN SIMPLE GAUGE THEORIES 

Consider a gauge theory of the weak and electro¬ 
magnetic interactions in which only one scalar 
field <f> develops a vacuum expectation, <p-~ <p + \, 
as a result of spontaneous symmetry breaking. 
Since X is the source of the lepton masses, the 
interaction Hamiltonian (= - the interaction La- 


grangian) coupling <p to the muons is 34 


<p 


(Cl) 


Since in the hadronic sector X is the origin of 
chiral SU(3)® SU(3) symmetry breaking, the inter¬ 
action Hamiltonian coupling <p to the hadrons is 34 


^0 hadron “ ^ ® ^chiral breaking 


(C2) 


Hence the sign of is the same as the sign 

of ( N\ 63C chiralbrMktag |fV>. Now if 65C chiral ^ reafcini . trans- 
forms under SU(3)® SU(3) as (3,3)© (3,3), then 
using the notation of Gell-Mann, Oakes, and 
Renner 35 we readily find that 


(N | 63C 


chiral breaking 


\ n)= ^£^jl 


(/2+c)/2 


+ 


c - -L)(jv|m 8 |jv>, 


= _/2 


™ K 1 = -1.25, 

m/ +zm n 2 


(C3) 


(N|M 8 |iV) = baryon mass splitting parameter 
= 170 MeV, 

s s(^2 + c) (N I V"2 u 0 + u s |iV>. 


That is, we have 

(N\ 63 C chiIalbrMkin2 |iV> = 12.9a^-333 MeV. 

(C4) 


Recent determinations of the o term o nNN suggest 
a value in the range 45-85 MeV, 36 making 
{N\ 6K chira j brMltta2 |JV> positive and giving an attrac- 
tive scalar-meson exchange force. A value of 
o VNN smaller than 25 MeV would be needed to 
make the scalar-meson exchange force repulsive, 
as is required to explain the muonic x-ray 
discrepancy. 


*For a summary of the recent data, see R. Gatto and 
G. Preparata, Phys. Lett. 50B , 479 (1974). 

2 A good review of the relevant experiments and theory 
is given by P. J. S. Watson and M. K. Sundaresan, 

Can. J. Phys. 52 , 2 037 (1974). 

3 L. Silvestrini, in Proceedings of the XVI International 
Conference on High Energy Physics , Chicago-Batavia, 
III., 1972, edited by J. D. Jackson and A. Roberts 
(NAL, Batavia, Ill., 1973), Vol. 4, p. 1. 

4 T. Appelquist and H. Georgi, Phys. Rev. D 8^ 4000 
(1973); A. Zee, ibid. 8, 4038 (1973). 
s See, for example, C. H. Llewellyn Smith, CERN Report 
No. CERN TH. 1849, 1974 (unpublished); N. Cabibbo 
and G. Karl, CERN Report No. CERN TH. 1858, 1974 
(unpublished). 


8 We follow the treatment of the vector-meson contribu¬ 
tions to cr(e*e“-~ hadrons;f) given by M. Gourdin and 
E. de Rafael, Nucl. Phys. B10 , 667 (1969). 

7 J. Arafune, Phys. Rev. Lett. 32, 560 (1974); L. S. 

Brown, R. N. Cahn, and L. D. McLerran, ibid. 32_, 

562 (1974); M. Gyulassy, ibid. 32, 1393 (1974). 

S J. Rafelski, B. Muller, G. Soff, and W. Greiner, Univ. 

of Pennsylvania report (unpublished). 

3 M. S. Dixit etal. t Phys. Rev. Lett. 27, 878 (1971). 

10 H. Walter etal. t Phys. Lett. 40B , 197 (1972). 

**S. L. Adler, Phys. Rev. D 5, 3021 (1972); K. Johnson 
and M. Baker, ibid. 8^ 1110 (1973). 

12 M. K. Sundaresan and P. J. S. Watson, Phys. Rev. Lett. 
29, 15 (1972); L. Resnick, M. K. Sundaresan, and 
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P. J. S. Watson, Phys. Rev. D8, 172 (1973). 

13 To make the definition of 6p precise, we are assuming 
6p to be an extra contribution to the vacuum-polariza¬ 
tion two-point spectral function above and beyond the 
usual second- and fourth-order electron and muon 
contributions, of a magnitude much larger than the 
naive perturbative estimate of the sixth- and higher- 
order terms. Because Za is not a very small param¬ 
eter for some of the atomic Species being considered, 
in establishing the existence of the muonic anomaly 
it is important to take into account 4,6,... -point 
vacuum-polarization amplitudes in which one vertex 
emits a photon coupling to the muon and all the re¬ 
maining vertices couple to the nuclear Coulomb po¬ 
tential. Numerically, the contribution of such diagrams 
to the x-ray energies is only a few percent of the 
Uehling potential contribution, so we neglect possible 
nonperturbative modifications of the higher-point func¬ 
tions. 

l4 Just to set the energy scale involved, the Uehling en¬ 
ergies are typically 0.2-3 kev out of total transition 
energies of 150—500 keV. 

ls We assume independence of errors and add errros in 
quadrature for both the muonic discrepancies and the 
Lamb-shift measurements. If systematic errors are 
large, then the errors for average quantities will be 
larger than those quoted, making the constraints on 
the x z fits less restrictive than those we have assumed. 

l6 B. Lautrup, A. Petermann, and E. de Rafael, Phys. 

Rep. 3C, 193 (1972). 

l7 V. W. Hughes, in Atomic Physics 3 , edited by S. J. 
Smith and G. K. Walters (Plenum, New York, 1973), 
p. 1; B. Lautrup (unpublished). 

1S B. Lautrup, Phys. Lett. 38B , 408 (1972); M. A. Samuel, 
Phys. Rev. D 9, 2913 (1974). 

19 C. H. Llewellyn Smith, Ref. 5; N. Cabibbo and R. Gatto, 
Phys. Rev. 124, 1577 (1961). 

20 R. Jackiw and S. Weinberg, Phys. Rev. D£, 2396 
(1972); I. Bars and M. Yoshimura, ibid. 6_ ( 374 (1972); 

J. R. Primack and H. R. Quinn, ibid. 6^, 3171 (1972). 

21 The atomic fermium K a x-ray measurements of P. F. 
Dittner et al. [Phys. Rev. Lett. 2£, 103 (1971)1 only 
probe the Uehling potential to an accuracy of about 20%. 
See B. Fricke et al., Phys. Rev. Lett. 28, 714 (1972). 
Constraints imposed by timelike photon vertex mea¬ 
surements are discussed in S. L. Acfler, R. F. Dashen, 
and S. B. Treiman, following paper, Phys. Rev. D 10, 
3728 (1974). 

22 N. M. Kroll, in Atomic Physics 3, edited by S. J. 

Smith and G. K. Walters (Plenum, New York, 1973), 
p. 33. 

23 The data are taken from Ref. 16, except for Li ++ , 
where we use the more accurate measurements of 
P. Leventhal and P. G. Havey, Phys. Rev. Lett. 32_, 

808 (1974). 

24 The conclusion that satisfactory fits can be found has 
also been reached by F. Heile (unpublished), using a 
momentum-space parametrization of the vacuum- 
polarization discrepancy. 

25 S. L. Adler, R. F. Dashen, and S. B. Treiman (in 
preparation). We show in this paper that a decrease in 
the vacuum-polarization spectral function necessarily 
implies a decrease in the vertex for emission of a time¬ 
like photon, and tests of the effect are suggested. 
ObviouBly, modifications in vertex parts will at some 


level make contributions to the electrodynamic process¬ 
es discussed in the present paper above and beyond the 
direct effect of the postulated vacuum-polarization dis¬ 
crepancy. There seems at present to be no way of 
estimating the size of such additional contributions; 
about all one can say is that they are likely to be most 
important in the electron Lamb shift and g e — 2 experi¬ 
ments, where only one mass scale ( m e ) is involved and 
the vacuum-polarization and photon-electron vertex 
parts are off-shell to a similar degree. Hence the 
electronic Lamb-shift predictions of Tables V and VT 
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the muon energy level and g u — 2 experiments, two 
mass scales (both m e andnz M ) are involved, with the 
electron vacuum-polarization loops much further off- 
shell (relative to their natural mass scale) than are the 
photon-muon vertices. Thus in this case the neglect of 
possible vertex modifications, which is implicit in all 
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2e E. Campini, Lett. Nuovo Cimento 4, 982 (1970); P. J. S. 
Watson and M. K. Sundaresan, Ref. 2. As both of these 
references emphasize, in order for muonic helium to 
be useful for electrodynamics tests, current uncertain¬ 
ties in the helium nuclear charge radius and nuclear 
polarizability will have to be reduced. 

27 The factor (47T)" 1 in Eq. (49) appears to have been 
omitted in the basic paper of R. Jackiw and S. Wein¬ 
berg, Phys. Rev. D 5, 2396 (1972) and in subsequent 
papers quoting their formulas. 

23 Because of the factor of 4n mentioned in Ref. 26, our 
(£0 M jT£0jvjv)/ 4!r should be compared with g^g^sR 
of Ref. 2. Omitting the 20 Ca, 22 TL 2 gFe, and 3S Sr dis¬ 
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find an effective coupling of —7.6x 10" T at M<j> = 12 MeV, 
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tron and Photon Interactions at High Energies, edited 
by N. B. Mistry (Laboratory of Nuclear Studies, 
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p. 121. 

38 H. Pilkuhn et al. t Nucl. Phys. B65 , 460 (1973). See 
especially pp. 480—481. The estimate of Eq. (C4) is 
also given by E. Reya f Rev. Mod. FhyS. 46, 545 (1974) 


Reya writes M B =M 0 + <B|63C chiralbreaking |B) f where M 0 
is the baryon mass in the absence of SU(3)<8> SU(3) 
breaking. For the (3,3)® (3,3) case, he finds M 0 
= 1300 MeV- 130-^jy, so for in the range 45-85 
MeV the mass M 0 is less than the nucleon mass, making 

<^l<53C ch iraIbreaking!^) positive. 
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/ = = contributions to v M +Af+7r° in the Weinberg weak-interaction model 
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We use a detailed dispersion-theoretic model for pion production in the (3,3)-resonance 
region to calculate the ratio 

<r(vu +n-*vy +n + tt°) + c(v u +p — +p + ir°) 

~ — m" +/> + 

in the Weinberg weak-interaction theory. We find that I = i contributions do not substantially 
modify the earlier static model calculation of R given by B. W. Lee. 


Neutral-pion production by neutrinos appears to 
be one of the best reactions for searching for the 
hadronic weak neutral current predicted by the 
Weinberg weak-interaction theory. 1 In fact, if one 
accepts the bound 

slrf0y<O.35 (1) 

given by Gurr, Heines, and Sobel, 3 the static-mod¬ 
el calculation by B. W. Lee 3 of 

_ g (Vu +n- Vy +n + tP)+a(v u v u + 

2a(pp +«— p. - +p +7r°) 

( 2 ) 

in the Weinberg theory is already in conflict with 
existing experiments 4 in complex nuclei. Two es¬ 
sential cautions are necessary, however, before 
concluding that the Weinberg theory is ruled out. 
First, charge-exchange effects are important in 
complex nuclei, and may result in an experimen¬ 
tally measured value of R which is smaller than 
the true single-nucleon-target value by a factor of 
up to 2. 6 Second, the static-model approximation, 
which neglects I=\ s-channel contributions to the 
reactions in Eq. (2), has the effect of overestimat¬ 
ing R. 6 If the / = i corrections are large enough, 
then, together with charge-exchange corrections, 
they may move experiment and theory back into 
agreement. 

In this note we report the results of calculating 


the l = i corrections to R using the detailed disper¬ 
sion-theoretic model of weak pion production in 
the (3, 3)-resonance region which we developed 
some time ago. 7 The model is basically a gener¬ 
alization to weak pion production of the old CGLN 
model for pion photoproduction. 8 Nonresonant 
multipoles are treated in the Born approximation,® 
while the resonant (3, 3)-channel multipoles are 
obtained from the Born approximation by a unitar - 
ization procedure. The model is in excellent 
agreement with pion photoproduction data, 7 agrees 
well with pion electroproduction data up to a four - 
momentum transfer of fe a »0.5 (GeV/c) 2 , 7 and is al¬ 
so in satisfactory accord with the recent Argonne 
measurements of weak pion production. 10 Because 
all terms contributing to the weak-production am¬ 
plitude in the model are proportional to nucleon 
elastic form factors, the model fails badly in the 
region fe a »0.5 (GeV/c) 2 , where scaling effects be¬ 
come visible and leptonic inelastic cross sections 
decrease more slowly with increasing k 2 than elas¬ 
tic form factors squared. Fortunately, this re¬ 
gion of large k 2 makes a relatively small contribu¬ 
tion to the individual cross sections in Eq. (2), and 
the errors will furthermore tend to cancel between 
numerator and denominator. 

In the Weinberg model, the effective Lagrangian 
for the semileptonic strangeness-conserving weak 
interactions is 


j 


£ = COS0 C { MT'vU +Ys) l/ ii& v 1 v 2 + v 1 + tJ v 2 ) 

+^7x(l+y 5 KUr i U -2 sin?0 v ) + J\ 3 -2 ain 2 0 w J !] + •••}, (3) 

where we have shown both the charged- and the neutral-current terms contributing to Eq. (2). In terms of 
the isospin matrix elements defined in Eqs. (2B.4) and (2B.5) of Ref. 7, the hadronic matrix element of the 
neutral current is 

out (* N \ J P(1 - 2 sin***) + J f -2 3in 2 e w j{\N) = (1 - 2 sin 2 0*)[<* ( l /2) V ' ( £ /2) Wj /a) V' ( J /2) ] 

-2 sin 2 ^a (0) +a ( l /2) A ( l /2) +a (l E /2) A ( i /2) . (4) 


9 229 


Copyright © 1974 by the American Physical Society. Reprinted with permission. 




230 


STEPHEN L. ADLER 


9 


The amplitudes appearing in Eq. (4) are all ones 
■which appear in either the pion-electroproduction 
or the weak-production calculations of Ref. 7, and 
so R can be evaluated by a simple adaptation of the 
computer routines used in the earlier work. The 
result of such a calculation is shown in Fig. 1, 
where we have assumed an incident lab neutrino 
energy = 1 GeV and a nucleon axial-vector elas¬ 
tic form factor 

1.24 

^ (fe2) = [ 1 +Jfe 2 /(1 GeV/c) 2 ] 2 ’ (5) 

and have integrated over the (3, 3)-resonance re¬ 
gion up to a maximum isobar mass of W = 1.47 
GeV. Curve a gives the result obtained from our 
model when both resonant and nonresonant multi¬ 
poles are kept; curve b is the corresponding re¬ 
sult obtained when only the resonant multipoles 
are kept, and hence when l = i amplitudes are ne¬ 
glected. As expected, curve a lies below curve 6, 
but the effect of the / = £ corrections is not dramat¬ 
ic. For comparison, we give in curve c the result 
obtained from Lee’s static-model calculation. 11 If 
one assumes that 9 W is restricted as in Eq. (1), and 
includes a safety factor of 2 for charge-exchange 
effects, curve a is barely consistent with the pres¬ 
ent experimental upper bounds. Put conservative¬ 
ly, our calculations indicate that an experiment to 
measure R at the level of a few percent should be 
decisive. 

Note added in proof. Recently, the possible ob¬ 
servation of neutral current events has been re¬ 
ported in deep-inelastic inclusive neutrino reac¬ 
tions by the CERN Gargamelle group. 12 If con¬ 
firmed, this experiment will establish the exis- 



FIG. 1. Ratio R of Eq. (2) vs Weinberg angle By. 

Curve a —resonant and nonresonant multipoleS; curve b — 
resonant multipoles only; curve c—resonant multipoles 
in Lee’s static-model calculation. 

tence of neutral currents; however, more detailed 
questions, such as whether the phenomenological 
form of Eq. (3) is correct, will require the inde¬ 
pendent study of many different neutral-current 
induced reactions, among them the pion-produc- 
tion reaction considered in this note. 
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We discuss nuclear charge-exchange corrections to leptonic pion production in the region of the (3, 3) 
resonance, both from a phenomenological viewpoint and from the evaluation of a detailed model for 
pion multiple scattering in the target nucleus. Using our analysis, we estimate the nuclear corrections 
needed to extract the ratio 

R = [o(i» M +n-* Vp + n + n°) + <r{v^ + p -+ v,, + p + it 0 )] / + n + p + ir°) 

from neutral-current search experiments using , 3 A1 27 and other nuclei as targets. 


I. INTRODUCTION 


Although weak-interaction experiments on hy¬ 
drogen and deuterium targets are most readily in¬ 
terpreted theoretically, experimental considera¬ 
tions necessitate the use of complex nuclear tar¬ 
gets in many of the current generation of acceler¬ 
ator neutrino experiments. As a result, in such 
experiments, corrections for nuclear effects must 
be made in order to extract free nucleon cross 
sections from the experimental data. Our aim in 
the present paper is to analyze these corrections 
in a particularly simple case: that of leptonic 
single-pion production in the region of the (3, 3) 
resonance. This reaction has gained prominence 
recently because measurement of the ratio 


R _ q(p y + v u +«+7r°)+q(^u — i/ u +p + ir°) 

2a(y M + w— ii'+p + n 0 ) 

appears to be one of the better ways of searching 
for hadronic weak neutral currents. 1 Experiments 
measuring R use aluminum (and in some cases 
also carbon) as target materials, so the experi¬ 
mentally measured quantity is actually (T', T" 
denote unobserved final target states) 


B VtA g(t^M + T-V„ + T'+ir°) 

' ' 2ff(i/ (I + T-M" + 7" + 7r 0 )’ 


r= 13 Ai 27 , 




As Perkins has emphasized, 2 nuclear charge-ex- 
change effects can cause substantial differences 
between R and R' t which makes reliable estima¬ 
tion of these effects an important ingredient in 
correctly interpreting the experiments. Fortu¬ 
nately, pion production in the (3, 3) region is also 
a particularly favorable case for theoretical anal¬ 
ysis, primarily because the elasticity of the (3, 3) 


resonance implies that nuclear effects will not 
bring multipion or other more complex hadronic 
channels into play. 

Our discussion is organized as follows. In Sec. 

II we introduce our basic phenomenological as¬ 
sumption: that leptonic pion production on a nuclear 
target may be represented as a two-step com¬ 
pound process in which pions are first produced 
from constituent nucleons with the free lepton- 
nucleon cross section (apart from a Pauli-prin¬ 
ciple reduction factor), and subsequently undergo 
a nuclear interaction independent of the identity of 
the leptons involved in the production step. This 
assumption allows us to isolate nuclear effects 
(pion charge exchange and absorption) in a 3x3 
“charge-exchange matrix/* We analyze the struc¬ 
ture of the charge-exchange matrix in the par¬ 
ticularly simple case of an isotopically neutral 
target nucleus. [For 13 A1 27 , with a neutron excess 
of 1 and a corresponding isospin of the approxi¬ 
mation of isotopic neutrality should be quite ade¬ 
quate. For 6 C 12 , of course, no approximation is 
involved.] Our main phenomenological result is 
that parameters of the charge-exchange matrix, 
which can be measured in high-rate pion electro¬ 
production experiments, can be used to calculate 
the nuclear corrections to the weak pion produc¬ 
tion process and in particular to give the connec¬ 
tion between R' and R . In Sec. HI we develop a 
detailed multiple-scattering model for the charge- 
exchange matrix. Our model is quite similar to a 
successful calculation by Sternheim and Silbar 3 of 
pion production in the (3, 3)-resonance region in¬ 
duced by protons incident on nuclear targets, and 
uses the nuclear pion absorption cross section 
which they determine (as well as the experimental 
pion-nucleon charge-exchange cross section) as 
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input. The principal difference between our cal¬ 
culation and that of Sternheim and Silbar (apart 
from obvious changes stemming from the fact that 
they deal with a strongly absorbed rather than a 
weakly interacting projectile) is that we use an 
improved approximation to the multiple-scattering 
problem, based on a one-dimensional scattering 
solution introduced by Fermi in the early days of 
neutron physics. 4 Using our model for the charge- 
exchange matrix, and a theoretical calculation of 
electroproduction and weak production of pions 
from free nucleons which has been described else¬ 
where, 5 we present detailed predictions for R' in 
the Weinberg weak-interaction theory and some of 
its variants. We also use the production model to 
test averaging approximations implicit in the phe¬ 
nomenological discussion of Sec. II. In Sec. IV we 
summarize briefly our conclusions. Three ap¬ 
pendixes are devoted to mathematical details. In 
Appendix A we formulate and solve the one-dimen¬ 
sional scattering problem which forms the basis 
for the approximate solution of the three-dimen¬ 
sional multiple-scattering problem actually en¬ 
countered in our model. To calibrate this approx¬ 
imation, in Appendix B we compare the approxi¬ 
mate solution with the exact multiple-scattering 
solution for the simple case of isotropic scattering 
centers uniformly distributed within a sphere. 
Finally, in Appendix C we collect miscellaneous 
formulas for cross sections and for Pauli exclu¬ 
sion factors which are needed in the text. 

H. PHENOMENOLOGY 
A. Kinematics 5 

We consider the leptonic pion-production reaction 

l{k 1 )+T~l'(k 2 ) + T'+ 7T <10) , (3) 

with k L and k 2 respectively the four-momenta of 
the initial and final lepton l and V , with T a nu¬ 
clear target initially at rest in the laboratory, and 
with T f an unobserved final nuclear state. Let fe 2 
= (k L - k 2 Y be the leptonic invariant four-momentum 
transfer squared, and let fef 0 - k 2Q be the lab¬ 
oratory leptonic energy transfer to the hadrons. 
Corresponding to the three pionic charge states 
in Eq. (3) we have three doubly differential cross 


sections with respect to the variables k 2 and 
which we denote by 

d 2 a(lVT;± 0) 

dk 2 dk\ * {4) 

When the target T is a single nucleon N (of mass 
Mjf), below the two-pion production threshold the 
recoil target T f must also be a single nucleon, 
and we can specify the kinematics more precisely. 
We write in this case 

UkJ + N(p l ) - V(k 2 )+N’(p 2 ) + n ii0) (g ), (5) 

with the hadron four-momenta indicated in paren¬ 
theses. We denote the final pion-nucleon isobar 
mass by W, 

W z = {p 2 +qf. (6a) 

This variable is evidently related to the leptonic 
energy transfer fej by 

r 2 = M/-|i!! 2 |+2M ff feJ. (6b) 

B. Factorization assumption 

We now introduce a factorization assumption 
which is basic to all of our subsequent arguments. 
We assume that leptonic pion production on a nu¬ 
clear target may be regarded as a two-step com¬ 
pound process. In the first step of this process 
pions are produced from constituent nucleons of 
the target nucleus with the free lepton-nucleon 
cross section. In the second step the produced 
pions undergo a nuclear interaction, dependent on 
properties of the target nucleus and on the kine¬ 
matic variables k% and (possibly) k 2 , but indepen¬ 
dent of the identities of the leptons involved in the 
first step. Since we are considering only excita¬ 
tion energies below the two-pion production thresh¬ 
old, the nuclear interaction in the second step in¬ 
volves only two types of processes, (i) scattering 
of the pion and (ii) absorption of the pion in two- 
nucleon or more complex nuclear processes. In 
particular, the two-pion production channel cannot 
come into play, and hence the factorization as¬ 
sumption allows us to write a simple matrix rela¬ 
tion between the cross sections for leptonic pion 
production on nuclear and on free nucleon targets. 
We have 


d&dkf 


dWll' N r ; +)„ 
dk z dk L 0 

d 2 o(U' *T A ; 0) 
dk 2 dk{ 

= [M( z T A ;kX)] 

d*o(ll’Nr; 0), 
d&dkt 

dWtT*;-) 

dk 2 dk% 


d a o(ll' N T ',—) r 
d&dkt 


(7) 
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with 


d 2 o{U'N r :±0) r d 2 o(ll'P',±0) P 


dk*dk 


0 


+ u _ z) aw^ 


( 8 ) 


an appropriately weighted linear combination of 
free proton and free neutron cross sections. The 
subscript F indicates that these cross sections 
are to be averaged over the Fermi motion of the 
individual target nucleon in the nucleus, which 
substantially alters the shape (but not the inte¬ 
grated area) of the (3, 3) resonance 7 when plotted 
versus the excitation energy k%. In writing Eq. (7) 
we have obviously used rotational symmetry, 
which implies that when the angular variables of 
the pion emerging from the nucleus are integrated 
over, no dependence remains on the angles char¬ 
acterizing the initial production of the pion. The 
matrix Af appearing in Eq. (7) is a 3x3 “charge- 
exchange matrix” which is independent of the na¬ 
ture of the initial and final leptons l and V. In ad¬ 
dition to including pion scattering and absorption 
effects, M also takes into account the reduction of 
leptonic pion production in a nucleus resulting 
from the Pauli exclusion principle . 8 We will keep 
this effect in M in the ensuing phenomenological 
discussion, but when we make our multiple-scat¬ 
tering model in Sec. Ill we will separate it off as 
an explicit multiplicative factor. 


C. Structure of M 


equal to zero. 9 (See Added Note preceding Ac¬ 
knowledgments.) As noted in the Introduction, this 
approximation is reasonable for the targets of 
greatest experimental interest. With this restric¬ 
tion, the pion charge structure of the matrix M n 
(/, f = ±, 0) is that of the inclusive reaction 

7 + T(l = 0) — ?r f + T '(unobserved) , (9a) 

or equivalently, of the forward scattering pro¬ 
cess 

7T i + 7f / + T(/=0) — 7T|+ 7fy+ T(/=0) . (9b) 

Since the isospin of the system tt, + fi f can be ei¬ 
ther 0, 1, or 2, we conclude that the matrix M fi 
involves three independent parameters. We intro¬ 
duce them by writing 

M fi = A(c - d)ip f - ipi iff ' ft + Ad iff 

+ A(1 — c — 2d)ipf '(pf , (10) 

with if { and the isospin wave functions of ir t and 
7 r ft respectively. Substituting 



and writing our Eq. (10) component by component, 
we get the basic form 



M +0 

M+- 



M a - 


M. o 

M _ 


=A 


1 — c — d 
d 
c 


d 

1-2 d 
d 



Up to this point Eq. (7) applies to all nuclei, even 
those with a large neutron excess. In order to 
simplify the subsequent discussion we now restrict 
ourselves to the case of isotopically neutral tar¬ 
gets, with the neutron excess and isotopic spin 

- ■ 


It is useful to consider the form which the above 
equations take when no distinction is made be¬ 
tween tt + and tt~, but only between charged and 
neutral pions. Equation (7) is then replaced by 


d*o(ll' 7 T A \+) d 2 a(ll' t T a \ —) 


d 2 o(ll‘N T -, + )r dMU’Nr,-) r 

dk*dk% ' dtfdk L a 

= [N( z T A ;k 2 k l 0 )] 

dk 2 dk% ' dk 2 dk% 

d z a(ll' Z T A \ 0) 


d 2 a(WN T ;0) F 

dk 2 dk L a 

-J 


dk 2 dk% 



with N a 2x2 matrix. When the target T is iso¬ 
topically neutral, the matrix N can be expressed 
in terms of the parameters of Eq. (12), giving 

(N* + tfchoV ,/!-d 2d \ 

Woch *oo ; \d 1 — 2d) * (14) 

The dependence on the parameter c has dropped 


out, leaving only two parameters which determine 
the nuclear corrections in this case. 

D. Applications 

Equations (13) and (14) can be applied in two 
ways. First, they can be used to generate a spe¬ 
cific theoretical prediction for the ratio R ' of Eq. 
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(2), by integrating with respect to k 2 and fej [with the latter integration extending only over the (3, 3)- 
resonance region] to give 

o(u, + T -„ M+ r + 0= f&dkiAlTi 


+ [l-2rf(T;fe^)J rf2g( X^ r;0V } . 


oU+T-ii'+T' + tt 0 ) 


- fdJ?dk%A(T; U; 


+ 


[1-2 d(T;k*ki)] 


d 2 a(i/ u ii~N T \ 0)jr 
- dk 2 dkf 



In Sec. m we will evaluate Eq. (15) (and thus R') 
using our multiple-scattering model for M and the 
weak pion production calculation of Ref. 5 as in¬ 
puts. 10 

The second way of applying Eqs. (13) and (14) is 
to use them in a purely empirical fashion to ex¬ 
tract the charge-exchange matrix parameters 
A(T;fe 2 feJ) and d(T;fe 2 fe£) from a comparison of pion 


r.. ■ "■ ...— 

electroproduction on free nucleons with pion elec- 
troproduction on a nuclear target T. Specifically, 
we find from Eqs. (13) and (14) that 


d(T;k 2 kf;) 


r(eeT)- r(eeN T ) 

[2 - r(eeN T )] [1 + r(ee 70] 9 


(16) 


with 



the electroproduction charged-pion-to-neutral-pion ratios on targets T and N T , and 


t.\ r d 2 o(eeT;+) "*a(eeT; 0) d 2 o(eeT; -)~\\ d 2 o{eeN r \ +) F 

MT,kk 0 ) = l ■ fodJ + - ^ dk L + dk 2 dk l a JL dk 2 dk L 0 


Once d(T; k 2 k%) and A(T; k 2 k%) have been extracted 
from electroproduction data by use of Eqs. (16)- 
(18), they can be substituted into Eqs. (13) and (14) 
and used to calculate the nuclear corrections to 
toeak pion production on the same target T. Note 
that Eqs. (16) and (17) for d are independent of the 
absolute normalization of the electron cross sec¬ 
tions, and that A , which does depend on absolute 
normalization, appears as a simple multiplicative 
factor in both numerator and denominator of Eq. 

(2) for R*. Hence the relation between R and R* 
given by our empirical procedure is also indepen¬ 
dent of the absolute normalization of the electron 
cross sections used to extract A and d . 

In many applications it is convenient to deal not 
with the doubly differential cross sections of Eq. 
(4), but rather with these cross sections integrated 


d 2 o(eeN r i 0) y d 2 o{eeN T \ -) y ~[~ 1 

d&dk\ d&dk% J ’ 

( 18 ) 


in excitation energy over the (3, 3)-resonance re¬ 
gion, 

d 2 a(U'T;± 0) f ,. L d 2 a{U'T\ ±0) 

dk 2 "Ain—c.^ 0 dk 2 dk L 0 • 

(19) 

In order to write simple formulas directly in 
terms of these integrated cross sections we note 
that, to a good first approximation, the fej depen¬ 
dence of the doubly differential cross sections is 
governed by the dominant (3, 3) channel, and hence 
is independent of the identities of l and V and of 
the pionic charge. This near-identity of excitation 
energy dependence allows us to make the averag¬ 
ing approximation of replacing Eqs. (7), (12), (13), 
and (14) by equations of identical form written di¬ 
rectly in terms of the cross sections of Eq. (19), 
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~do(ll’T-,+)l 


~da(,ll' N t \ +)~ 

dk 2 


dk 2 

do{U f T\ 0) 

= [M(T; k 2 )] 

do(ll'N T \0) 

dk 2 

dk 2 

do(ll'T;-) 


do(ll'N r ;-) 

dk 2 


dk 2 


( 20 ) 


dajll'T; +) dojll'T;-) 
dk 2 + dk 2 

do(ll'T-, 0) 
dti 2 


= [MT,fe 2 )] 


do(ll'N T ; +) da(ll'N T ; -) 
dti 2 + dk 2 


dadl'Nr, 0 ) 

dk 2 


with M and N the lepton-independent matrices 

/I -c-d d 

[M(T-,k 2 )]=X I d 1-2 d 

\ c d 

Because the excitation energy k% has been inte¬ 
grated over, 7 we can use free nucleon cross sec¬ 
tions on nucleon targets at rest in the right-hand 
side of Eq. (20); hence we have omitted the sub¬ 
script F which indicated smearing of the produc¬ 
tion cross section over nucleon Fermi motion. 
(Any residual effects of nucleon Fermi motion on 
the excitation-energy-integrated pion production 
cross sections will, in this formulation, be ab¬ 
sorbed in the phenomenological matrices M and 
N.) The formulas for extracting A(T ; k 2 ) and 
d(T ; fe 2 ) from electroproduction data are identical 
in form to Eqs. (16)—(18): 



HtT ■ h r(eeT)~r(eeN T ) 

[2-HeeN T )][l+HeeT)] ' 


dk* 


dojeeT ; 0) 


-i 


yt peN ) - [MeeN T ; +) do(eeN T ;-) 


dk 2 

da(eeN T ;0y'- 1 


dk ' 


A(T\ k 2 ) = 


da(eeT\+) do(eeT\ 0) da(eeT;-) 

da{eeN T \ +) do{eeN r \ 0) do(eeN T ; -) 

i — 

& 

N 

& 

b) 

a. 

N 

L dk 2 ' dk 2 + d& 


In terms of d and A , the expression for R* analogous to Eqs. (2) and (15) is 


R'(T) = 


Jdk 2 A(T; k 2 )jd(T; + ~Y 


2 Jdk?A(T;k 2 )\d(T;k z ) 


da(u li ji~N T ;+) da(^ 1 i4~iV' T ; -) 
dk 2 + dti 2 


+ [l-2d(r;fe 2 )] dg( ^ r - ?0) l 


(22a) 


(22b) 


+ [l- 2 d(T; fe 2 )] ^^ fe > 0) | ' 


(23) 


Equations (20)-(23) are in a form convenient for 
direct comparison with experimental data, and 
constitute our principal phenomenological result . 

We continue by introducing one further averaging 
approximation. To the extent that d(T* k 2 ) and 
A(T;k 2 ) are slowly varying functions of k 2 (and this 
is suggested by the numerical work of Sec. m) we 
can replace them by average values d(T) and A(T) 
in the integrals of Eq. (23). The parameter A(T) 
then cancels between numerator and denominator 
and the integration over k 2 can be explicitly car¬ 
ried out. We are left with a simple formula re¬ 
lating R ' to R, 


R'(T)=R 


d(T)r(v» VuN t ) + 1 - 2 d(T) 

d(T)r(v tl tJ.-N T )+l-2d(T) 9 


(24) 


i 

with 


„ N ) = ?(^ v P n t> + ) + A ^ ^ N T t ~) 
r(v^N T ) oUv v N t ; 0 ) 


M 




o( v» M ~N r ; +) + o(u li p. ~N T \ -) 

0 ) 



the charged-pion-to-neutral-pion ratios produced 
on an average nucleon target by neutral and 
charged weak currents, respectively. In the ap¬ 
proximation of Eq. (24), nuclear charge-exchange 
effects are isolated in the single parameter d(7). 
This description is particularly useful for giving 
a simple comparison of the charge-exchange cor¬ 
rections expected for different nuclear targets T. 
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E. Discussion 

We conclude by pointing out an experimental 
problem which will limit the direct applicability 
of the phenomenological results of Eqs. (16)-(23). 
In all of the above equations, we have assumed 
that the angular variables of the produced pion are 
unobserved, which corresponds to an experimen¬ 
tal situation in which the acceptance for produced 
pions is 4 tt sr. However, in realistic experiments 
observing the weak production and electroproduc¬ 
tion of pions, the pion acceptance will, in general, 
be rather small. Since the pion angular distribu¬ 
tions do depend on the leptons involved in the pro¬ 
duction process, 11 the introduction of acceptance 
restrictions will tend to spoil the simple relation 
between nuclear charge-exchange corrections to 
weak production and electroproduction of pions 
which we have developed above. There are two 
possible ways of dealing with this problem. One 
would be to simply go ahead and apply Eqs. (21)- 
(23) to the limited-acceptance case, interpreting 
the cross sections on T and N T as being limited to 
the actual pion acceptance. If both the value of d 
extracted from electroproduction 12 and the pion 
charge ratios observed in weak production were 
found to be only weakly acceptance-dependent, one 
would have an a posteriori justification for apply¬ 
ing the phenomenological recipe of Eq. (23) to tY\e 
acceptance-limited case. An alternative procedure 
would be to develop a detailed model for the 
charge-exchange parameters d, c, and A , and then 
to numerically fold these charge-exchange correc¬ 
tions into experimental or theoretical cross sec¬ 
tions for pion production on a free-nucleon target, 
taking acceptance limitations into account. Al¬ 
though, in this approach, one would forego the 
possibility of direct phenomenological application 
of electroproduction data, a comparison of the the¬ 
ory with electroproduction experiments on nuclear 
targets would still be essential to test (and pos¬ 
sibly revise) the charge-exchange model. Once 
validated in this way, the charge-exchange param¬ 
eters could be substituted into Eqs. (15) and (23) 
to generate predictions for weak-production ex¬ 
periments. The question of constructing a suitable 
model for the charge-exchange parameters will be 
pursued further in Sec. m. 

III. MULTIPLE-SCATTERING MODEL 

We proceed in this section to develop a detailed 
multiple-scattering model for nuclear charge-ex¬ 
change corrections. Our motivations are, first, 
to get an estimate of the magnitude of charge-ex¬ 
change corrections to be expected for various tar¬ 
get nuclei, and second, as discussed above, to 
facilitate comparison with experiment in the real¬ 


istic case in which there are pion acceptance lim¬ 
itations. 


A. Formulation of the model 

Our model closely resembles (with differences 
which we explain below) a successful semiclassi- 
cal treatment of tt* production in proton-nucleus 
collisions which has been given by Sternheim and 
Silbar. 3 The ingredients of the model are as fol¬ 
lows: 

(1) We regard the target nucleus as a collection 
of independent nucleons, distributed spatially ac¬ 
cording to the density profile determined by elec¬ 
tron scattering experiments. For aluminum and 
lighter nuclei, it is convenient to parameterize 
the nucleon density in the so-called “harmonic 
well” form 


p(r) = p(0)e- r2/R2 \l + + c i(^) 



with the values of the various parameters given in 
Table I. 

(2) In discussing pion multiple scattering within 
the target nucleus, we regard the nucleons as fixed 
within the nucleus, thus neglecting Fermi motion 
and nucleon recoil effects. [A numerical estimate 
of the importance of these effects will be made in 
Sec. mB2 below.] This approximation allows us 
to characterize interactions of the pion with the 
constituent nucleons by a unique center-of-mass 
energy W, related to the lepton energy transfer 

ko by Eq. (6b). Through all stages of the multiple 
scattering we approximate the target nucleus to be 
isotopically neutral, composed of equal numbers 
of protons and neutrons. (See Added Note.) 

(3) Interactions of pions in the nucleus are 


TABLE I. Nuclear density parameters. 3,15 


Nucleus Z T A 

C 

C 1 

R (F) (Ref. c) 

Rp (0) (P) -2 

T}10 

5 B 

1 

0 

2.45 

0.251 

r 12 

e u 

1.333 

0 

2.41 

0.268 

7 N 14 

1-667 

0 

2.46 

0.263 

B o 16 

1.600 

0 

2.75 

0.247 

■3A1" 

2.000 

0.667 

1.76 

0.241 


a The data are taken from H. R. Collard, L. R. B. Elton, 
and R. Hofstadter, in Landolt-Bomstein: Numerical 
Data and Functional Relationships; Nuclear Radii, 
edited by K.-H. Hellwege (Springer, Berlin, 1967), 

New Series, Group I, Vol. 2. 
b The density p(r) is normalized so that Jd*rp{r)=A. 
c For the first four nuclei, the rms charge radius is 
equal to R. For aluminum, the rms charge radius 
corresponding to the listed parameters is 2.91 F. 
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treated in the approximation of complete incoher¬ 
ence, involving the use of pion-nucleon cross sec¬ 
tions rather than scattering amplitudes in the 
multiple-scattering calculation. In the region of 
the (3, 3) resonance, pion production and more 
complex hadron production channels are closed, 
and so there are only two relevant cross sections. 
The first is the cross section per nucleon oa> t (W) 
for pion absorption via various nuclear processes; 
for this quantity we use the best-fit value obtained 
by Sternheim and Silbar in their study of pion pro¬ 
duction by protons, 


o+JW) = 



0, 


mbx— 


— 0.788Af 
2.077M, * 


7\ <0.788M W 

W n 

T V >0.1SSM V 


scattering experiments at small momentum trans¬ 
fer k 2 provide some empirical evidence for the 
presence of the production factor g . The argument 
for including h is less compelling, since we are 
using a semiclassical picture, with fixed constit¬ 
uent nucleons, for treating the pion multiple scat¬ 
tering in the nuclear medium, and in a semiclas¬ 
sical picture there are no Pauli effects. To take 
this objection 13 into account, in the numerical work 
below we also calculate results for the case in 
which h is replaced by unity. 

(5) The approximation of keeping only / = f pion- 
nucleon scattering allows us to reduce the prob¬ 
lem of calculating the charge-exchange matrix M 
to a one-component scattering problem. To see 
this we let 


(27) 


where 


, W 2 -{M„ + M v y 
2 M 


N 


To allow for the considerable uncertainties in this 
expression for a,b« we examine numerically the 
effect on the charge-exchange corrections of mul¬ 
tiplying Eq. (27) by factors of i or 2. (See also 
Added Note.) The second cross section needed is 
the usual elastic cross section for pion-nucleon 
scattering. Since in the (3, 3) region the / = \ pion- 
nucleon cross section is very small, we neglect it 
entirely and regard all pion-nucleon scattering as 
proceeding through the / = f channel. The elastic 
cross section is then simply proportional to the 
cross section 


V+P 


(w) 


(28) 


for which a simple parameterization is given in 
Appendix C. In order to solve the pion multiple¬ 
scattering problem, we actually need the differ¬ 
ential cross section for elastic scattering; in the 
approximation of (3,3) dominance, this is given 
by 


do daitic 

dQ 


cco^ 


p (W)( 1 + 3cos 2 4>), 


(29) 


with (j> the pion scattering angle. 

(4) When a pion is produced by leptons incident 
on a nucleus or undergoes subsequent rescatter¬ 
ings, with small momentum transfer to the nuclear 
system, the corresponding production or scatter¬ 
ing cross section is reduced by the Pauli exclusion 
principle. 8 We take this effect into account, with¬ 
in the framework of the independent nucleon pic¬ 
ture, by multiplying the pion-leptoproduction 
cross section and the pion-nucleon rescattering 
cross section by respective reduction factors 
g(W , k 2 ) and h(W , <£). Formulas for these factors 
are given in Appendix C. Neutrino quasielastic 



(30) 


denote the pion charge multiplicities initially pres¬ 
ent in a beam of pions, at a fixed isobar energy 
W. A simple isospin Clebsch analysis then shows 
that when the pion beam undergoes a single scat¬ 
tering on an equal mixture of protons and neutrons 
through the / = J channel, the effect is to replace 
^ by with Q the matrix 


«- 


Sl X 
6 6 

X 2. 
6 3 

.0 4 - 



(31) 


Obviously, the natural way to describe a multiple¬ 
scattering process in which Q acts on ijt repeatedly 
is to decompose ^ into a sum of eigenvectors of 
Q. These eigenvectors, with their corresponding 
eigenvalues X, are 





Q2 



> - Sl 
* A 2 8 


(32) 



_ _l_ 


and the decomposition reads 
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^^llC k q bt 


Cj = i[ n t (v*) + n t (ir°) + m,(iT)], 

C 3 = -in^TT 0 ) + £[ n x (tt + ) + n { (ir ~)]. 


(33) 


The effect of a multiple-scattering process on Eq. 
(33) will be to lead to a final pion multiplicity state 
ip f , related to by 


f^h)C b q b , 


(34) 


with /(x) a function of the eigenvalue X which con¬ 
tains all geometric and dynamical information con¬ 
cerning nuclear parameters, magnitudes of cross 
sections, etc. Taking now to be the initial dis¬ 
tribution of lepto-produced pions in target T, 


d 2 o(ll'N r ;+) 

dk 2 dk{ 


<l>t=g(w,k 2 ) 


d 2 o(ll'N T ; 0) 
dk 2 dk% 

d 2 o(ll'N T ‘, —) 
dk 2 dki 


and ip y to be the distribution of exiting pions, 


d 2 o(ll'T\ +) 
dk 2 dk% 

d 2 o(ll'Tj 0) 

dl?dk% 

d 2 a(ll'T\ -) 
dk 2 dk\ 


(35) 


*/ = 


(36) 


we find that the connection of Eq. (34) takes the 
form of Eqs. (7) and (12), with 


A=g(W,k 2 )a, a = /(l) 
c = 3-i/(f-)//(l)+i/(i)//(l) , 
rf-i[ l-/(i)//(l)J- 


(37) 


(6) We turn finally to the function /(x), which 
contains the dynamical details of pion multiple 
scattering in the nucleus. The precise statement 
of the problem defining /(x) is as follows: We in¬ 
troduce an initial distribution of monoenergetic 
pions into a nucleus, with the pion density propor¬ 
tional to the nuclear density [ given by Eq. (26)]. 
The pions are multiply scattered, with absorption 
cross section given by Eq. (27) and with elastic 
scattering cross section given by Eq. (29). At each 
elastic scattering the pion number is multiplied 
by a factor X. The function /(x) is then defined as 
the expected number of pions eventually emerging 
from the nuclear medium, normalized to unit in¬ 
tegrated initial pion density. 

To get a simple (and, it turns out, surprisingly 
accurate) approximation to /(x), we replace the 
actual angular distribution [Eq. (29) times 
k(W, cos<£)] by a modified elastic scattering dis¬ 
tribution, in which all forward-hemisphere scat¬ 
tering (0 $ (p 2) is projected onto the forward 
direction (<p~ 0), and all backward-hemisphere 
scattering (tt/ 2 is projected onto the back¬ 

ward direction (<£ = 7r). In this approximation, once 
a pion is produced in the nucleus, it scatters back 
and forth along its initial line of motion until it 
either is absorbed or it leaves the nucleus. Since 
both the initial pion distribution and the interac¬ 
tion probabilities are proportional to nucleon den¬ 
sity, the nucleon density profile along the line can 
be scaled out of the problem by an appropriate 
change in length variable. Thus, for each line 
passing through the nucleus the expected fraction 
of pions which exit is independent of the density 
profile along the line, but depends only on the in¬ 
tegrated density along the line (the so-called opti¬ 
cal thickness), which we denote by L. Once we 
have solved for the one-dimensional exit fraction 
/(x, L ), we need only average over the distribution 
of optical thickness in the nucleus to get an ex¬ 
pression for /(x). 

To put these remarks in quantitative form, let 
us take the central nucleon density p(0) as the 
“standard density” relative to which densities 
elsewhere in the nucleus are measured. For given 
impact parameter b relative to the center of the 
nucleus, the optical thickness is then given by 



(38) 
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Averaging over impact parameters, the relation 
between /(x) and /(x, L) is given by 


C bdb L(b) f(\, L(b)} 
fW ~ fbdbUb) 



The one-dimensional problem defining/(x, L) is 
formulated precisely as follows: We consider a 
uniform one-dimensional medium of length L , in 
which pions are uniformly initially produced mov¬ 
ing (say) to the right. The pions propagate in the 
medium with inverse interaction length k, given 
in terms of the nucleon density and the absorption 
and scattering cross sections by 

K=p(°)<7tol. , „ 

, (40) 

= *o r + p (W)[h+(W) + h_(W)]. 

The factors h+ and h_ describe the forward- and 
backward-hemisphere projections of the Pauli re¬ 
duction factor h(W, <£), 

J rff/2 

3iri(j)d<p(l+Zc03 2 <p)h(W i <p ) , 

0 

(41) 

^- = i I sin<£d0(l + 3cos 2 </M(W, <f>) , 

•'it/2 

and are explicitly calculated in Appendix C. At 
each interaction the pions are forward-scattered 
with probability /x + and back-scattered with prob¬ 
ability (and, of course, absorbed with probabil- 
ity 1 - m+ - M-), with 

M* = ia ir+> (W)A*(H0/a tol , (42) 


and, concomitantly with each scattering, the pion 
number is multiplied by a factor x. The desired 
quantity /(X, L) is the expected number of pions 
eventually emerging from the medium, normalized 
to unit integrated initial pion density. An explicit 
expression for / is calculated in Appendix A [see 
Eq. (AI2) and Eqs. (A25)-(A27)], as well as ex¬ 
pressions for / + and/_, the expected fractions of 
pions eventually emerging with and without a net 
reversal of direction of motion along the line. In 
Appendix B we compare the approximate solution 
for /(x) given by Eq. (39) with the exact solution 
in the simple geometry of a uniform sphere com¬ 
posed of material which scatters isotropically, 
and find very satisfactory agreement. Since the 
actual angular distribution of interest to us, 1 + 3 
cos 2 ^, is already peaked in the backward and for¬ 
ward directions, 14 our approximation should be at 
least as accurate for this case as it is for handling 
isotropic scattering. 

This completes the specification of our multiple¬ 
scattering model. As we have already noted, it 
closely resembles the calculation of Sternheim 


and Silbar, and the reader is referred to Ref. 3 
for an excellent, detailed analysis of the approxi¬ 
mations and physical assumptions which are in¬ 
volved. The aspects in which our model differs 
from that of Ref. 3 are the following: (1) We take 
into account the diffuseness of the nuclear edge, 
rather than treating the nucleon distribution as a 
uniform sphere; (2) we take Pauli exclusion effects 
into account in a crude way; and (3) we use an 
improved approximation for solving the pion mul¬ 
tiple-scattering problem. Instead of using the 
back-forward approximation described above, 
Sternheim and Silbar use the considerably less 
accurate approximation of treating all scattering 
as purely forward scattering. A comparison of 
their approximation with the exact solution, in the 
case of a uniform sphere composed of material 
which scatters isotropically, is given in Appendix 
B. 


B. Numerical calculations 

We turn now to numerical calculations, in which 
we combine our model for nuclear charge-ex- 
change corrections with the theory of electropro¬ 
duction and weak production of pions from free- 
nucleon targets developed in Ref. 5. For the had¬ 
ronic weak neutral current, we adopt the Wein¬ 
berg-model form 15 

j «"«'*' m j n +j a 3 _ 2 sln » flr j «■ . (43) 

we will say a few words below about variants of 
this model in which Eq. (43) contains an additional 
isoscalar current. We assume throughout an inci¬ 
dent lab neutrino energy k f 0 = 1 GeV and a nucleon 
elastic form factor 2 

= [l+Jfe a /(0.9 GeV/c) a ] a ’ * 44) 

and take integrations over the (3, 3)-resonance re¬ 
gion to extend from the pion production threshold 
up to a maximum isobar mass of W = 1.47 GeV. In 
our calculations on aluminum, we weight the free- 
nucleon production cross sections according to the 
actual neutron/proton ratio in aluminum (i.e., we 
take iV r =13/> + 14w), but as emphasized above, we 
adopt the approximation of isotopic neutrality in 
calculating charge-exchange corrections. 

1. Calculation of R ' from Eq. (15) 

(with Fermi motion, neglected) 

In Table II we present results for the ratio R ' 
on an aluminum target, calculated by using Eq. 

(15) to fold the W-dependent charge-exchange ma¬ 
trix into the production cross sections from a 
free-nucleon target at rest [i.e., we neglect the 
Fermi-motion average symbolized by the sub- 
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script F in Eq. (15)]. In the second column we 
tabulate 


olvr+N T -» u +N' T +* 9 ) 

^ T ’ 2<j(v v +N t - n'+N’i. +n°) ’ 


(45) 


which is the ratio predicted by the production 
model when no charge-exchange corrections are 
made. In the third through seventh columns we 
tabulate values of the charge-exchange-corrected 
ratio R f obtained under various alternative as¬ 
sumptions. The column labeled “no variations” 
is the result obtained from the multiple-scatter¬ 
ing model of Sec. m A above; the next three col¬ 
umns show how this result changes when the 
Pauli factors h in Eqs. (40) and (42) are replaced 


by unity, or when the absorption cross section of 
Eq. (27) is modified. The predictions for R f are 
evidently quite insensitive to these variations. 

The seventh column gives the result for R ' when 
all isoscalar multipoles are omitted. Since the iso¬ 
scalar multipoles only contribute quadratically to 
/2', 16 this column gives a lower bound on R ' for any 
variant of the Weinberg theory in which the had¬ 
ronic neutral current differs from Eq. (43) by 
purely isoscalar terms. In the final column we 
have used our production and charge-exchange cal¬ 
culations to generate simulated pion weak-produc- 
tion and electroproduction cross sections on alu¬ 
minum, which are then used to evaluate the lower 
bound on R f derived by Albright et al. 17 in the iso¬ 
scalar-target approximation, 


K'( 13 A1 27 )s 4 |[r(^/i 13 A1 27 ) l] I/2 -2sin 2 0 w ^^^.^ Al27;O) J j, 

(46) 


as shown in the second column. Finally, in the 
third column we give the prediction of 2.63:1 
which results when Eq. (15) and its analog for 
charged pions are used to fold in charge-exchange 
corrections for aluminum. 19 It would obviously be 
very desirable to try to check this prediction for 
f simultaneously with the experimental determina¬ 
tion of /?'. 


TABLE n. Calculations of R r ( 13 A1 2T ) based on Eq. (15). 

R'( 13 A1 2T ) Simulated 


sin 2 G w 

RWt)* 

No 

variations 

Pauli factors 
h-»l 

with 

Kb. 

with 

2°-.b. 

Isoscalar 
multi poles 
omitted 

Albright et al. 
lower bound 
on R' (i 3 A1 2T ) 

0 

0.697 

0.422 

0.396 

0.411 

0.435 

0.422 

0.408 

0.1 

0.573 

0.346 

0.325 

0.337 

0.356 

[0.346 

0.321 

0.2 

0.465 

0.280 

0.264 

0.273 

0.289 

0.280 

0.245 

0.3 

0.374 

0.225 

0.212 

0.220 

0.232 

0.225 

0.179 

0.4 

0.300 

0.180 

0.170 

0.176 

0.186 

0.180 

0.123 

0.5 

0.242 

0.146 

0.138 

0.143 

0.150 

0.145 

0.078 

0.6 

0.200 

0.122 

0.115 

0.119 

0.125 

0.120 

0.043 

0.7 

0.175 

0.108 

0.102 

0.106 

0.110 

0.106 

0.019 

0.8 

0.166 

0.104 

0.099 

0.102 

0.107 

0.102 

0.004 

0.9 

0.174 

0.111 

0.106 

0.109 

0.113 

0.108 

0.000 

1.0 

0.198 

0.128 

0.123 

0.126 

0.131 

0.125 

0.006 


rlv il ~ 13-^1 > + )+g(^VM isAl 27 ; -) 

r(u u n I3 A1 ) oiW 13 A1 , 0) 


7° = 

r cin 


G 2 cos 2 fl< 

IT 




We see that the bound of Eq. (46) provides a satis¬ 
factory estimate of R' for small values of sin 2 0 v . 

We turn next to Table III, where we have tabu¬ 
lated charged-pion to neutral-pion production 
ratios for the usual charged weak current. The 
first column gives the standard 5:1 prediction for 
an isotopically neutral target, assuming complete 
/ = ! dominance. When /=| multipoles are taken 
into account, 18 the prediction is lowered to 3.67:1, 


a The numbers in this column are slightly smaller than those plotted in curve a of S. Adler 
[Phys. Rev. D j), 229 (1974)1, because we have reduced the axial-vector mass parameter 
[See Eq. (44)1 from 1.0 to 0.9 GeV/c in the present calculation, and have also weighted the 
production cross sections according to the actual neutron/proton ratio in aluminum. 
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TABLE m. Charged-pion-to-neutral-pion ratio 


r{Vp\fn +p) 

rlv^iTN T ) 

r'^n-jjAl 27 ) 

r' iT 13 A1 27 ) 

pure (3, 3) 

with 1-2 

from Eq. (15) and 

from Eq. (48) 

approximation 

corrections 

charged-pion analog 


5 

3.67 

2.63 

2.68 


2 . Averaging approximations , comparison 
of different nuclei , and estimate of 
nucleon motion effects 

We conclude with a test of the averaging approx¬ 
imations introduced in Sec. n and a discussion of 
related topics. To study Eqs. (20)-(23), we fold 
together the electroproduction and charge-ex- 
change models, as in Eq. (15), to give simulated 
data for pion electroproduction cross sections on 
aluminum. Substituting this data ijito Eqs. (21) 
and (22) then gives the values for d and A tabulated 
in Table IV. The charge-exchange parameters ob¬ 
tained this way are seen to be nearly independent 
of the incident electron energy fef 0 , and are slowly 
varying functions of k 2 except in the region k 2 ^0.3, 
where Pauli exclusion effects and I=\ multipoles 
arising from the pion exchange graph become im¬ 
portant. Substituting the 2-GeV/c values of d and 
A into Eq. (23), and continuing to use our produc¬ 
tion model for the neutrino cross sections, gives 
the values of R' tabulated in the second column of 
Table V. In the third column we transcribe from 
Table n the values of R' obtained directly from 
Eq. (15); the good agreement indicates that the av¬ 
eraging approximation is working. 

We turn next to the "double-averaged” approxi¬ 
mation of Eqs. (24) and (25). We define the tilded 
charge-exchange parameters by averaging the 
charge-exchange matrix over the leading W-depen- 
dent part of the production cross section as ob¬ 
tained in the static approximation 80 : 


.(dWq(W)- 1 a <s ^(W)f(\) 

’ fdWq(W)-‘a (ii 3 ) (W) ’ 

5 = 7 ( 1 ), 

c = t- f/(■§•)//(i) + 17(f)/7(i), 

5 = 'j-[ i - 7(f)/7(i)] • 


(47) 


Expressions for the resonant pion-nucleon scatter¬ 
ing cross section <7(3 3 ) (W) and the pion momentum 
q( W) are given in Appendix C. Evaluating Eq. (47) 
for aluminum gives d( 13 Al 27 ) =0.162, which, when 
substituted into Eq. (24) along with the charged- 
to-neutral ratios tabulated in the second and third 
columns of Table VI, gives the predictions for R* 
tabulated in the fourth column. These agree well 
with the corresponding values of R ' obtained di¬ 
rectly from Eq. (15). As another test of the "dou¬ 
ble averaged” approximation, we consider the for¬ 
mula giving the charge-exchange corrections to 
the charged-to-neutral ratio f, 

u - a 2^(»A1 27 ) + [ 1 -d(, 3 Al”)]f(^M-*r) 
,3 ^ ’ l-2d( 13 A.l 27 ) + i( 13 M 27 )r(i'»H-N T ) • 

(48) 


Substituting r = 3.67, d = 0.162 into Eq. (48) gives 
r f =2.68, as tabulated in the final column of Table 
HI. This again is in close agreement with the val¬ 
ue of r ' obtained directly from Eq. (15). 

As we remarked in Sec. n, the double-averaged 
approximation provides a convenient format for 
comparing charge-exchange effects in different 


TABLE IV. Simulated d{ t 3 A1 21 ;k *) and A^jAl 27 ;* 2 ) obtained from electroproduction and 
charge-exchange correction models. 

k f 0 = 2 GeV/c _ fe f 0 = 6 GeV/c 


k 2 (GeV/c) 2 

d( 13 Al 27 ;* 2 ) 

A( 13 A1 27 ;* 2 ) 

d ( 13 Al 27 ;k 2 ) 

A( ls Al 27 ;* 2 ) 

0 

0.191 

0.606 

0.188 

0.608 

0.1 

0.181 

0.688 

0.179 

0.682 

0.2 

0.169 

0.702 

0-167 

0.694 

0.3 

0.164 

0.702 

0.162 

0.694 

0.4 

0.160 

0.698 

0.158 

0.690 

0.6 

0.157 

0.694 

0.154 

0.684 

0.8 

0.156 

0.690 

0.152 

0.680 

1.0 

0.155 

0.690 

0.150 

0.676 

1.4 

0.155 

0.688 

0.147 

0.672 

1.8 

0.156 

0.686 

0.145 

0.666 
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TABLE V. Test of first averaging approximation for 
R' (ijAI 27 ). 


sin 2 ^ 

3 Al 27 ) 

From Eq. (23) 

From Eq. (15) 

0 

0.433 

0.422 

0.1 

0.355 

0.346 

0.2 

0.288 

0.280 

0.3 

0.232 

0.225 

0.4 

0.186 

0.180 

0.5 

0.151 

0.146 

0.6 

0.127 

0.122 

0.7 

0.113 

0.108 

0.8 

0.109 

0.104 

0.9 

0.116 

0.111 

1.0 

0.134 

0.128 


nuclei. In Table VII we have tabulated the charge- 
exchange parameters 5, c, and d for a range of 
light and medium-weight nuclei up to aluminum. 
The key point to notice is that the parameter d is 
slowly varying, indicating that charge-exchange 
effects in different medium-weight targets, such 
as, for example, freon (CF 3 Br) and aluminum, 
should be quite similar. 

Finally, we apply the double-averaged approxi¬ 
mation to estimate the effect on our numerical re¬ 
sults of including nucleon Fermi motion and nucle¬ 
on recoil. Obviously, to include nucleon motion in 
a realistic way one would have to go outside the 
framework of the one-speed scattering theory used 
above, since once the nucleons are not regarded as 
fixed the pion changes energy in each collision. 
Rather than attempting to follow these energy 
changes in detail (which would require an elaborate 
numerical calculation), we adopt a simple approx¬ 
imation which can be treated by the methods used 
above. We observe that in the (3, 3)-resonance re¬ 
gion typical nucleon recoil momenta are of the 
same order as the nucleon Fermi momentum 
(—1.6 M n /c); hence a rough estimate of nucleon- 


9 

recoil and Fermi-motion effects should be given 
by the simple randomizing approximation of re¬ 
garding the pion energy as a constant throughout 
its motion in the nucleus, but replacing the pion- 
production and charge-exchange-scattering cross 
sections by corresponding cross sections which 
are smeared over nucleon Fermi motion. Evalu¬ 
ating Eq. (47) using these smeared cross sections 
gives d( 13 Al 27 ) = 0.142, as compared with the value 
of 0.162 which results when nucleon motion is ne¬ 
glected. We see that the change in d is relatively 
small and is in the direction of reducing the size 
of charge-exchange effects; we expect these quali¬ 
tative features to survive in a more careful treat¬ 
ment of nucleon-motion effects. In Table VIII we 
summarize the values of J( 13 A1 27 ) obtained in our 
original model and when various modifications are 
made. 

C. Pion angular distributions 

Up to this point we have only discussed charge- 
exchange corrections to cross sections in which 
the pion angular variables have been integrated 
out. Our model, however, makes specific predic¬ 
tions for angular distributions as well, and al¬ 
though they are much more subject to error than 
the integrated predictions, 21 they are essential for 
describing experimental situations in which the 
pion acceptance is limited. To describe the angu¬ 
lar distribution predictions, we let the column 
vector 

/ do(N T q; + )\ 

*(Nrf)=f da(N T q; 0) (49) 

\ MNrf; -)) 

denote the free-nucleon-target pion-production 
cross section, with the pion emerging in direction 
q . In the backward-forward scattering approxi¬ 
mation, after undergoing nuclear interactions the 


TABLE VI. Test of second averaging approximation for R ' (^Al 27 ). 


sln 2 0y 


**"N r ) 

R' 

From Eq. (24) 

(isAl*) 

From Eq. (15) 

0 

0.692 

3.67 

0.432 

0.422 

0.1 

0.697 

3.67 

0.356 

0.346 

0.2 

0.707 

3.67 

0.289 

0.280 

0.3 

0.727 

3.67 

0.234 

0.225 

0.4 

0.763 

3.67 

0.189 

0.180 

0.5 

0.820 

3.67 

0.154 

0.146 

0.6 

0.903 

3.67 

0.129 

0.122 

0.7 

1.01 

3.67 

0.116 

0.108 

0.8 

1.12 

3.67 

0.112 

0.104 

0.9 

1.19 

3.67 

0.119 

0.111 

1.0 

1.22 

3.67 

0.136 

0.128 
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TABLE VII. Averaged charge-exchange parameters 
for various nuclei. 


Nucleus 

a 

? 

d 

6 b 10 

0.846 

0.0363 

0.125 

.C 12 

0.811 

0.0450 

0.138 

7 n* 4 

0.790 

0.0498 

0.144 

8 o“ 

0.807 

0.0460 

0.139 

13-A-l 27 

0.724 

0.0642 

0.162 


pion can emerge either in direction q or with re¬ 
versed direction In Appendix A, in addition to 
calculating the total expected fraction of emerging 
pions/(A, L), we also calculate the expected frac¬ 
tions/ + (A, L), /_(a, L) which emerge, respectively, 
with or without a net change in direction. Using 
these to define a forward charge-exchange matrix 
M + and a backward matrix in analogy with Eqs. 
(12), (37), and (39), 

/l - Ci -d ± d t 

[Mj = A t I d t 1-2 d i 

\ c t 

A t =g(W,k*)a t , a t = / t (l) 

c* = i- */.(*)//*(!)+ */*(t)//*U), (50) 

<**-■§■[! “/* (*)//* (Dl. 

, /\\_ S“bdbL(b)f i [KL(b)] 

J * W -- » 

j 0 bdb L{b) 

we get for the charge-exchange-corrected pion an¬ 
gular distribution 

da(Tq) = [M J da(N T q) + [M _] da(N T - q ). (51) 

Since 

[Mj+[M.] = [M], (52) 

Eq. (51) implies that 

da(Tq) +da(T - q) = [M][ da(N T q) +da(N T - ?)], 

(53) 

and so Eq. (51) reduces to our previous result for 
charge-exchange corrections when integrated over 
pion angle. 

IV. CONCLUSIONS 

We briefly summarize the results of the preced¬ 
ing sections, with particular emphasis on their 


implications for further experimental and theoret¬ 
ical work. 

(1) Our model calculations confirm the suggestion 
of Perkins 2 that charge-exchange corrections to 
weak pion production are a substantial effect, even 
for relatively light nuclear targets. To improve 
our understanding of these corrections it is impor¬ 
tant to do the analogous pion electroproduction ex¬ 
periments on nuclear targets, both to implement 
the phenomenological procedures of Sec. II and to 
test the predictions of the detailed multiple-scat¬ 
tering model of Sec. III. In the context of the mul¬ 
tiple-scattering model these electroproduction ex¬ 
periments have an independent nuclear physics in¬ 
terest, since they will permit a determination of 
the pion absorption cross section a abi (W) entering 
into the Sternheim-Silbar 3 calculation, independent 
of assumptions about the magnitude of proton ab¬ 
sorption in nuclear matter. 

(2) Again, in the context of the multiple-scatter¬ 
ing model, it is important to repeat the calcula¬ 
tions of Sternheim and Silbar using the improved 
scattering approximation developed in Sec. n and 
Appendix A (as extended 9 to the case of a neutron 
excess). This will permit the extraction of an op¬ 
timized pion absorption cross section a abi ( W) ap¬ 
propriate to the precise model which we use, and 
hopefully, may reduce some of the remaining 
areas of disagreement between the Sternheim-Sil¬ 
bar calculation and experiment. 

(3) Our calculations suggest that the ratio 
R'( 13 A1 27 ) is larger than about 0.18 when the Wein¬ 
berg parameter is in the currently interesting 2 
range sin 2 0 v 5 0.35. We do not attach great signif¬ 
icance to the fact that this theoretical estimate of 
R ' exceeds the upper bound of 0.14 reported by 

W. Lee, 1 since the discrepancy is easily of the or¬ 
der of uncertainties in the predictions of our pro¬ 
duction and charge-exchange models. We believe 
that a reasonably conservative statement is that if 
the hadronic neutral weak current has (up to iso¬ 
scalar additions) the form of Eq. (43), and if 
sin 2 0 v £ 0.35, then R ' on an aluminum target is in 
the neighborhood of a 15% effect. Thus, an exper¬ 
iment capable of measuring R ' to a level of a few 
percent will provide a decisive test of Eq. (43), 
and if Eq. (43) is correct, should permit a crude 
determination of sin 2 #*. 

Added note . A more recent calculation of pro¬ 
ton-induced pion production on nuclear targets by 



TABLE vm. Effect of modifications of the model on5( 13 Al 27 ). 

Nucleon motion Pauli factors with with 

No variations included fc-* 1 i?abf 2(r tbi 

^(igAl 27 ) 0.162 0.142 0.187 0.175 0.145 
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R. R. Silbar and M. M. Sternheim [ Phys. Rev. CJJ, 492 (1973)] gives a best-fit a lbf given by 




30mbx r43WT W * T ' <1 - 433 "-' 

51.3 mb(l- g JsVj , 1-433 <T W <3.455M W . 


(AN1) 


Equation (ANl) is substantially larger than Eq. (27) 
in the region of low pion energy; Silbar and Stern- 
heim attribute this difference largely to the inclu¬ 
sion of various nuclear corrections in their new 
calculation. Although it may not be consistent to 
use Eq. (ANl) in a model, such as ours, in which 
most of these nuclear corrections are neglected, 
we have nonetheless repeated the computation of 
Table II using Eq. (ANl), instead of Eq. (27), for 
a ab5 . The effect is to give values of R* which are 
about 17% larger than those tabulated in column 3 
of Table II. 

We have also repeated our calculations using the 
work of Ref. 9 to take the neutron excess in 1S A1 27 
into account. The effect is to reduce R* by about 
2.5% as compared with column 3 of Table II, indi¬ 
cating that the approximation of isotopic neutrality 
is a good one for 13 Al 27 . This calculation also sug¬ 
gests that our neglect of changes in the nuclear 
isospin in the course of the multiple-scattering 
process (see Sec. IIIA2) should cause an error of 
perhaps 10% at most in R\ Similarly, in analyzing 
the structure of M in Sec. IIC we have implicitly 
neglected a possible change in the nuclear isospin 
arising from the pion production step (i.e., we 
continue to treat an initially /- 0 nucleus as being 
in an /=0 state after the pion is produced); again, 
for nuclei which are not very light, the error re¬ 
sulting from making this approximation should be 
small and the three-parameter form given in Eq. 
(12) should give a reasonably good description of 
M. 
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APPENDIX A; ONE-DIMENSIONAL 
SCATTERING PROBLEM 

In this appendix we solve the one-dimensional 
multiple-scattering problem on which our approx¬ 
imate solution for pion three-dimensional multiple 


scattering is based. 22 We briefly recapitulate the 
formulation of the problem given in the text. We 
consider a uniform one-dimensional medium ex¬ 
tending from x = 0 to x = L, in which pions are uni¬ 
formly initially produced moving (say) to the right. 
The pions move in the medium with inverse inter¬ 
action length k, and at each interaction the pions 
are forward-scattered with probability and 
back-scattered with probability /x_, with a con¬ 
comitant multiplication of the pion number by a 
factor of A. The probabilities and /i- satisfy 
the constraint 

M+ + M- ; (Al) 

when Eq. (Al) holds with the inequality, pion ab¬ 
sorption is present. The problem is to find the ex¬ 
pected numbers f ± of pions eventually emerging 
from the medium either moving to the right (/+: 
no over-all direction reversal) or to the left (/_: 
over-all direction reversal), normalized to unit 
integrated initial pion density. 

We begin by remarking that since/+ (/_) is 
even (odd) in the direction-reversal probability 
/x_, it suffices to calculate 

/=/♦ + /-. (A2) 

the expected amplitude for pions to emerge in ei¬ 
ther direction. We then recover f k by splitting/ 
into parts even and odd in /i_. To formulate the 
multiple-scattering problem, we let P(xj\yi)dx be 
the probability that a pion which after collision 
n — 1 was at coordinate y moving in direction i (i 
= 1, r = left, right) is, after collision w, in an inter¬ 
val dx at x moving in direction j. From the defi¬ 
nitions of k and given above, one easily finds 
that P y which does not depend on w, is given by 

P(xr \yr) = /i + Ke~ K(,x ~ y) Q(x-y) t 

P(xl\yr) = 6(x - y), 

(A3) 

P(xl\yl) = n+Ke’ K{y ' x) 6(y-x)y 

P(xr \yl)= M-^e" K(y ' 3t> 0(y -x), 

with 6 the usual step function. Since the composi¬ 
tion laws for conditional probabilities are the same 
as the quantum-mechanical composition laws for 
probability amplitudes, it is convenient to intro¬ 
duce a Dirac state notation by writing 
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(xj\P\yi) = P{xj\yi)\ 

( xj\P 2 \yi )= f dzT](xj\P\zk)(zk\P\yi) 9 (A4) 

J ° k 

{xj\P n \yi)~ ( dz^ ( xj\P\zk)(zk\P n ~ l \yi ). 

•'O fc 

Letting p ( 0 ) (yi) be the initial density of produced 
pions moving in direction i, we then find that the 
density p w (x/) of pions which have undergone ex¬ 


actly n collisions and are moving in direction j is 
p (n> (x;)= f dyy'(x)\P n \yi)f> {Q) {yi). (A5) 

J ° i 

The number of pions N^ n) emerging from the medi¬ 
um after exactly n interactions is equal to the total 
number of pions present after n interactions less 
the number of such pions which interact once more 
in the medium, 


N {n) = j dx[p {n) (xl)+p {n) (xr)]- j df dz fce~ Kix ~* ) p <ri) {xl) + j da Ke"* ( *" x) p w (xr)J. 


(A 6 ) 


Since each interaction multiplies the pion number 
by one factor of X, the number N (n> must be 
weighted by X" in forming the expected number of 
pions leaving the medium. Taking p < 0 ) (yi) to have 
the unit normalized value 


Equations (A3)-(A8) constitute the statement of 
our scattering problem. To write these equations 
more compactly, we introduce the additional nota¬ 
tions 


we get finally 

ao 

/= X) 


(A7) 


( 21111 x;) = 6(2 - x)6 iJ , 
(z\P VA \xr) = 6(z -x), 

(2 | Ptot | xl) = Ke~9(x -z), 


(A9) 


(A8) 


in terms of which Eqs. (A5)-(A8) take the form 


fl =0 


/= (T d 2 dx{[ 6 (z-*)-(z|J°toil* 0 ] £ ^p ( " > (xi)+[ 6 ( 2 -x)-< 2 |P t 0 ,|xr)] £ X* p (, ° (* r) L 

dodo ( n=0 n=0 ) 

T l ^P W (xj)=y f L dy(xj yr) ( L dy{xj\(l-\P)- , \yr). 

L J ° ' ! L J ° 


(A10) 


Equation (A10) can be further simplified by noting 
that 


5(z 


-x)-(z\P tot \xj)=( 1 — ~ + 1 g — ) 

+ ST7r£ < * i|1 ~ x,,|xj> ’ 

+ “ i 

(All) 


with 

o ± =Xp ± . (A12) 

Substituting Eq. (All) into Eq. (A10) we obtain, 
finally, 


Equations (A12)-(A14) give a formal expression 
for /; to evaluate this expression explicitly we 
must determine the inverse operator appearing in 
Eq. (A14). Writing 

(zi\(l - \P)~ l \ yj) = &(z - y)6ij +F(zi\yj) (A15) 


and defining 


f(yj)= f dzj^ F(zi\yj), 
J n 


(Ai 6 ) 


/ = ( 1 -— ^((l-xp)- 1 ),^—-—, 

J \ o + +a_/ xv ' * v a + +a„ 


(A13) 


we find that Eq. (A14) can be expressed in terms 
of f(yj) as 

((l-XP)' 1 )„ = l + -|-J dyfiyr) 


with 


= 1 + 


dyf(yl). 


((1 - x P)~‘)*y = £ f Q f o dzd yYL <«l(l-AP)* , |yr>. 

(A14) 


(A17) 


while the relation (1 - XP)(1 - XP )" 1 = 1 implies that 
fiyj) satisfies the integral equation 
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9 


f(yj)=g(yi) + f dzJZf(zi)(zi\\P\yj), 

• / o i 

giyj) = j dzj^ (zi\\P\yj). 


(A18) 


Referring back to Eq. (A3) for P , we easily see 
that/(yj) and g(yj ) have the reflection symmetry 


As a check on Eq. (A27), we consider the special 
case in which there is no backward scattering, 
i.e., = We find 


/- = 0 , 

1 _ g-KX(l-\M + ) 


(A28) 


/♦» 


kL(1- A/x + ) 


= ij 0 d ye~ Ky(, ~ Xl ‘+ ) > 


f(yl) = f(L-yr), giyl)=g(L-yr ). (A19) 

Substituting Eq. (A3) into Eq. (A18) and using this 
symmetry, we find that Eq. (A18) reduces to the 
single integral equation 


which is Just the elementary exponential decay law 
appropriate to the case of forward propagation with 
effective absorption constant k( 1 - X/x + ), averaged 
over the length of the one-dimensional medium. 


f(yl) = (o + +v.)( l-e~ Ky ) 

+ ( dz[KQ + f(zl)+KQ„f(L-zl)]e~ K ^ y ~' E) . 

•'O 

(A20) 


Multiplying Eq. (A20) by e Ky and differentiating, 
we find the equivalent differential equation and 
boundary condition 


Kf(yl) + f'(yl) = (o + +vjK+KV + f(yl) 

+ KO„f(L-yl ), 


/( 0 *) = 0 . 

The solution to Eq. (A21) has the form 


f(yl)= J 


a + +q_ 

- (a + +a_) 



My)' 

M0)J’ 


(A21) 


(A22) 


with ft a solution of the homogeneous equation 


APPENDIX B: COMPARISON OF APPROXIMATE 
AND EXACT SCATTERING SOLUTIONS FOR 
A UNIFORM SPHERICAL GEOMETRY 

In this appendix we calibrate the accuracy of the 
approximate scattering solution used in the text by 
comparing the approximate solution with the exact 
scattering solution in the case of a simple geome¬ 
try. We consider a uniform sphere of radius R 
composed of material which scatters isotropically. 
Particles (“pions”) are produced uniformly 
throughout the sphere and propagate with inverse 
interaction length k . At each interaction the par¬ 
ticles scatter isotropically, with the particle num¬ 
ber simultaneously multiplied by a factor X. We 
wish to find the expected number / of particles 
eventually emerging from the sphere, normalized 
to unit integrated initial particle density. We dis¬ 
cuss successively the exact solution, two approxi¬ 
mate solutions, and the numerical comparison. 


nh(y) +h f (y)~KO + h (y) + koJi(L - y) . (A23) 

To solve Eq. (A23), we try an exponential ansatz 
of the form 

h(y) = e KOy + Me- Koy , (A24) 

which we find gives a solution when a and /x are 
related to k and a k by 


1. Exact solution 

The formulation of the solution to the spherical 
problem is closely analogous to the formulation of 
the one-dimensional problem in Appendix A, and 
we omit all details. Corresponding to Eqs. (A13), 
(A17), and (A18) we find 23 


o = [(l-v + ) 2 -oJY'\ 
° +1 ~°-+ e «°L % 


(A25) 


It is now a matter of simple algebra to combine 
Eqs. (A13), (A17), (A22), (A24), and (A25) to give 


our final result for/, / + , and/..: 


e KaL -1 l+ne- KOL 

J koL 1 + M 


* 

1 

+ 

+ 

ii 

(A26) 

e KaL -1 uL 2 e- KOL -l 

U koL m 2 -1 ’ 

e RaL -1 >i(l -e~ Kat ) 
koL jx 2 -1 

(A27) 


/ = (l-i)<(l-XP)- , >„ v + i > (Bl) 

<(l-XP)-«)„-l + t ^3 f d 3 yf( y), (B2) 

/(y) =g( y) + f. d 3 zf(z)x £ , (B3) 

J\ zi z-y 


g(y)= 

» I 7.1 


5R 


d 3 2 xf 

4tt 


€ -k\ z — y 1 

WW 


After spherical-averaging the scattering kernel, 
scaling out the sphere radius R , and expressing 
the solution of the integral equation in iterative 
form, we find 
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/ = 1+3^1-^J^ u*du g M (p, u), 

S (0> -1, 

g M (p,u) = p f %vdvg { ”~ l) (p,v)l- 
J o w 

x[ E,(p I v - u I)- £,(p(i; + w))] 

f* e~* 

l dt ^- 9 

P = kR. 


(B4) 


Since we are only interested in values of A which 
are smaller than 1, the series in Eq. (B4) is con¬ 
vergent and / is readily calculated by repeated 
numerical integration. 


2. Approximate solutions 


We recall that the approximate scattering solu¬ 
tion used in the text is obtained by projecting all 
forward- and backward-hemisphere scattering, 
respectively, onto the forward and backward di¬ 
rections, solving the resulting one-dimensional 
scattering problem as a function of optical thick¬ 
ness, and then integrating over the distribution of 
optical thickness actually present. For the spheri¬ 
cal problem considered here substitution of Eqs. 
(A25) and (A26) into this recipe gives the following 
approximate formula for /: 


/<»= fiu 

Jo 

a- (1-A) 1 ' 2 , 


'du 


e pou -l 1 + iie~ pau . 


pcrw 


1+M 


(B5) 


M = 


a +1 - 





which is readily evaluated by a single numerical 
integration. We also include in our comparison 
the scattering approximation used by Sternheim 
and Silbar, in which all scattering is projected on¬ 
to the forward direction. In this case the relevant 
one-dimensional solution becomes the pure-for¬ 
ward-scattering solution of Eq. (A28) and we find 
a second'approximate formula for /: 


/ 


< 2) = 



e p(\-l)H _ l 

p(A-l)w 


(B6) 


3. Numerical comparison 

Numerical results for/, / (l) , and/ (2> are given 
in Table IX for a wide range of values of A and p. 
Agreement between the exact result / and the ap¬ 
proximation / (1) used in the text is excellent over 
the entire range of parameters. The approxima¬ 
tion f {2) used by Sternheim and Silbar is qualita¬ 
tively correct, but develops significant deviations 
from the exact answer for large values of p. To 


interpret the parameter p in terms of nuclear size, 
we note that for a uniform spherical nucleus of ra¬ 
dius R~ 1.3 A 1/3 F, and an interaction cross section 
characteristic of the peak of the (3, 3) resonance 
(a m „ -210 mb = 21 F 2 ), we have 

A 2 n 
P ~ 4 * “Omax 

~2A 1/3 

~ 6 for aluminum 

-12 for lead. (B7) 

Hence for aluminum our simple forward-backward 
approximation solves the multiple-scattering prob¬ 
lem to an accuracy of better than 1%; even for the 
heaviest nuclei the approximation (with appropriate 
modifications to take neutron excess into account) 
should be good to better than 3%. 

APPENDIX C: MISCELLANEOUS FORMULAS 

We collect here the formulas for cross sections 
and Pauli factors used in the text. 


1. Cross sections 

For a n+t ,(W) we use the simple form 
o r , p (W)=o i3 ' 3) (W) + 20 mb, (Cl) 


TABLE IX. Comparison of exact and approximate 
multiple-scattering solutions. 


A 

P 

/ [Eq. (B4)l 

/ (1) [Eq. (B5)l 

/® [Eq. (B6)l 

0.5 

0.5 

0.827 

0.827 

0.835 


1 

0.687 

0.686 

0.707 


2 

0.489 

0.488 

0.527 


4 

0.290 

0.289 

0.332 


8 

0.154 

0.153 

0.182 


16 

0.0790 

0.0773 

0.0930 

0.6667 

0.5 

0.878 

0.877 

0.885 


1 

0.766 

0.764 

0.789 


2 

0.584 

0.583 

0.638 


4 

0.370 

0.368 

0.445 


8 

0.203 

0.200 

0.262 


16 

0.105 

0.102 

0.138 

0.8333 

0.5 

0.935 

0.934 

0.940 


1 

0.867 

0.865 

0.885 


2 

0.733 

0.731 

0.789 


4 

0.524 

0.522 

0.638 


8 

0.310 

0.307 

0.445 


16 

0.166 

0.161 

0.262 

0.9167 

0.5 

0.966 

0.966 

0.969 


1 

0.928 

0.927 

0.940 


2 

0.845 

0.842 

0.885 


4 

0.678 

0.676 

0.789 


8 

0.446 

0.443 

0.638 


16 

0.251 

0.244 

0.445 
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with the first term the resonant cross section and 
the second term a constant approximation to the 
nonresonant background. (This formula slightly 
overestimates the cross section at and below the 
resonant peak, and underestimates it above reso¬ 
nance.) For o (3 3) (tr) we use the Roper para¬ 
meterization, 24 




iir? 


(Qo-Qor'f+iiTf ’ 


(C2) 


with 




W 2 - M J +M 2 


2 W 




Qor — 1.621 Q r —1.640 M n t 

1,262 q*/M, 

r (<7 0 + <7 0r )(l+0.504? VM T *)’ 
°max - ~2 “185 mb. 

Hr 


(C3) 


, 1 59 3 

h * =a Wl 0 ““ 


1 29 

y/T 420 


136- 59/v^2~ 
-70-“ 


176-29/V2 - 

420 


, a ** 1 


2. Pauli factors 

We calculate the Pauli factors in the approxima¬ 
tion of treating the nucleus as a collection of inde¬ 
pendent protons and neutrons with equal Fermi- 
sea radii R„ =R p =R «1.6Af „. Then the fraction of 
nucleons which can contribute, for given momen¬ 
tum transfer A to the nucleus, is the fraction of 
the volume of a sphere of radius R centered at 0 
which lies outside a second sphere of radius R 
centered at A. That is, 8 


h(W,<)>) = 
with 25 


in--kv 3 , n* 2 

i, 


(C4) 


7= :s ^ sin (^^)- 


(C5) 


Performing the integrations over in Eq. (41), we 
find 


f 1 59 3 1 

h + ~ a JT 70 a 


29 
7T 420 


« 4 - 18 . 4 _ 6 1 59 

fe-=2--a 2 +-a 4 -— a -a^™ +a ' 


35 21 


h + =1 - ♦<*"* + - ^a" 8 


ft. = 1 


, V2~ 


1 29 

J2 420 


with 


a -q/R . 


, 1 


(C6) 


(C7) 


For the production Pauli factor g(W y Jr) we use the expression 

-/J, 2 L I u2 I \l/2 


26 


*0 = 


= W 2 -M n 2 - \k 2 \ 
2 W 


fr = (V+ r 


*♦«*» 

g(W,k 2 ) = 1, 2 R<k-q. 


(C8) 


♦Operated by Universities Research Association, Inc,, 
under contract with the U. S. Atomic Energy Commis¬ 
sion. 

tPermanent address: Tel-Aviv University, Ramat-Aviv, 
Tel-Aviv, Israel, 
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3 S. M. Berman, CERN report, 1961 (unpublished). 

9 Thls restriction is of course not necessary in principle. 
The extension of our multiple-scattering model to take 
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Erratum: Nuclear charge-exchange corrections to leptonic pion production 
in the (3,3)=resonance region [Phys. Rev. D 9, 2125 (1974)] 

Stephen L. Adler, Shmuel Nussinov, and E. A. Paschos 


Page 2138: In the second paragraph of the added 
note, the statement “The effect is to reduce R' by 
about 2.5%... should cause an error of perhaps 10% 
at most in R'* 1 should be changed to read“The effect 
is to reduce R' by about 1%.. . should cause an er¬ 
ror of at most a few percent in R'. n 

The following misprints should be corrected; 

(i) Page 2127, Eq. (9b): The n t to the right of 
the arrow should read n f . 

(ii) Page 2128, Eq. (15): The quantity a(^ + T 
-y.~ + T' + 7r°) should read v(v u + T -+ T" +tt°). 


(iii) Page 2134, Eq. (46): The quantity 

l;1 Al 27 ) should read l3 Al 27 ). 

(iv) Page 2139, Eq. (A14): The quantity 

should read ^ 

i 

(v) Page 2143: In Ref. 4 “G. M. Wing, Ref. 20” 
should read “G. M. Wing, Ref. 22”; in Ref. 26, 
“Eq. (6C.6) of Ref. 18” should read “Eq. (6C.6) of 
S. Adler, Ann. Phys, (N,Y,) 50, 189 (1966),” 
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Application of Current Algebra Techniques to Neutral-Current—Induced 

Threshold Pion Production 

Stephen L. Adler 

National Accelerator Laboratory, Batavia , Illinois 60510 , and 
The Institute for Advanced Study, Princeton , New Jersey 08540 

(Received 7 October 1974) 

S apply current-algebra techniques to study threshold pion production induced by the 
weak neutral current. In addition to specific predictions for the Weinberg-Salam-model 
current, I find upper bounds on the magnitude of threshold pion production for an iso¬ 
scalar neutral current and for a general hadronic neutral current formed from the usual 
vector and axial-vector nonets. Violation of these bounds would suggest the presence of 
new coupling types in the neutral semileptonic interaction. 


The initial experiments discovering weak neu¬ 
tral currents in high-energy deep-inelastic neu¬ 
trino reactions 1 have now been supplemented with 
the observation of neutral-current effects in low- 
energy neutrino pion production. 2 ’ 3 Obtainable 
invariant-mass resolutions will permit the study 
of ttN production in the threshold region below 
the (3, 3) resonance, and in fact preliminary Ar- 
gonne data 2 (without final corrections for neutron 
background) raise the possibility that the thresh¬ 
old cross section for v~p production by the neu¬ 
tral current may be appreciable. In this Letter 
we study threshold pion-production processes by 
using current-algebra, soft-pion techniques. I 
briefly describe the methods used in making such 
an analysis, and summarize the results obtained. 

I begin by giving a simple analytic treatment of 
threshold pion production, which, although some¬ 
what naive, illustrates the basic ideas which we 
exploit in our more careful numerical calcula¬ 
tions, According to standard soft-pion lore, 4 the 
amplitude for the pion emission process 8 +a 
- tt' + j 3, with a and |3 hadronic states and £ an ex- 


1 do(v+N- v + N + n J ) 

Iql d(k 2 )dW 

Here M N , M„ are the nucleon and pion mass, W 
is the mass of the final ir J N isobar, I ql is the 
pion momentum in the isobaric rest frame, k 2 is 
the leptonic squared four-momentum transfer 
(spacelike, fe 2 >0), g r * 13.5 is the pion-nucleon 
coupling constant, and the isospin matrix element 
a takes the values I a I = V2" for tt j = tt* and I a I = 1 
for tt j -tt°. The significance of Eq. (1) is that it 
allows one to translate an upper bound on the 
cross section for v^+p- v^+p into an upper bound 
on the strength of threshold pion production by 
the weak neutral current 
As I have already suggested, the above deriva- 


ternal current, is given as the sum of two terms. 
The first consists of a sum of external line inser¬ 
tions in which the pion tt j is emitted from the ex¬ 
ternal hadronic lines of the pionless process 8 
+ a - /?, while the second is an equal-time com¬ 
mutator term proportional to the amplitude for 
the reaction 8 ' + “ 0, with 8 ' the modified cur¬ 
rent obtained from the commutator 8 f ]. 

In the case of neutral-current weak pion produc¬ 
tion, the current 8 is, of course, the hadronic 
weak neutral current and the states o t and 0 are 
each a single free nucleon. For simplicity, let 
us restrict ourselves for the moment to cases in 
which the equal-time commutator term vanishes, 
as occurs, for example, if the current 8 is an 
isoscalar V-A structure containing an arbitrary 
linear combination of 9 : 0 X , ff B x , ff 0 sX , 9: 8 5X - 5 The 
pion emission amplitude then consists entirely of 
the external line insertion terms. Evaluating 
these terms at threshold (where the insertion on 
the outgoing nucleon line vanishes) and neglect¬ 
ing the pion mass in all kinematics, we find the 
following relation between threshold pion produc¬ 
tion and neutrino proton elastic scattering: 

( 1 ) 


tion is too naive in a number of respects. First 
of all, the external line insertion terms are rap¬ 
idly varying pole terms, and so the kinematic 
approximation of neglecting M * in calculating 
them is dangerous. Secondly, by considering on¬ 
ly cases in which the equal-time commutator 
term^J' vanishes, we exclude from consideration 
such processes as production in the SU(2) 

® LT(1) gauge model. And finally, it is important 
to estimate the leading 0(q) corrections to the 
soft-pion approximation, and to calculate the ef¬ 
fects in the threshold region of the tail of the 


« 2 ?g r M*\ 2 j ? 2 a ( 1 + j^y 2 ^iii±g-ji+£), 

threshold = 4* 2 mA 2M n M/V + 4M n 2 )\ + 2M n 2 ) d{k 2 ) 
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(3, 3) resonance. We deal with these problems 
by using an extended version of a model for weak 
pion production which has been described in de¬ 
tail elsewhere. 6 In its original form, the model 
included the rapidly varying pole terms and the 
resonant (3,3) multipoles, with no kinematic ap¬ 
proximations. The extensions consist of adding 
subtraction constants (in the dispersion-theory 
sense) to the non-Born terms of the model, which 
guarantee that it satisfies the relevant soft-pion 
theorems and which include the leading correc¬ 
tions (of first order in the pion four-momentum 
q and zeroth order in the lepton four-momentum 
transfer k) to the soft-pion limit. These latter 
corrections are calculated by the method of Low 7 
and Adler and Dothan 7 ; for the vector current 
amplitude they vanish, while for the isovector 


axial-vector amplitude they are related by par¬ 
tial conservation of axial-vector current to mo¬ 
mentum derivatives of the pion-nucleon scatter¬ 
ing amplitude at the crossing-symmetric point. 

For an isoscalar axial-vector current the order- 
q corrections cannot be precisely calculated, but 
a heuristic resonance-dominance argument sug¬ 
gests that they should be much smaller than in 
the isovector axial-vector case, and so we neg¬ 
lect them. 

I give now the results of numerical calculations 
using the extended model in various cases, focus¬ 
ing attention on the reaction 6 v^+n- + +p . 

(1) Isoscalar neutral current .—For the vector 
and axial-vector form factors in this case we 
take, for definiteness, a dipole formula with char¬ 
acteristic mass M Nf 


F 1 s (fe 2 )=X 1 (l + fe 2 /M/)- 2 , 2M „F 2 S (k 2 ) = X 2 (l + k 2 /M tr 2 )" 2 , g A s (k 2 ) = X 3 (1 + k 2 /M s 2 T 2 , (2) 

with x lf x 2 , and x 3 free parameters. Assuming the 95% confidence bound 2 

otyp+p- +/>)** 0.32a(i^ M +w- +/>), (3) 

we find that the cross section for v^ + n- v^+i r" +p , with tt~P invariant mass W between 6 1080 and 1120 
MeV, is bounded by 0 

+ + +/>)« 0,32a(i^ M +w - p" +p)[o(v fl + n - + +p)/o(v ]1 +p - i ^+p )\, (4a) 

<1.0xl0“ 41 cm 2 , (4b) 


The inequality in Eq. (4b) is obtained by maximizing the ratio in square brackets with respect to varia¬ 
tion of x lf x 2 , and x 3 . We find in this case that the naive form of the low-energy theorem in Eq. (1) is 
reasonably good, predicting a bound about one-third as large as that of Eq. (4). 10 

(2) Weinberg-Salam SU(2) ®U(1) model ,—In the simplest, one-parameter version of this model, the 
neutral current has the form 


5 jv X =9 : 3 X -9 : 3 5X -2x(JF3 X + 3‘ 1/2 9 : 6 x )+A^ x , x=sin 2 6> 

with A $ x an isoscalar, V-A, strangeness-and 
“charm”-current contribution which is conven¬ 
tionally assumed to couple only weakly to non- 
strange low-mass hadrons. Neglecting A g x for 
the moment, we can make an absolute calculation 
of the cross section for v^+n - i/ m + tt“ +p . We 
find, for ir m p invariant mass W between 1080 and 
1120 MeV, a predicted cross section of 0.75 
x 10" 41 cm 2 . To assess the reliability of our cal¬ 
culations, Fig. 1 gives a comparison of our mod¬ 
el with the Argonne National Laboratory results 
for the charged-current reaction v u +p - \i m + ir + 

+/>. The predicted cross section for 7 T*p invari¬ 
ant mass W between 1080 and 1120 MeV is 6.9 
xlO ’ 41 cm 2 , in satisfactory agreement with the 
observed cross section of (9.3± 4.7)x 10" 41 cm 2 . 

In certain extensions of the original Weinberg- 
Salam model, the neutral current has the gener- 


(5) 


al form of Eq. (5), but with an adjustable strength 
parameter k in front. A useful upper bound on 
the magnitude of k is provided by deep-inelastic 
neutrino-scattering neutral-current data. In 
terms of the standard ratios v =o(v f V + N 

-v,V+ T )/o(v,V + N- p + + r), we find 11 the 95% 

confidence limit 1 

1.5s3«„+« I7 S(c 2 [1+(1-2x) 2 ]. (6) 

Continuing for the moment to neglect the isosca¬ 
lar addition A$ x we can combine the bound of 
Eq. (6) with the extended model to predict that 
the cross section for neutral-current 7 r“ produc¬ 
tion, with 7 T m p invariant mass W between 1080 
and 1120 MeV, is bounded by 1.5 xlO’ 41 cm 2 for 
all allowed values 12 of k and x. Finally, we can 
include the isoscalar addition A$ x by parametriz- 
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tt + p Invariant Mass (GeV) 

FIG. 1. Comparison of the extended pion production 
model with the Argonne National Laboratory charged- 
current data. Each event represents an Argonne flux- 
averaged cross section of 2.3 x 10" 41 cm 2 . 


ing the total isoscalar contribution to $ N x as in 
Eq. (2), giving a cross section dependent on the 
five parameters k, x, \ 2f and A 3 . Combining 
the bounds of Eqs. (6) and (4a) with the extended 
model and maximizing over the five-parameter 
space, 12 we find that the cross section for v^ + n 
— + +p, with W between 1080 and 1120 MeV, 

is bounded by 4.4xl0“ 41 cm 2 , for a general had¬ 
ronic neutral current formed from the usual vec¬ 
tor and axial-vector nonets. 13 

Experimental violation of this general bound, 
or the observation of evidence for an isoscalar 
neutral current together with violation of the 
bound of Eq. (4b), would suggest that the neutral 
current involves unusual types of coupling, in ad¬ 
dition to or in place of the usually assumed V — A 
structure. One possible source of violations 
could be an interaction of the V —A type involving 
currents outside the usual quark-model vector 
and axial-vector nonets. An alternative source 
of violations could be the presence of S-, P-, 
and T-type neutral-current couplings. 14 If we de¬ 
fine S, P, and T hadronic “currents” 9^, ff/, 

9^ Xo and abstract their commutation relations 
from the quark-model forms 


$ J X7 ' = qix i o X7 '<l, 



then the commutator term 3* appearing in the 


soft-pion analysis above will have SU(3) .0- rath¬ 
er than F-type structure. This will substantially 
alter the structure of the low-energy theorems; 
for instance, the commutator term will no longer 
vanish for an isoscalar neutral current. The ef¬ 
fect of this altered structure on the bounds given 
above is presently under study. 

I wish to thank S. F 0 Tuan for stimulating dis¬ 
cussions about the structure of neutral currents, 

S. B. Treiman for many helpful critical com¬ 
ments in the course of this work, and P. A. 
Schreiner and W. Y. Lee for conversations about 
the Argonne National Laboratory and Brookhaven 
National Laboratory neutrino experiments. I 
have also benefitted from discussions with R. F. 
Dashen, S. D. Dr ell, E. A. Paschos, and S. Wein¬ 
berg. 


1 F. J. Hasert et cd., Phys Lett. 46B, 138 (1973); 

A. Benvenuti et ah, Phys. Rev. Lett. 32, 800 (1974). 

2 P. A. Schreiner, in Proceedings of the Seventeenth 
International Conference on High Energy Physics, Lon¬ 
don, England, July 1974 (to be published). 

3 Columbia-Rockefeller-Illinois Collaboration, in 
Proceedings of the Seventeenth International conference 
on High Energy Physics, London, England, July 1974 
(to be published), 

4 S. L. Adler and R. F. Dashen, Current Algebras 
(Benjamin, New York, 1968). 

Vanishing of the equal-time commutator in this case 
was noted by J. J. Sakurai, in Proceedings of the Fourth 
International Conference on Neutrino Physics and As¬ 
trophysics, Philadelphia, Pennsylvania, April 1974 
(to be published). 

6 S. L. Adler, Ann. Phys. (New York) 50, 189 (1968). 
[See also S. L. Adler, Phys. Rev. D 9, 229 (1974).] 

The extended model is obtained_by adding as subtrac¬ 
tion constants Eq. (5A.2l)_for A/^L, A 4 c ‘Mo>_? nd 
A/ + >| 0 ; Eq. (5A.22) (or^, ( + ) l 0 , ^i‘ 0) l 0 . and V (->| 0 ; 

Eq. (5A. 9) for A 3 c + *Jjj; and Eq.J5A.30) forA/">| 0 . 

The orders terms A 3 f+ >| 0 andA/ - ^ were assumed 
to have k 2 dependence (1 + k 2 /M N 2 )~ 2 ; variation of this 
assumed dependence produced only small changes in 
the results. We took the axial-vector form-factor 
mass as M A = 0.9 GeV. 

? F. E. Low, Phys. Rev. 1A0, 974 (1958); S. L. Adler 
and Y. Dothan, Phys. Rev. 151 , 1267 (1966). 

Analogous bounds can be given for other pion-pro- 
duction channels and for larger invariant-mass inter¬ 
vals than the one considered here. 

®The quoted bounds are not corrected for possible 
differences in the k 2 distributions of the reactions v u +p 
—■ p +p and v u +n-*n~ + p. For neutral-current form 
factors which decrease much more slowly than the 
charged-current form factors, the effect of such cor¬ 
rections would be to decrease the bounds. 

10 In the case of + N— +tt° + n in the Weinberg- 


1513 




484 


Adventures in Theoretical Physics 


Volume 33, Number 25 PHYSICAL REVIEW LETTERS 16 December 1974 


Salam model, where Eq. (1) should formally hold, we 
find that the order-*} corrections increase the (greatly 
suppressed) threshold pion production by an order of 
magnitude. As a result, the threshold t r° production 
becomes comparable to that in +n -*■ + tt~ +p 

(where the order-^ corrections have oily an ~2o% ef¬ 
fect) . 

^Equation (6) assumes scaling, and also uses the fact 
that <r{v u + +r)/ff(i; M +N — m" + T) See A. Pals 

and S. B. Treiman, Phys. Rev. D 6, 2700 (1972). 


APPLICATION OF CURRENT ALGEBRA TECH¬ 
NIQUES TO NEUTRAL-CURRENT-INDUCED 
THRESHOLD PION PRODUCTION. Stephen L. 
Adler [Phys. Rev. Lett. 33, 1511 (1974)]. 

An inadvertent confusion of meaning has result¬ 
ed from replacement of commas by minus signs. 
On page 1511, column 2, and page 1513, column 
1, “V-A” should read “V, A.” Only on page 
1512, column 1, was the “V -A” intended to 
mean “V minus A.” 


I2 We search over all real values of x, even though 
only the range 0 ^ x ^ 1 Is physically meaningful in the 
SU(2)®U(1) model. 

l3 This bound would be reduced if Eq. (6) were strength¬ 
ened to include the isoscalar current contributions on 
the right-hand side. 

14 Tests for such couplings in the neutral current have 
been discussed by B. Kayser, G. T. Garvey, E. Fisch- 
bach, and S. P. Rosen (to be published) and by R. L. 
Kingsley, F. Wilczek, and A. Zee (to be published). 
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Application of current-algebra techniques to soft-pion production by the weak neutral current: 

F, A case * 
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We apply current-algebra techniques to study the constraints imposed on neutral-current- 
induced soft-pion production, using as input existing bounds on neutrino-proton elastic scat¬ 
tering and existing data on neutral-current-induced deep-inelastic scattering, hi the case of 
a purely isoscalar weak neutral current, a simple soft-pion argument relates the cross sec¬ 
tion for threshold (in pion-nucleon invariant mass) weak pion production directly to the cross 
section for neutrino-proton elastic scattering. Hence, a bound on the latter cross section 
implies a bound on the former. To apply the method away from threshold and to nonisoscalar 
neutral currents, we extend a model which we had developed earlier for weak pion produc¬ 
tion in the (3,3) resonance region so as to Include the low-energy-theorem constraints. 

Numerical work using the extended model shows that a threshold peak (now attributed to 
background) In preliminary Argonne data on v + x —■ v +p + ir~ would have implied a threshold 
cross section much larger than can be obtained with any neutral current formed solely from 
members of the usual V t A nonets. We analyze recently reported Brookhaven National Lab¬ 
oratory results for neutral-current-induced soft-pion production under the simplifying as¬ 
sumption of a purely Isoscalar V , A neutral current. We find in this case that the magnitude 
of the Brookhaven observations exceeds the theoretical maximum by more than a factor of 2 
unless the assumed Isoscalar current either contains a vector part with an anomalously large 
gyro magnetic ratio \g\"\2M N F t /Fi\ or involves the ninth (SU a singlet) axial-vector current. 

A vector part with a large | g\ value leads to characteristic modifications in the pion-nucleon 
invariant-mass spectrum for M(kN) ^ 1.4 GeV, an effect which should be testable in high- 
statistics experiments. Two other qualitative predictions of Isoscalar V, A structures are 
(1) except for a narrow range of values oig, constructive V, A Interference in v +N—v +N 
+ it implies constructive Interference in v+p — i'+p and vice versa, and (li) If V, A interfer¬ 
ence is observed in neutral weak processes then (as is well-known) the neutral interaction 
may make a parity-violating contribution to the pp, ep, and fjp interactions. These features 
may help to distinguish V t A neutral-current couplings from alternative coupling types, which 
will be discussed in detail in subsequent papers of this series. 


I. INTRODUCTION 

The initial experiments discovering weak neutral 
currents in high-energy inclusive neutrino-nucleon 
scattering 1 have now been supplemented with the 
observation of neutral-current effects in the ex¬ 
clusive channel containing a pion-nucleon final 
state. Obtainable resolutions will permit the de¬ 
tailed study of pion-nucleon invariant-mass dis¬ 
tributions in the region at and below the (3, 3) reso¬ 
nance. Some of the issues raised by recent 
Argonne National Laboratory (ANL) 2 and Brook¬ 
haven National Laboratory (BNL) 3 data on neutral - 
current exclusive channels are: (i) What is the ex¬ 
pected magnitude for threshold (in invariant mass) 
neutral-current pion production? (ii) What are 
the implications if (3, 3) resonance excitation is 
not observed in neutral-current pion production? 

In the present paper we analyze these questions 
under the conventional assumption that the weak 
neutral current has a V t A spatial structure. A 
preliminary account of the analysis has appeared 
elsewhere. 4 In subsequent publications, 5 the same 

12 


methods will be applied to the more general cases 
in which neutral-current couplings of $, P , T type 
appear, or in which V, A neutral currents with 
abnormal G parity are present. 

The paper is organized as follow^. In Sec. II we 
give a simple (although somewhat naive) analytic 
treatment of threshold pion production in the case 
when the neutral current is of pure isoscalar form, 
and use it to illustrate the methods employed in 
the more detailed treatments which follow. In Sec. 
Ill we develop the ingredients needed for a more 
elaborate treatment of bounds on soft-pion produc¬ 
tion. We first review the standard formulas de¬ 
scribing neutrino-proton elastic scattering and 
deep-inelastic inclusive neutrino-nucleon scatter¬ 
ing, the latter both in a general framework and 
within the context of quark-parton-model assump¬ 
tions. We then describe the modifications needed 
to make our old dispersion-theoretic model for 
soft-pion production in the (3, 3) resonance region 6 
consistent with all soft-pion theorem constraints, 
and discuss the inclusion of a well-defined set of 
corrections to the soft-pion limit. In Sec. IV we 
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give results of numerical studies of the pion pro¬ 
duction model, which show its validity in the 
charged current case. We then apply the formulas 
developed in Sec. Ill to a detailed numerical anal¬ 
ysis of threshold pion production for the ANL neu¬ 
trino flux case, considering a succession of more 
complex models for the structure of the neutral 
current, leading up to the most general neutral 
current which can be formed from members of 
the usual V,A nonets. We finally analyze low-in¬ 
variant-mass [W=M(ttN) <1.4 GeV] pion production 
for the BNL neutrino-flux spectrum, under the 
simplifying assumption of a pure isoscalar V, A 
neutral current. In Appendix A, we give the 
threshold low-energy theorem (analogous to that 
developed in Sec. II) which applies in the case of 
the SU(2)®U(1) model neutral current. In Appen¬ 
dix B, we attempt a rough estimate of the leading 
corrections to the soft-pion limit in the case of an 
isoscalar (octet) axial-vector current, and esti¬ 


mate the extent to which the corresponding cor¬ 
rections in the isovector current case are already 
included in the basic pion production model as a 
result of unitarization of the (3, 3) multipoles. In 
Appendix C, we discuss nuclear charge-exchange 
corrections for low-invariant-mass weak pion 
production and give a tabulation of the charge-ex¬ 
change matrices for various nuclear targets of 
current theoretical interest. 

II. SIMPLE ANALYTIC TREATMENT 

We begin by giving a simple analytic treatment 
of threshold pion production, which, although 
somewhat naive, nonetheless illustrates the basic 
ideas exploited below in our more careful numer¬ 
ical calculations. The starting point for our deri¬ 
vation is the standard soft-pion formula 7 for pion 
emission in the process 3+N-* v J + N, with 3 a gen¬ 
eral external current and N a nucleon. This reads 6 


j 


m 13(0) i =- 31#, * a) r ttz- j; (* -«) 

L MsSa 


_ dy T - ^2 - T(h\ —J(k) _A _ g T J 

2 JK* 5 i M 1l 2 +2p 2 *q J{k) JW M*-2p x 'q 2 M N ty * T * 

+possible additional pion-pole “seagull” contribution] u(p x )tp* + 0(q) , 



with 


dl N - 


_(m n m n y /2 


PlO p2 0 


01* = W*(2tf 0 ) 


-1/2 


(N(p 2 ) 1 3(0) I N(p x )) =01* U(P 2 ) J(P 2 - p x ) u(p x ), (2) 

I [Ff, 3(0)] | N(p x )) *** u(p 2 ) JJ (p 2 - Pl ) u(p x ) . 


In Eqs. (1) and (2), k=p 2 +q-p x denotes the four- 
momentum carried by the external current, 
g r «13.5 is the pion-nucleon coupling constant, 
g A »1.24 is the nucleon axial-vector renormaliza¬ 
tion constant, w(/> 2 ), u(p x ) are nucleon spinors (in¬ 
cluding isospinors), and if)j is the isospin wave 
function of the emitted pion. The first term on the 
right-hand side of Eq. (1) is the usual equal-time 
commutator term which appears in soft-pion the¬ 
orems, while the second and third terms are ex¬ 
ternal-line-insertion terms in which the soft pion 
is emitted, respectively, from the final and initial 
nucleon lines. The additional pion-pole “seagull” 
piece 9 is necessary only when the pion-pole con¬ 
tributions of the first three terms do not add up to 
give the full pion-pole contribution expected for the 
reaction 3 + N-* v* + N. 


Let us now specialize to the case of an isoscalar 
V,A external current 3 t for which the equal-time 
commutator term vanishes 10 and for which there 
is no additional pion-pole “seagull” contribution. 
The entire soft-pion emission amplitude then 
comes from the external-line-insertion terms, 
which are most conveniently evaluated in the iso- 
baric frame in which the final pion-nucleon sys¬ 
tem is at rest. In this frame, the insertion on the 
outgoing nucleon line vanishes at threshold in in¬ 
variant mass, since when pj=q =0 we have 

^(^ 2 ) ^"¥*($2 /t) | p^ss qao - Qq u(P 2 ) To ^ 5(^2 if) |p 2 *o 

= 0. (3) 

So at threshold, for an isoscalar V,A current 3, 
the matrix element of Eq. (1) reduces to the single 
term 

<MA)ir(?)|a(0)|MA)> 

(4) 

On replacing the projection operator (& x +M N )/2M N 



R38 


487 


12 


STEPHEN L. ADLER 


2646 


by Yja u (Pi s ) a nd explicitly indicating the nuc¬ 

leon isospinors x£» Xi and the helicity Sj of the ini¬ 
tial nucleon spinor w(/0, we find 

(*A) *(q)\3(o)\M,p l )) 

= £ X* IT “(A) J(k) U(p x s) M g J 2p ° 
x[“(/»iS) r 0 y 5 uip^)] a , (5) 


with 


a =x 2 t ^x 1 v; 


S' 


= 


V. 


1 for p-»p + 7i° 

- 1 for n— n + u° 
V2 for n — p + TT~ 
/2 for p-*n + tt + 


( 6 ) 


The factor in square brackets in Eq. (5) is readily 
evaluated by using explicit expressions for the 
spinors, giving 


“(/>! s) y q y 5 k(M) = (^ i 'im n n ) xi (i, 


ff-k 


PlQ + M/( 



= -X 


0 1 

-1 o) ^-cr-k/(/. 10 +AT^ 

t <?*£ 

3 Tr~X, 

M n 1 


X,, 


Ik I e 

M n 1 ’ 


(7) 


where we have used the definition 

— $*k X# t sS iXs 1 (®) 

of the initial-state helicity. Since s 1 ~±l t this fac¬ 
tor disappears when we square and sum over ini¬ 
tial and final nucleon spins, so we get 

i £ I<MA)^)I4(0)|M/> 1 ))| 2 

N spins 

<l*,l ^threshold 


di N 




N spins 


Sr V L^l .2 


<M w -2p l0 / M n 


a‘ . 


(9) 


If we now make the approximation of neglecting 
the pion mass in all kinematics, the factor in 
square brackets in Eq. (9) becomes just the 
squared, spin-averaged matrix element (\M | 2 ) for 
elastic scattering, and Eq. (9) tells us that 

( I 3K x I 2 )threshold 




'10 




f/4Af* 2 ) 


t/2M/) 


Inserting phase-space factors according to 
do^ + p-v + p) 1 1 2/I9KI2) 

dt 4tt E 2 v ^ 1 ' ’ 


. . t=-k 2 


( 10 ) 


( 11 ) 


do(v + N-v + N+Tr) 1 Iql ... 2 /icm ia\ 
dtdW 16tt 3 E 2 ' ’ 


with |q| the pion isobaric frame three-momentum, 
W the invariant mass of the final irN isobar, and 
E the initial lab neutrino energy, we get finally 
the relation 

1 do(v + N-» v + N+n) I 

Iq I dt dW {threshold 

B a —(*rMA 2 _L + 

4tt 2 M x 2 \ 2M n ) M n 2 \ 4 M h 2 ) 


X 1 + 


t 


-2 


2 M/ 


da(v+p— v+p) 

dt 


a 2 = 


1 for 7T° production 

2 for it* production . 


( 12 ) 


We see that in the special case which has been 
under consideration, instead of obtaining a soft- 
pion relation between matrix elements, we obtain 
a relation directly in terms of reaction cross sec¬ 
tions. The significance of Eq. (12) is that it allows 
one to translate an upper boimd on the strength of 
v + /> — v + p into an upper bound on the strength of 
threshold pion production by the weak neutral cur¬ 
rent . 

As an illustration, let us apply Eq. (12) to the 
ANL data 2 by integrating over t and averaging 
over the ANL neutrino energy flux 11 w AN1 ,(£), 
giving 


/ 


dE "anl ( £ ) 


M n 2 dxj{v+n-»v + p + 7T~') 

IqT 


dW 


2 M 


4 it 2 M 2 




max 


») 


dt 


Ms 


1 + 


4 Ms 1 


1 + 


-±-\ 
2 Ms 2 ) 


~ 2 d0(v+p— v + P) 
dt 


(13) 
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We have multiplied both sides of Eq. (12) by M m 2 so that they have the dimensions of a cross section; also 
for convenience, we assume the flux w ANL (£) to be unit normalized, 

fdEn ANL (E) = l, 04) 

so that we are considering flux-averaged cross sections. Using the ANL 95% confidence bound 12 

ct anl (i * + p-»v+p) ^0.32a ANL (i/ + w— +P) «0.25xl0" 36 cm 2 , (15) 

and assuming the / dependence of the charged-current quasielastic and neutral-current elastic cross sec¬ 
tions to be similar, 13 * 14 we find that the right-hand side of Eq. (13) is bounded by 0.32x0.46 xlO' 36 cm 2 
= 0.15xl0' 36 cm 2 . Using 20-MeV invariant-mass bins, we can then estimate a bound on the flux-averaged 
cross section in the two bins nearest threshold as shown in Table I, giving the result 

o A bin 5 a ( v + n v + P + ANL flux averaged, 1.08 GeV ^ 1.12 GeV) ^0.6 x 10 41 cm 2 . (16) 


For comparison, the preliminary ANL data on 
+ v + p + tt“, before final background subtrac¬ 
tion, showed ~5 events in the first two bins, which 
would have corresponded to a cross section of 

° , 2 bi>> (before background subtraction) 

~20xi0~ 41 cm 2 , (17) 

in strong violation of the bound of Eq. (16). It is 
now considered very probable that these events do 
not represent a true neutral-current effect, but 
arise from various neutron-induced backgrounds. 

As we have already remarked, the above treat¬ 
ment is too naive in a number of respects. First 
of all, the restriction to cases, such as 10 that of 
an isoscalar neutral current <0, for which the 
equal-time commutator term vanishes excludes 
from consideration such processes as ir' produc¬ 
tion in the SU(2)<S>U(1) gauge model. Secondly, the 
external-line-insertion terms are rapidly varying 
pole terms, and so the kinematic approximation 
of neglecting M w in calculating them can be dan¬ 
gerous. Finally, it is important to estimate the 
leading 0(q) corrections to the soft-pion approxi¬ 
mation, and to calculate the effects in the thresh¬ 
old region of the tail of the (3, 3) resonance. As 
discussed in detail in Sec. Ill, we deal with these 
problems by using an extended version of a model 
for the weak pion-production amplitude which we 
have described elsewhere. 6 The extension will 
permit us to study the entire low-invariant-mass 


region W ^ 1.4 GeV, rather than just the first 40 
MeV or so around threshold. For completeness, 
however, we give in Appendix A the analog of the 
threshold low-energy theorem of Eq. (12) for the 
case of the SU(2)®U(1)-model neutral current. 

The formulas of Appendix A still neglect the pion 
mass in the kinematics [as well as the leading 
0(q) corrections and the (3, 3) resonance tail] and 
are not used in the subsequent numerical work. 

HI. DETAILED TREATMENT 

We proceed in this section to set out the basis 
for a more detailed numerical treatment of bounds 
on weak pion production by a V,A weak neutral 
current. The basic idea, as developed above, is 
to use soft-pion techniques to relate weak pion 
production to elastic neutrino-proton scattering, 
and to use experimental bounds on the latter. It 
will also be useful, at some stages of the analysis, 
to impose constraints obtained from experimental 
data 1 * 15 on deep-inelastic inclusive neutrino-nu¬ 
cleon scattering induced by the weak neutral cur¬ 
rent. In Sec. Ill A we give the necessary vertex 
structure and cross-section formulas needed to 
describe neutrino-nucleon elastic scattering. In 
Sec. Ill B we give the necessary formulas for using 
deep-inelastic information; first, in a rather gen¬ 
eral form assuming only scaling and the fact that 

v(v + N-» m + + T)/v(v + N— + r) , 


TABLE I. Application of Eq. ( 13 ) to bound the cross section for v+n — v+p +7r" in the two 
ANL 20-MeV bins nearest Invariant-mass threshold. 


W at bin 

center 

|q|/M* 

dW/M N 

Bound on 
right-hand side 
of Eq. (13) 

Bound on cross section in bin 
[(|q|dH , /M JV 2 )x0.15xl0~ 33 cm 2 ] 

1.09 

6.4 xlO' 2 


0.15x 10 -38 cm 2 

0.20 xlO' 41 cm 2 

1.11 

1.1 xlO' 1 


0.15X10" 33 cm 2 

0.35 xlO' 41 cm 2 
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and then in a more restrictive form which makes 
use of quark-parton model and quark-model as¬ 
sumptions. Finally, in Sec. Ill C we develop the 
extended weak-pion-production model which rem¬ 
edies the defects in our naive treatment enumer¬ 
ated at the end of Sec. II. 

A. Elastic neutrino-nucleon scattering 

We shall consider in what follows the most gen¬ 
eral V,A weak neutral current which can be formed 
from members of the usual vector and axial-vec¬ 
tor nonets. We write for the neutral-current ef¬ 
fective Lagrangian 


£j * ff= 7T ~ Y ^ 

= Sy 0 + Sr a &a +Sr e 

- SAo &i! X - gA 3 31 X - gAe X , 


(18) 


with nonet currents represented in the 

quark model (with quark field ip) by 16 




(19) 


We express the nucleon matrix elements of the 
neutral members of these current nonets in the 
form 16 


( 20 ) 


<MWl Wr'+iF}* (feV^feJ *,«(/>,) , 

(n(P 2 )I S5 X IMA» = **u(A)tsi' > (ft 2 ) y\ +j6' ) (k 2 )k\]t jU ( Pl ), 

^0 = 2(t) 1</2 , t 3 = 2 T a, = 2 (I)) 1 / 2 . 

The vector and axial-vector form factors defined in Eq. (20) are related to the standard nucleon form fac- 
tors if; 2 s (ft 2 ), gAk 2 ), hAk 2 ) by 

*! 3 > 2 ) = F l 2 (fe 2 ), ifi 3) (* 2 ) =rf) , 

^(fe 2 ) = 3J?f, 2 (fe 2 ), h'-2 ] (ft 2 ) = h A (&) . 

Defining total form factors J*f #2 (fe a ) p gx(k 2 ) by 

fT. 2 (* 2 ) +^g r3 Ft\(k 2 ) ^^) l/2 gr e F\ a U k2 ') . 

£a(* 2 ) =i (sV^gAoSAhk 2 ) +UgA3gA } (k 2 ) +2(5) l/2 ^A8-?i 8) (* 2 ) , 

with e=l for v+p~ v+p and e = -l for v+n~ v+n t the differential cross section for neutrino-nucleon scat¬ 
tering takes the form 


( 21 ) 


( 22 ) 


do(v +N— i/+N) _ G 2 

dt ~ ZtiE 2 M n 2 


{[1^ | 2 + l^| 2 +i|lf \ 2 ][4M„ 2 E 2 ~W m 2 +2 M„E)\ 

+ii[li£l 2 (i+4M A , 2 ) + |ff +2M*if | 2 t]+Re[^*(lf +2M „ F?)]t(AM M E - t)} . 

(23) 


For incident antineutrinos, the sign of in Eq. 
(23) is reversed. 


B. Deep-inelastic inclusive neutrino-nucleon scattering 

We turn next to the constraints on the coeffi¬ 
cients appearing in Eq. (18) which are imposed 
by experimental measurements 1 * 15 of the deep- 
inelastic inclusive neutrino-nucleon scattering 
ratios 8 

Rv =a(p+N— v + r)/o(v+N~ +r) , 

(24) 

R„ s a(i/ +N— v + T)/v(v + N~ + T) . 

The charged-current-induced reactions in the de¬ 
nominators in Eq. (24) are described by the usual 
charged-current effective Lagrangian 


£>% = ~^ 2 ~ M7\(l +adjoint , (25) 

with 

eJ C h = CO80c (^i + i2 “ 

+ sin0c (^ 4 +is “ ® 4 +ts) • (26a) 

In what follows we aim only at getting formulas 
which hold to an accuracy of 10 or 20%, and so we 
make at the outset the approximation of taking the 
Cabibbo angle 6 C to be zero, which simplifies Eq. 
(26a) to read 

* (26b) 

The virtue of using Eq. (26b) is that the vector 
and axial-vector parts of the charged current are 
then related by an isospin rotation to the corre- 
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sponding iso vector vector and axial-vector terms 
In the neutral current of Eq. (18). 

To proceed with the analysis, we assume the 
validity of Bjorken scaling 17 in deep-inelastic 
charged-current and neutral-current-induced 
inclusive neutrino reactions. Considering for the 
moment the charged-current cross sections 
ct(v+N— iT +T) and v(V+N~ n + + D, we review a 
standard analysis 18 starting from the formula 

q(v+N— + r) 
ct(v+n— + r) 

= Ja dxa s + 3 Jndxxa L +fg dxxa R 

J 1 dxa s + J l dxxa L + \ J*dxxa R 9 


where 

a L =F l -iF 3 zO , 

a R = ^+^3^0 , 

x=l/a> = scaling variable , 


contributions 


F u2 (x) = FUx) + Ff; 2 (x) , 


(29) 


(30) 


we may rewrite the relations of Eq. (28) as 

F Y 2 (x) +i^(x)»24i^(*)+*•?(*)]. 

Comparing Eq. (30) with the Schwarz inequality 19 
i | F 3 (x) | [i^(*) Ff (x)] l/ ^l[F Y l (x) + F*(x)) (31) 
and the positivity inequalities 
F y 2 (x)>2xF\(x) , 

Ff(x)»2xFt(x) , 
we learn that 

F\(x) « Ff(x) *Yx Fr * (x) = Tx F t {x) 


(32) 




(33) 


with F lt2 , 3 the deep-inelastic structure functions 
in the scaling limit, and where an average nu¬ 
cleon target N~^(n+p) has been assumed. Em¬ 
pirically, it is found that 

v(V + N~ ^ + + r)/a(v + N— + r) * 3 , 

which implies that a s ” a R that is, 

F 3 (x)*-2F 1 (x), F 2 (x)*2xF 1 (x) . (28) 

Splitting F x and F 2 into vector and axial-vector 


Now let us turn our attention to the deep-inelas¬ 
tic ratios of Eq. (24). If we again take Nto be an 
average nucleon target, the isovector and iso¬ 
scalar terms in the neutral current of Eq. (18) do 
not interfere, and so we get a lower bound on R v 
and Rv by neglecting the isoscalar contributions 
to the cross section. Using the fact, already men¬ 
tioned, that the isovector pieces of Eq. (18) are 
related to the corresponding isovector pieces of 
Eq. (26) by an isospin rotation, we find that 


( R A> I -C dx Wa 2 [ 3 1 (*) + 2^(*)] +gAr. 2 [?*Ff(x) +S F*(x)\?gy 3 g A3 %XF,(X)} 

\Rv/ 2 £ dxlkxF^x) +| F 2 (x) t $x J? 3 (x)] 


Substituting now the relations of Eq. (33), we get 
the simple inequalities 20 

Rv *HgV3 +gA 3 2 +gY 3 gA 3 ) , 

(*5) 

Rv *2 (gy 3 +gA 3 -gy 3 gA 3 ) ■ 

When added in the linear combination 3 R v +R» 
which eliminates the vector—axial-vector inter¬ 
ference term, and combined with 95% confidence 
limits inferred from current measurements of 
R v and R^ t the inequalities of Eq. (35) yield the 
constraint 

1.5 ^ ZR V +Rv^g Vs 2 +gA3 2 , (36) 

which will be used in our subsequent analysis. As 
we have already emphasized, in getting Eq. (36) 
we have only used the assumption of scaling to¬ 
gether with the empirical observation of a 


charged-current antineutrino-to-neutrino inclu¬ 
sive cross-section ratio of 
In order to strengthen Eq. (36) so as to include 
the isoscalar current terms in Eq. (18), it is nec¬ 
essary to go beyond the assumptions just stated 
by using information from the quark-parton model. 
Specifically, we will make use of the standard 
spin-i quark-parton model for deep-inelastic 
scattering, 21 with the additional assumptions that 
the strange parton and the antiparton content of the 
nucleon may be neglected 22 [the latter of these as¬ 
sumptions is suggested by the approximate rela¬ 
tions of Eq. (33)]. The quark-parton model in this 
form is expected 82 to be good to an accuracy of 
order 20%, and has the great virtue that all x 
dependence (for an average nucleon target) ap¬ 
pears in a single universal over-all factor which 
drops out in the ratios R Vt u* A straightforward 
calculation then gives 
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By 

MBy. 


= f {[gv oi(f)‘ /2 +gy e i (i) 1 /2 ] 2 +(igy 3 ) 2 +[**, i (!)‘ /2 +«* * (i)‘ /2 ] 2 +(4^,) 2 } 
±!{^a(!) 1/2 + ^.i(i) 1/2 ] [^ 0 i(f) 1/2 W(i) 1/2 MifK3 W , 


1.5 s Us- + 3R„ = \gy 0 (I ) 1 /2 +gra (*)» '*] 2 +gr * + [ gAa (l ) 1 ' 2 + gAt (i)‘ ' 2 ] 2 +^ 2 . 


(37) 


r 


One additional piece of information which will be 
needed, in order to use the constraint of Eq. (37) 
in an analysis of low-energy pion production, is 
knowledge of the renormalization constants 
^i°’ 8) (0) and f£°’ 8, (0) which describe the one-nu¬ 
cleon matrix elements of the isoscalar currents 
appearing in Eq. (18). The constant ^i 8) (0) is 
fairly reliably fixed by SU(3) to have the value 23 

gi B \0) »(3-4x0.66) 1.24 =0.45 , (38a) 

while the measured value of f£ 6) (0) is 

2M*F< 6) (0)/Fi 8) (0) = -0.12 . (38b) 

For the constants ^i 0) (0) and F£ o) ( 0) recourse 
must be made to a quark-model analysis of cur¬ 
rent-renormalization constants, 16 which gives 24 


^i o) (0)=! 1.24 = 0.74 , 

(39) 

2 M„ E' o) (0)/E ( 1 0, (0) s- 0.1 
for the unitary-singlet renormalization constants. 

C. Extended model for weak pion production 

We turn finally to a description of the extended 
model for weak pion production which we will use 
in the numerical calculations of Sec. IV. As an 
aid to the discussion which follows, let us first 
rewrite the pion-production matrix element of Eq. 
(1) in an alternative form, obtained by rearrang¬ 
ing the pseudovector-coupling external-nucleon- 
line-insertion terms which appear there into 
pseudoscalar-coupling Born terms of the usual 
form. This gives 


i 


(N[p,)v(q) 13 (0) I mpi) = - X « y “(&) 




* T , Alt!".A: --i+Bi 

2 Mn v— v B v + v B 2M /( 


possible additional pion-pole “seagull” contribution 


uiPjili*+0(q) , 


(40) 


with 


v=(Pi +p2)' k /( 2M /t) • 


(41) 


v B = -g*fc/(2Af*) , 

and with all other quantities as defined above. 

The anticommutator term which has appeared in 
Eq. (40) is the PCAC (partial conservation of 
axial-vector current) “consistency-condition” 
term 25 arising from the pseudovector-to-pseudo- 
scalar rearrangement. 

With the aid of Eq. (4), we can now proceed to 
discuss the pion-production model, which is an 
extension of a calculation of weak pion production 
in the (3, 3) resonance region which we have de¬ 


scribed in detail elsewhere. 6 In its original form, 
the model included the pseudoscalar-coupling Bom 
terms and the pion-pole terms of Eq. (40), with no 
kinematic approximations. In addition, the dom¬ 
inant (3, 3) multipoles were unitarized by the 
method used in the CGLN treatment of pion photo- 
production, 26 so as to correctly describe (3, 3) 
resonance excitation. Our basic extended pion- 
production model is obtained by adding the com¬ 
mutator term in Eq. (40) (evaluated at q =0, ex¬ 
cept where a pion pole appears) and the “consis¬ 
tency-condition” term to the Born approximation 
and resonant terms of the original model, yield¬ 
ing a pion-production amplitude which has the cor¬ 
rect soft-pion limit. In terms of the amplitudes 
Vj * 0) , 7 = 1,..., 6 and A<f \ j = 1,..., 8 used in 
Ref. 6 the additions are 27 
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AV< +) 

~ Sr 

Fl(k 2 ) , 



av'; 1 


\gA(k 2 ) p 


Af* 

- Sa 




.2\-l 


r\ 


(42) 


AA<"> TTT- [F\(&) -gAgAik 1 ) 

Afy gA 

+2 M K Fl(#)] . 

Note that the terms referred to in Ref. 6 as “dis¬ 
persion-relation corrections to the small partial 
waves” are omitted from the amplitude, since 
including them along with the additions of Eq. (42) 
would involve double counting (and also for the 
practical reason that the numerical evaluation of 
the dispersion-relation terms is very costly in 
terms of computer time). 

A further elaboration on the pion-production 
model consists of adding in the leading correc¬ 
tions (of first order in the pion four-momentum q 
and zeroth order in the lepton four-momentum 
transfer k) to the soft-pion limit. These correc¬ 
tions are calculated by the method of Low 28 and 
Adler and Dothan 28 ; for the vector amplitude they 
vanish (as a result of vector current conservation), 
while for the isovector axial-vector amplitude 
they are related by PCAC to momentum deriva¬ 
tives of the pion-nucleon scattering amplitude at 
the crossing symmetric point. For an isoscalar 
axial-vector current the 0(?) corrections cannot 
be precisely calculated, but an heuristic reso¬ 
nance dominance argument given in Appendix B 
suggests that they may be relatively considerably 
smaller than in the isovector axial-vector case, 
and so we neglect them. For the isovector axial- 
vector amplitudes, the calculations of Ref. 28 
tell us that 27 


Sr 




* /1 + K. 

2 ' " A gA 


2AV \g A 
»0.36 , 




(43a) 


gr 0 *'a 


l/= l/£ a 0 



I 0 I L'sif^|9fe 2 B0 t 

with the superscript B indicating the Born approx¬ 
imation and with the numerical values in units in 


which Af ff = l. In order to apply Eq. (43a), we must 
first estimate the extent to which the amplitudes 
A^A^ in our basic pion-production model differ 
from their Born approximations as a result of 
unitarization of the (3,3) multipoles. This is done 
at the end of Appendix B, with the result 

[A! _) | J“ slcra0<lel »0.21 , 

(43b) 

[A< +) -A' +)B ] |ba=c modd^Q.84 . 


Hence to bring the basic model into agreement 
with Eq. (43a) we add the 0(q) corrections 


AA c f> *0.15(1 +k 2 /M 2 )" 2 , 
AA< +) *1.96(1 +k 2 /M 2 )" 2 . 


(43c) 


Only the k 2 = 0 values of the correction terms are 
actually determined by low-energy theorem argu¬ 
ments; however, to avoid a spurious dominance 
of these correction terms at large fe 2 , we have in¬ 
cluded an ad hoc dipole form factor 27 (1 +fe 2 /Af 2 )“ 2 , 
characterized by a dipole mass AT. In the numer¬ 
ical work of Sec. IV, Af was taken equal to the nu¬ 
cleon mass My =0.94 GeV, which is rather typical 
of the dipole mass values 29 found in both the vec¬ 
tor and the axial-vector form factors. As we will 
see in Sec. IVA, substantial variations of Af about 
this value have a relatively small effect on the 
magnitude of the 0(q) corrections to the threshold 
pion-production cross sections. While the inclu¬ 
sion of the order ~q corrections may be an im¬ 
provement in the amplitude near threshold (or at 
a minimum, should give an idea of the likely im¬ 
portance of corrections to the basic soft-pion ma¬ 
trix element), their undamped growth as q in¬ 
creases makes their inclusion of doubtful value 
away from the threshold region. To illustrate 
this, we also evaluate the 0(q) corrections ac¬ 
cording to the modified recipe 

AA<-> =(M*/in0.15(l+fe7M 2 )" 2 

/ x (43d) 

*{M„/W) 1.96(1 +k 2 /M 2 )~ 2 , 

which agrees with Eq. (43c) at p = v B =0, but which 
grows less rapidly with increasing W. To sum up, 
the fully extended pion-production model which we 
have Just described contains both nucleon and pion 
Born diagrams with no kinematic approximations, 
includes the dominant (3, 3) multipoles in unitar- 
ized form, and agrees with all low-energy-the- 
orem constraints through terms of first order in 
q and k , with an error of order qk at most. It 
should thus give a reasonably accurate descrip¬ 
tion of low-invariant-maaa pion production, par¬ 
ticularly in the region close to invariant-mass 
threshold. 
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IV. NUMERICAL RESULTS 

We turn now to numerical calculations using the 
pion-production model developed above. In Sec. 

IV A we give the results of numerical studies of 
the model, in which we examine the effect of the 
0(q ) corrections of Eqs. (43c), (43d) and explore 
their sensitivity to the mass parameter Af, and in 
which we compare the predictions of the model for 
charged-current-induced neutrino pion production 
with experiment. In Sec. IV B we use the model 
to give bounds on the ANL neutral-current-in¬ 
duced threshold pion-production cross section, 
for a variety of different models for the structure 
of the weak neutral current. Finally, in Sec. IV C 
we study low-invariant-mass pion production in 
the BNL neutrino flux, in the special case of a 
pure isoscalar weak neutral current. 

A Numerical studies of the model 

We begin our numerical examination of the pion- 
production model of Sec. Ill C with a study of the 
0(q) correction terms added to the weak pion pro¬ 
duction amplitude in Eq. (43). In Table n we give 
theoretical ANL 2-bin cross sections, defined as 
in Eq. (16), for the seven allowed charged- and 
neutral-current-induced pion-production reactions. 
In column 2 we give the cross section obtained 
without the 0(q) correction [that is, from the basic 
pion-production model including the soft-pion ad¬ 
ditions of Eq. (42), but without the additions of 
Eqs. (43c) or (43d)]. In columns 3, 4, and 5 we 
give the corresponding cross sections with the 
0(q) corrections included as in Eq. (43c), taking 
the ad hoc dipole mass Af as Af*, Af*/V"2 , and 
M N if 2, respectively. In column 6 we give the 
cross sections calculated with the 0(q ) correc¬ 
tions included as in Eq. (43d), withAf We 

see that the 0(q) corrections have a substantial 
effect on threshold cross sections for v+p~ v 
+P + tt° and v+n + ir 0 , a moderate effect on 


the cross section for v+p~ +p + ir+ 9 and a rela¬ 
tively amall effect on the remaining reactions. As 
expected In the threshold region, the recipes of 
Eq. (43c) and Eq. (43d) for the 0(q) corrections 
give similar results; we also see that the varia¬ 
tion in the threshold cross section asM 2 is 
changed by a factor of 2 in either direction from 
M z ~M n 2 is smaller than the effect of including the 
0(q) corrections, indicating that the sensitivity 
to the value of the mass parameter Af is not ex¬ 
cessive. In Table m we show the effect of the 
0(q) additions on the ANL and BNL cross sections 
integrated over the low-invariant-mass region 
W^IA GeV- Again, the reactions v+N- v+N+tt° 
are sensitive to the 0(q) additions, with the 
effects on the other cross sections ranging from 
moderate to small. Here, however, we see a sub¬ 
stantial dependence on whether the recipe of Eq. 
(43c) or of Eq. (43d) is used, indicating that the 
0(q) additions do not constitute a well-defined 
correction to the basic pion production model 
outside the threshold region. A satisfactory treat¬ 
ment of the 0(q) terms away from threshold would 
require their interpretation as the low-energy 
limits of appropriate particle exchange terms. In 
the numerical work on the ANL threshold cross 
sections for v+n-* v+p + ir~ described in Sec. IVB, 
we will include the 0(q) correction terms with 
M~M n . In the numerical work of Sec. IV C ana¬ 
lyzing the isoscalar case for the BNL spectrum, 
we will neglect the 0(q) corrections—they vanish 
for an isoscalar vector current and, as argued in 
Appendix B, may be relatively small (although 
hard to estimate precisely) for an isoscalar axial- 
vector current. 

Obviously, the best way to assess the reliability 
of the pion-production model developed in Sec. 

Ill C is to compare its predictions for charged- 
current-induced pion production reactions with 
experiment. In Fig. 1 the ANL data 30 for v+p 
— +p+ir + are plotted together with predictions 


TABLE II. Effect of O (q) additions of Eqs. (43c), (43d) and sensitivity to the ad hoc dipole 
parameter M in the ANL threshold region. Neutral-current cross sections are calculated in 
the Weinberg-Salam model, with s'm 2 fl w = 0.35 and = 0 [see Eq. (52)]. 


Reaction 

Without O (q) 

Values of aj bb 11110 ~ 41 
With Ofq) from Eq. (43c) 

Af -M/f M =M ff'/Z 

cm 2 

With O iq) 
from Eq. (43d) 

v +n — n~ + p +ir° 

3.8 

3.5 

3.6 

3.5 

3.6 

v +p -*p,~ + p + 7 r + 

4.5 

5.9 

5.6 

6.4 

5.7 

i ' + n-~ n~ + n +7r + 

2.3 

2.1 

2.0 

2.2 

2.0 

v+p-~ v + p + 1 ° 

0.42 

0.73 

0.61 

0.88 

0.64 

u + n — u+ n + 

0.47 

0-77 

0.65 

0.92 

0-69 

i' + n— u + p + iT 

0.91 

0-80 

0.82 

0.77 

0-81 

v + p — v + n +7r + 

0.97 

0.87 

0.89 

0.84 

0.88 
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TABLE m. Effect of 0(q) additions of Eqs. (43c), (43d) in the ANL and BNL (3,3) reso¬ 
nance regions, defined by W ^ 1.4 GeV. Neutral-current cross sections are calculated in the 
Weinberg-Salam model, with sin 2 e„,=0.35 and AcJ^ =0 [see Eq. (52)]. As is evident from the 
differences between the two recipes for adding in the Oiq) terms away from the threshold re¬ 
gion, the O (q) additions do not constitute a well-defined correction to the basic pion produc¬ 
tion model when the entire (3, 3) resonance region is considered. However, they do usefully 
indicate which channels maj' prove to be particularly sensitive to corrections to the basic 
model. 


Values of Values of 

tr ANL 1.4 GeV) <r BNL (W ^ 1.4 GeV) 

in 10- 3B cm 2 in 10~ 38 cm 2 

Without With 0(q) Without With 0(?) 

Reaction Ofc) Eq. (43c) Eq. (43d) 0<q) Eq. (43c) Eq. (43d) 


V + «—'* H~ +P +7T° 

0.0733 

0.0694 

0.0698 

0.147 

0.143 

0.143 

V + p-»H~ +p + IT* 

0.219 

0.237 

0.228 

0.427 

0.499 

0.466 

i' + n—fx“ + »+7r + 

0.0571 

0.0534 

0.0491 

0.129 

0.134 

0.116 

V + P — V + P + 7T° 

0.0283 

0.0348 

0.0308 

0.0532 

0.0765 

0.0629 

y + n — u + n +tt° 

0.0288 

0.0350 

0.0311 

0.0539 

0.0768 

0.0634 

y + n — V + P + TT~ 

0.0192 

0.0181 

0.0181 

0.0366 

0.0352 

0.0351 

V+p-» V + n +7T + 

0.0204 

0.0191 

0.0192 

0.0383 

0.0367 

0.0367 


of the pion production model, both with 0(q) ad¬ 
ditions (curves b and c) and without these additions 
(curve a). The theoretical curves are evidently 
low by 30-40% in the case of curve a and by 
smaller amounts in the cases of curves b and c. 
Part of this discrepancy may arise from uncer¬ 
tainties in the absolute level of the ANL neutrino 
flux (these uncertainties are included in the ex¬ 
perimental error bars) and in the value of the 
axial-vector mass parameter 31 Af A , but part is 
probably due to the known 8 tendency of the pion- 
production model to underestimate pion-produc- 



FIG. 1. Comparison of the extended pion-production 
model of Sec. me with the ANL data for v + /> — 

+ P + 7r + . Curve a, basic model containing Bora, reso¬ 
nant, and soft-plon terms; curve b, basic model with 
O (q) additions from Eq. (43c); curve c, basic model 
with O (q) additions from Eq. (43d). 


tion cross sections for | fe 2 | ^ 0.6 (GeV/c) 2 . To 
minimize this problem, in discussing the BNL 
isoscalar-current case in Sec. IV C we will always 
compare ratios of cross sections computed within 
the pion production model with the corresponding 
ratios obtained experimentally, rather than mak¬ 
ing direct comparisons of cross sections between 
theory and experiment. In Table IV we compare 
preliminary ANL values 32 of the ratios a(i/ + w — /x" 

+ P + v°)/o(v+p-* +/> + tt + ) and o(v + n-» /x~ + n + w*)/ 

v(v+p- +P + tt + ) with the corresponding theoret¬ 
ical predictions for the invariant-mass interval 
W ^ 1.4 GeV. The agreement is seen to be gen¬ 
erally satisfactory. In Fig. 2 we compare the 
area normalized theoretical invariant-mass dis¬ 
tribution for v+p-* y.~ +p + TT + [including 0(q) cor¬ 
rections from Eq. (43c)] with the corresponding 
ANL experimental histogram 33 ; the agreement in 
this case is excellent. In Fig. 3 we give the same 
comparison for the reactions 34 v + w-* ii~ +p + tt° 
and v + n-+ ii +n + n + . The agreement is again sat¬ 
isfactory. In general, the comparisons given above 
suggest that the pion-production model developed 
in Sec. IIIC should be reliable to better than a 
factor of 2 in the region at and below resonance. 
The reliability should be substantially better than 
this for relative cross-section ratios or reactions 
without large 0(q) corrections. 

B. Threshold neutral-current-induced pion production 

in the ANL flux 

We consider now the application of the formulas 
developed in Sec. HI to the study of neutral-cur- 
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TABLE IV. Comparison of theoretical predictions for charged-current pion final-state 
ratios with preliminary ANL experimental results (Ref. 32). 


Ratio 

Experiment 

Without 0(g) 

Theory 

With Oig) 

Eq. (43c) Eq. (43d) 

(TO'+rt—' +7^) 

<r(v+p — + 7r + ) 

0.27 ± 0.06 

0.34 

0.29 

0.31 

O’O'+rt —+ » + 7T + ) 
(T(V+p-~lT + p + 7T + ) 

0.31± 0.07 

0.26 

0.23 

0.22 


rent-induced threshold pion production in the ANL 
neutrino flux. 4 We start with the general six-pa¬ 
rameter neutral-current structure inEq. (18), 
but we note that since the isoscalar axial currents 
contribute only 35 through the ^i 0,a) y X y 5 term in 


their nucleon vertices, the effective number of 
parameters entering the pion production calcula¬ 
tion can be reduced to 5. These are conveniently 
introduced by writing the one-nucleon matrix ele¬ 
ment of the neutral current as 


j 


{N[p s ) I Air I - 3 WUh) {[ - W* 2 )A 5 + *2 F \(* 2 ) Fl (* 2 ) o x 'k ,] i r, 

+ {-\D(k 2 )y x ys+\D(k 2 )y' +iA s (2M Af )" 1 £>(fe 2 ) ff ,1 * , ( 44 ) 


with D(k z ) a dipole structure characterizing the 
isoscalar current vertices, which for definiteness 
we take as 

D(fe 2 ) = (l -feVM* 2 ) -2 . (45) 

To a first approximation, we expect 36 that small 
changes in the isoscalar dipole mass parameter 
from the assigned value of Af* can be compensated 
by making appropriate rescalings of the isoscalar 
parameters Ag 4(5 . In terms of the parameters 
the couplings Svo.s.a.S'Ao.s.a intro¬ 
duced in Eq. (18) are given by 

£K 0 (f) l/Z + ^K 8 (£) l/2 ^ SVs~^2 t 

(46a) 

&Aq( f) 1/fZ + 5 a8(3‘) 1 ^ 2 “ » 


with g* an effective isoscalar axial-vector renor¬ 
malization constant defined by 


. ^n(l) l/2 gl 0) (O) + ^ fl (l) l/2 gl 8) (0) 


(46b) 


In terms of these definitions, the deep-inelastic 
constraint of Eq. (36) becomes 

1.5& 3R„+R|7^ A^+Aa 2 , (47) 

while the stronger constraint of Eq. (37), which 
follows when quark-parton -model information is 
used, takes the form 

1.5 s ZR v +Rv=X 1 2 +X 2 2 + yty (48) 


Equations (15) and (47), or (15) and (48), are the basic 
constraints which will be imposed in maximizing 
Oabin ove r the space of parameters A lt . .., A s . 

Obviously, to recompute the pion-production and 
neutrino-proton-elastic-scattering cross sections 
for each distinct set of parameter values being 
studied would be a very inefficient procedure from 



7T + p Invariant Mass (GeV) 

FIG. 2. Comparison of the area-normalized theoreti¬ 
cal invariant-mass distribution for v +p — v~ +p + 7 r + 
[calculated with 0 ( 3 ) additions from Eq. (43c)] with the 
ANL histogram for this reaction. 
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fJL~ 7T + n 


in 

CVJ 


c n 
H- 



{JT 7T°P 
FV°> 500 MeV/ r 



FIG. 3. Comparison of the area-normalized theoretical 
Invariant-mass distributions for v + n — + n + ft* and 

v + +p +7T° [calculatedwith 0(q) additions from 

Eq. (43c)] with the corresponding ANL histograms. The 
theoretical predictions have been folded with the experi¬ 
mental invariant-mass resolutions of 25 MeV for n + 7 r + 
and 40 MeV for p + 7 r°. 


a numerical point of view. Rather, we exploit the 
fact that the cross sections are quadratic forms 
in the parameters taking the form 

s 

<*2 bin /iO' 38 cm 2 = X! L 

,ml lsi *' (49) 

5 

v AUL (v+p-v+p)/10-™ cm 2 = 53 S E u x i x Ji 

1*1 i<><i 

so it is only necessary to perform the cross-sec¬ 
tion calculation for the 15 parameter sets 


X| - 0, i ^ If *J ) 

> 1 5 

X, = 1, X,-l \ 


(50) 


to extract the coefficients 4 P iJt E u , which are 
tabulated in Table V. The quadratic forms of Eq. 
(49) are then used to compute the cross sections 
when searching over parameter values, permitting 
a complete survey of the five-parameter space 
using a very reasonable amount of computer time. 


TABLE V. Coefficients determining and 
<r ANL (v + p-~ v + p) via the quadratic forms of Eq. (49). 
Note the comment in Ref. 4. 


Pion-production 

coefficients 

Elastic-scattering 

coefficients 

Pu 

0.62 lx 10' 3 

E n 

0.692 xlO" 1 

P 22 

0.807 xlO** 3 

E 2 2 

0.767 xlO' 1 

P 33 

0.163xl0' 3 

^33 

0.478 xlO" 1 

P 44 

0.244 xlO" 4 

£44 

0.364 xlO' 1 

^55 

0 .121X10' 4 

^55 

0.300 xlO" 2 

P 12 

0.534xl0' 3 

E 12 

0.656 xlO' 1 

^13 

0.772xl0" 6 

E l3 

0.115 

^14 

0.166 xlO' 4 


0.158 xlO -1 

P IS’ 

-0.211X10' 4 

£l6 

0.159 xlO' 1 

Pn- 

-0.393 xlO" 4 

#23 

0.554xl0' 1 

■I 3 24 

0.143 xlO" 3 

E 2i 

0.858xl0' 1 

Pzs 

-0.312 xlO' 4 

E Zb 

0.201 xlO' 1 

^34 

0.328 xlO' 4 

e m 

0.134 xlO" 1 

^35 

0.532xl0" 4 

E 35 

0.134 xlO" 1 

^45 

0.996xl0" 5 

E 45 

0.229 xlO' 2 


The results, for various assumptions about the 
structure of the weak neutral current, are as fol¬ 
lows: 

(1) Pure isoscalar weak neutral current. Taking 
X t =X 2 =0 and maximizing over the X 3 , X 4 , X 5 
subspace subject to the constraint of Eq. (15) gives 
the upper bound 

l.OxlO' 41 cm 2 . (51) 

(2) Weinberg-Salam SU(2)<&-U(1) model. In the 
simplest, one-parameter version of this model, 37 
the neutral current has the form 

si = S 3 X - S a 5X - 2*(S X +3-/ 2 S x ) +A3\ 

(52) 

x- sin 2 0 w , 

with AeJ x an isoscalar V-A strangeness and 
“charm” current contribution which is convention¬ 
ally assumed to couple only weakly to nonstrange 
low-mass hadrons (such as the low-energy pion- 
nucleon system). Neglecting for the moment 
we can make an absolute calculation of the cross 
section for v +«- v +p + ?r', giving 

<Cn =0.75xl0- 41 cm 2 . (53) 

In certain extensions of the original Weinberg- 
Salam model, the neutral current has the general 
form of Eq. (52), but with an adjustable strength 
parameter k in front, 

$h=k[5$ -2x(3 r J+3- 1/2 ff 8 > )]+A5 x . (54) 

Still neglecting and comparing with Eq. (44), 
we now see that the parameters X^ have the values 
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*i=k* 

X 2 =*(l_2x), X 4 =-2 kx, (55) 

X 3 =0. X 5 ^0.24kx. 

Maximizing over the k , x parameter space 
(allowing all real values of x, rather than restrict¬ 
ing x to lie between 0 and 1) subject to the con¬ 
straints of Eqs. (15) and (47) gives the upper bound 

<* 2 A bin « 1.5X10' 41 cm 2 . (56) 

Finally, we can include the isoscalar addition AoJ x 
by regarding X 3 , X 4 , X 5 as free parameters, rather 
than relating them to k and x as in Eq. (55). 
Searching now over the five-parameter k , x, X 3 , X 4 , 
X 5 space (again allowing all real values of x) sub¬ 
ject to the constraints of Eqs. (15) and (47) gives 
the upper bound 

(T 2 A bin<4.6xlO' 41 cm 2 . (57) 

We emphasize that Eq. (57) is the upper bound on 
^2 bin f or the most general hadronic neutral current 
formed from the usual vector and axial-vector 
nonets. If Eq. (47) is replaced by the stronger 
constraint of Eq. (48), and if the parameters g 
s X 5 /X 4 and are restricted [as suggested by the 
quark-model 16 ,24 values of Eq. (39)] by 

1*1 *1.5, 1*11*0.74, (58) 

then the bound in the general F,A case is substan¬ 
tially reduced, to 

ff 2 A bin* 1.5x10'- 4I cm 2 . (59) 


C. Analysis of low-invariant>mass (W < 1.4GeV) pion production 
at BNL: Isoscalar current case 

We turn finally to an analysis of low-invariant- 
mass (W « 1.4 GeV) pion production in the BNL 
neutrino flux. 38 Recently, the Columbia-Illinois - 
Rockefeller collaboration at BNL has reported a 
measurement of the ratio 

p/ o(v +T-V +7T° + * * *) 

°~ 2u(y+T- t i-+n a + ’ ■ •)’ (60) 

T=±( 6 C 12 ) + i( 13 Al 27 ), 
with the preliminary result 3,39 

Ro = 0.17±0.06. (61) 

This measured value of R£ is in accord with the 
value expected 40 in the Weinberg-Salam model 
when sin 2 #* is in the currently favored range of 
0.3-0.4. Hence if( 3, 3) resonance excitation, 
which is expected in the Weinberg-Salam model 
(see Fig. 4), is observed in the BNL experiment, 
the presumption would be strongly in favor of the 
standard gauge-theory interpretation of neutral 
currents. However, preliminary BNL invariant- 


mass spectra 3 for jt° production in the charged- 
and neutral-current cases show a clear (3, 3) peak 
in the charged-current case, but indicate no com¬ 
parable peaking in the neutral-current reaction, 
suggesting that perhaps the (3, 3) resonance is not 
excited by the neutral current. In what follows we 
analyze the implications for neutral-current struc¬ 
ture if this indication is confirmed both by more 
detailed analysis of the BNL data and by other ex¬ 
periments. 

Since the isovector V and A neutral currents 
both 41 strongly excite the (3, 3) resonance, the ab¬ 
sence of a (3, 3) peak in the V, A case would sug¬ 
gest an isoscalar neutral-current structure, and 
we assume this in what follows. Applying nuclear 
charge-exchange corrections as described in Ap¬ 
pendix C, we find that the nuclear target ratio 
quoted in Eq. (61) implies the free-nucleon target 
ratio 

= q BNL (i/ + n - y + n +7T°) +q PNL (i/ +p-v+p +7T°) 

0 a BNL (y +« — ii~ +p +77°) 

= 2/2'x 1.4 = 0.48± 0.17. (62) 

Let us now compare the experimental result of 
Eq. (62) with theoretical predictions obtained from 
the extended pion production model developed in 



FIG. 4. Area-normalized, BNL-flux-averaged 
theoretical invariant-mass distribution in the Weinberg- 
Salam model (with sin 2 #* =0-35) for the reaction v +p 
v +p + jr®. Curve a, basic model containing Born, 
resonant, and soft-pion terms; curve b, basic model 
with 0 ( 4 ) additions from Eq. (43c); curve c, basic model 
with 0(4) additions from Eq. (43d). 
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Sec. IRC, in the case of a pure isoscalar neutral 
current. Again we parameterize the neutral cur¬ 
rent as in Eq. (44), with = X 2 = 0. The BNL flux- 


averaged pion production and elastic cross sec¬ 
tions are then quadratic forms in X 3 , X 4 , X 5 , taking 
the form 


[a BNL (y+w-y+w+7r 0 , W * W M ) +u BNL (v + p 


i/+p+ir°, W^lM/10- 36 cm 2 =y; X) p iA w *)*i*J. 


a BNL (y + p —v + p r Cundy cuts)/10' 36 


cm 2 = 


X X E< h )x i x J> 

1 = 3 3 


a BNL (y +p- v +p t no cuts)/10" 36 


cm 2 = 


2 2 E u x * ] 

1*3 3***1 


j* 


(63) 


Cundy cuts: 1 GeV^E^4 GeV, 0.3 (GeV/c) 2 « \k 2 \ < 1 (GeV/c) 2 . 


The pion-production coefficients P is (for W„= 1.2, 1.3, and 1.4 GeV) and the cut and uncut 42 elastic-scat¬ 
tering coefficients E u required in Eq. (63) are tabulated in Table VI. In maximizing the pion-production 
cross section over the space of parameters X 3 , X 4 , X 5 , we impose the constraints 


CT BNL (y+/>- v+p, Cundy cuts) ^ 0.24a BNL (^ +n-v ' +p r Cundy cuts) * 0.085 x 10' 36 cm 2 , 


1.5 ^ 3R„ +R„ - 


+ ^X 


2 

4 ’ 


(64) 


the first of which is the Cundy et al. 4a 95% confi¬ 
dence-level limit from the CERN neutrino experi¬ 
ment, which has a neutrino flux similar 43 to that 
of the BNL experiment, while the second is the 
deep-inelastic quark-parton-model constraint of 
Eqs. (37) and (48) above. The results of the maxi¬ 
mization are expressed as theoretical upper 
bounds on the ratio 2R 0 defined in Eq. (62), with 
both the numerator and the denominator calculated 
from the pion production model. As stressed in 
Sec. IV A, the procedure of comparing theoretical 
cross-section ratios with experimental ratios 
should minimize the effects of discrepancies be¬ 
tween the experimental and theoretical cross-sec¬ 
tion magnitudes. For the denominator cross sec¬ 
tion a BNL (i^+w- p.' +P + n°, 1.4 GeV) we use the 

value 0.143x 10” 36 cm 2 listed in columns 6 and 7 of 


Table III, corresponding to inclusion of 0(g) cor¬ 
rections; however, as is apparent from the table, 
the effect of the 0(q) corrections on this cross sec¬ 
tion is very small. 

Results of the maximization calculation 44 are 
given in Figs. 5 and 6. Curve a of Fig. 5 gives 
the maximum for an isoscalar pure vector current, 
while curve b gives the maximum when an octet 
isoscalar axial-vector current is also present 
(corresponding to^® =0.45), both plotted versus 
the isoscalar current gyromagnetic ratio ^=X 5 /X 4 . 
Evidently, both curves lie below the BNL data, 
with a discrepancy exceeding a factor of 2 unless 
\g\z 4-6, that is, unless the isoscalar vector cur¬ 
rent has a |^| value which is anomalously large 
based on quark-model expectations. 24 Interesting¬ 
ly, a vector current with a large |^| value pro- 


TABLE VI. Coefficients determining a BNL {v + n — v + n +7r°) + o bnl {v + p — v+p +7r°) for 
various W ranges and <r BNL (v +p — v + p), both cut and uncut, via the quadratic forms of Eq. 
(63). 



Pion-production coefficients 

Elastic-scattering coefficients 


W ==1.2 GeV 

W £ 1.3 GeV 

W S 1.4 GeV 


Cuncty cuts 


No cuts 

^33 

0 . 210 x 10 “* 

0.607x10”* 

0.109 xlO ” 1 

4° 

0.176X10' 1 

4' 

0.555X10 ' 1 

Pa 

0.286x10'* 

0.638x10“® 

0.994x10”® 

4 > 

0.158X10' 1 

B® 

0.515 x 10 " 1 

P 55 

0.193x10'® 

0.566X10" 3 

0.106x10”* 

4 > 

0.265x10“* 

E® 

0.480x10'* 

^34 

0.247x10“* 

0.925 xlO " 3 

0.165x10'* 

4> 

0.558X10” 2 

E® 

0.105X10” 1 

^35 

0.467x10”* 

0.138X10" 2 

0.258x10”* 

17 ( 4 ) 

^35 

0.558X10” 2 

B® 

0.105 x 10 ” 1 

^45 

0.169x10'* 

0.519x10^ 

0.941x10”* 

E* (1) 

■£-45 

0.635 xlO -3 

B® 

0.142x10”* 
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FIG. 5. Results of a maximization calculation for BNL 
cross-section ratios in the invariant-mass interval 
W s 1.4 GeV, plotted versus the g value of the isoscalar 
vector current. Curve a is the maximum for an iso¬ 
scalar pure vector current; curve b is the maximum 
when an isoscalar axial current is also present, with the 
axial-vector renormalization constant fixed at ^ = 0.45 
(the octet axial-vector current value), The dashed line 
is the central experimental value from Eq. (62). 


duces a characteristic change in the da/dW plot 
predicted for the BNL flux, as shown in the dashed 
curve in Fig. 7. [The dashed curve is calculated 
for the case of an isoscalar vector current con¬ 
taining only an F 2 form factor; for the F lr F 2 ad¬ 
mixture corresponding to |^| =4, the curve is sub¬ 
stantially the same. Similarly, changing the ad 
hoc dipole mass in Eq. (45) from M N to M N f2 or 



FIG. 6. Results of a maximization calculation for 
BNL cross-section ratios in the invariant-mass interval 
W ^ 1.4 GeV, plotted versus the effective renormaliza¬ 
tion constant g% of the Isoscalar axial-vector current. 
Curve a is the maximum for an isoscalar pure axial- 
vector current; curve b Is the maximum when an iso¬ 
scalar vector current is also present, with g value 
fixed at - 0 .12 (the quark model and octet vector current 
value). The dashed line is the central experimental 
value from Eq. (62). 


L. ADLER 12 



FIG. 7. Shapes of do/dW for an isoscalar vector 
current containing an F t term only or an F 2 term only. 
The two curves are normalized to equal area for W s 1.4 
GeV. 


M N /f 2 produces only a 2% change in the dashed 
curve.] As seen in the figure, an isoscalar vector 
current with large |^| produces, relative to the 
pure Fi case, 45 a depression in the do/dW distri¬ 
bution for small W t characterized by an almost 
linear rise from threshold, and a corresponding 
enhancement at the large W end of the range. An 
experiment with good statistics should be able to 
search for this effect. Continuing with the results 
of the maximization calculation, curve a of Fig. 6 
gives the maximum for an isoscalar pure axial - 
vector current, while curve b gives the maximum 
when an isoscalar vector current is also present 
(with gyromagnetic ratio fixed at the quark-model 
value of -0.12), both plotted versus the effective 
axial-vector renormalization constant Devia¬ 
tion of from the octet value of 0.45 of course 
requires the presence of a contribution from the 
SU(3)-singlet axial-vector current. The curves 
again lie below the BNL data, with a discrepancy 
exceeding a factor of 2 unless 1^1 ^ 1-5, which 
would imply a sizable ninth axial-vector current 
contribution and a relatively large ninth current 
renormalization constant as compared with quark- 
model expectations. 24 

To summarize, if the observed BNL neutral- 
current pion production rate is to be interpreted 
in terms of isoscalar V f A currents, then existing 
elastic and deep-inelastic constraints require that 
the neutral current contain either a vector current 
with anomalously large |^| value, or an apprecia¬ 
ble coupling to the ninth [SU(3) singlet] axial-vec¬ 
tor current. 

We conclude by briefly mentioning two other 
qualitative features of an isoscalar V,A neutral 
current which may help to distinguish it from al¬ 
ternative phenomenological neutral-current struc- 
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tures. First, referring to Table VI we note that 
the V,A interference terms in up elastic scatter¬ 
ing and in weak pion production (for W ^ 1.4 GeV) 
are given by 
elastic scattering; 

V,A interference (positive) 

X ^3^4(1 + g), 

weak pion production; 

V,A interference (positive) 

x X 3 X 4 (0.64 +g) r 

g = *s a 4 . 

Hence, except for the small range of isoscalar 
gyromagnetic ratios 

-1 ^ -0.64, (66) 

the interference terms in neutral-current elastic 
up scattering and weak pion production have the 
same sign. That is, except for g values in the 
range of Eq. (66), constructive V,A interference 
in u +N- u +N + 7 t implies constructive interference 
in u +p - u +p and vice versa. A second useful re¬ 
mark (which has been made by many authors) is 
that if u and u neutral-current cross sections dif¬ 
fer (implying the presence of V,A interference ef¬ 
fects in a V,A current picture), then the neutral 
interaction may induce parity-violating terms in 
the pp t ep r and i±p interactions. The significance 
of this statement is that the same connection be¬ 
tween u, u cross-section differences and parity- 


violating effects does not hold in other neutral- 
current phenomenologies, such as the S,P , T cur¬ 
rent picture to be discussed in the second paper 5 
of this series. 
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APPENDIX A 

We give here the analog of the low-energy 
theorem of Eq. (12) for the case of the SU(2) 

<8> U(l)-model neutral current of Eq. (52), with 

(The following formulas still neglect the 
pion mass in the kinematics and so were not 
used in the numerical work described in the text.) 
The threshold pion production cross section is 
given by 


1 do(u + N— u+N+tt) 

m didW 


T= (2 M/) 


2 \ — 1 


threshold 

t ' -> 


- G 2 1 / Sr 

1677 3 E 2 \2M lf 2 


1 + 


t 




, t \' 2 ~ 

1 + ---I T, 


1+ 4M?) ^i 2+ ^ 2 ) + (tf< 2+ G/ 3 2 ) 


2 M z 

[4M /l 2 E 2 -t(M lf 2 + 2M M E)} 


+ t 


H 2 + n 1 + 


4ilf„ 


H, 


2 11 S 




N 


with £ = 1 (-1) for u (i/)-induced reactions, and with 

H l = d E -\l - 2x)2M„ (l + + JL iF.m + 2M N F z m, 

H 2 = -F 1 (k 2 ) + ^ f -F 2 (k 2 ), 


H 3 = 4'^ W) + 2M h F!<?)] . 




„ _ a+ t/2M„ 2 ) 

* E gA (1 + t/ml 2 ) 

FiA#) = ^r. 2 (^)[4 +) - 4" ) ]U - 2x) + Fl 2 (k 2 )a ( B a H~2x), 

gA0> 2 )=gA<^){^E ) ~ 4'’] • 


(Al) 


(A2) 
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TABLE VII. Iso spin coefficients appearing in Eq. (A2). 


a 




a 




+ />-M _ I + P 
v I \ v 


1 

2 


0 


1 

1 


V ' n —l V Wn + 71® 


1 

2 


0 


.1 

2 



V \ I V . ^ 

+ n — [ + p + it 


‘/2 


0 —75” 


1 
yff 



v Vp—'I v l +n + Ir+ o 

T> 


A. 

/2 


jl 

/2 


As in the text, we have used the abbreviations 
/ =-fe 2 , x=sin 2 0^. The isospin coefficients 
are given in Table VH. As the diligent reader may 
verify, in the case of t r° production (for which the 
low-energy-theorem equal-time commutator van¬ 
ishes) Eqs. (Al) and (A2) reduce to Eq. (12) of 
the text, with dn(y+N— v + N)/dt appropriate to the 
Weinberg-Salam-model case of Eqs. (21)—(23). 
Since Eqs. (Al) and (A2) were obtained by alge¬ 
braic reduction from the Born-approximation ex¬ 
pressions of Ref. 6 , this agreement provides a 
cross check on the extensive algebra involved in 
the extended pion-production model of Sec. IHC. 


Since the over-all magnitudes of the leading terms 
in the axial-vector soft-pion amplitude in the 
isovector and the octet isoscalar cases are gov¬ 
erned respectively by g A and a convenient 
measure of the importance of the 0(q) correction 
in the isoscalar octet axial-vector case, relative 
to its importance in the isovector axial-vector 
case, is given by 


R s 


-ZW7ZT 


0.16—A”""-* w 
bv b 


(B3) 


To estimate the derivative appearing in Eq. (B3), 
we assume an unsubtracted dispersion relation 47 


w°N -*■ 17 * 


=- r +- 7 -^ 

IT \x -x x' + x + 2u 


B 


xlmA f " a, -* w , 


(B4) 


We approximate the integral by supposing A’' 0 * - *''* 
to be dominated by those partial waves containing 
resonances which couple t t°N strongly to tj#. Re¬ 
ferring to the Particle Data Group summary , 48 we 
see that the only such resonances are the #*(1535) 
G/M') and the #*(1470) Writing the 

partial-wave expansion for A ir0N ‘~ 1tN and keeping 
only the / 0+ and partial waves to which the 
#*(1535) and #*(1470) respectively contribute, we 
find 49 


Tm , r°*->,* MW + M„) Im 

[(/>,„+ m *)(/> 20 + m J¥ )] 1 ' 2 


APPENDIX B 

We give first a rough estimate of the 0(q) cor¬ 
rections in the case of an isoscalar octet axial- 
vector current. We start from the analogy 
7T° — SU(3)-3 index, tj-— SU(3)-8 index and the fact 
that A ff * (+) = A/ 0 * - *" 0 *, which allows us to write 48 
(in units with = 1 ) 

Isovector: 


l( P w -m„)(P* 0 -m m )\ 1/z ’ 

(B5) 

which is independent of v B . So in the approxima¬ 
tion of dominance by the #*(1535) and #*(1470), 
we have 


AAi +) = 9 A ff0 ^~ >Tr °^ 

^ “ gr 


® 2.8 , 


Octet isoscalar: 


(Bl) 


3 JP 8 v B 


3 A r0fl 


17* 


Here g r and^^ Tl) are, respectively, the tt°NN and 
the 17 ## coupling constants, which according to 
octet PCAC and SU(3) are related by 


gj^ _ g^ _ 3-(4x0.66) 
St ~ Sa ~ /3 


(B2) 


—ImA”""- ,w = 0. (B 6 ) 

3 V B 

Hence, only the derivative of the explicit v B in 
Eq. (B4) contributes, and we find 



TT 0 *-* 17* 


0 



dx f 
(. X'f 


ImA'^^^U', v B = 0). 


(B7) 

Substituting Eq. (B5) into Eq. (B7), we find the 
bound 
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d An°N-+TiN 

bv B 


2 f* dx r 4 tt(W' +M n ) 

* * J, a (*')* [(p[o+M „)(p!, 0 +M N )\ m 


Im / 


0 + 



+ 



dx r 

(X'f 


4 ir(W'-M m ) 

[(Pio-MM —M n )] 1/2 


|lm/ 1 w ? r - w (x , )| 


(B 8 ) 


To proceed further, we use resonance dominance to approximate the inelastic amplitude imaginary parts 
appearing in Eq. (B 8 ) by 


Im/ 


0 + 



Im f?."-"' M (x')* ^Im/;!*- , 

8i- 


(B9) 


with g TI * ir0 the r j, tt° couplings to the resonance in question. Using the optical theorem to evaluate the right- 
hand side of Eq. (B9), 


Im / 


ir 0 N-+v°N 



® n°N i 


(BIO) 


combining Eqs. (B8)-(B10) in the narrow-resonance approximation, and expressing the coupling ratios 
g v /g n in terms of resonance partial widths r^o and q values q^^, yields the final formula 



2 

n \q„ / r„ 

dv B A 

* 

[(P,o+M H )(p 2a +M H Y'‘> \r w . 


2 

+ — 
IT 


111 


1/2 


/ v**»dx 


1/2 






-!**( 1535) 
i/2 

o N dx 


J 0*1 


J tf*(l470) 


(Bll) 


Remembering that the branching ratio of an /= j 
state into it 0 relative to all pionic modes is i, 
and taking I^J-182 MeV for the nominally for¬ 
bidden decay N*(1470) - Nq (corresponding to res¬ 
onance half maximum), we find numerically that 
the ratio R defined in Eq. (B3) is given by 

R ^ Rtf *(1 535 ) + Rn+( 1470 ) “0.014 + 0.074 

= 0.09. (B12) 

Hence the 0(q) corrections appear to be substan¬ 
tially less important in the isoscalar octet axial- 
vector case than they are in the iso vector axial- 
vector case, and so we neglect them. We simi¬ 
larly neglect the 0(q) corrections in the unitary 
singlet axial-vector amplitude, although an anal¬ 
ogous argument is not possible in this case since 
the ninth axial current does not satisfy a PCAC 
equation . 24 We caution 47 in closing that the above 
argument is very crude at best, particularly 
since the N*(1470) contribution to Eq. (Bll) de¬ 
pends as | <7 J ~ 3/2 on the q v value assumed for the 
N*(1470)-Mf mode. 

-—--- ,i 


r 


We next estimate the extent to which the order -0 
corrections of Eq. (43a) are already included in 
the basic pion-production model as a result of 
unitarization of the (3, 3) multipoles. Using the 
fact that at fe 2 =0 the electric and longitudinal (3, 3) 
multipoles are approximately related by 6 

(B13) 


a simple calculation shows that 



1 

•"-y-'-rr— 

_ 4 S\V 2) - s[y 2)B 


(B14) 


where as in the text the superscript B indicates 
the Born approximation. To evaluate the right- 
hand side of Eq. (B14), we employ the partial- 
wave dispersion relation satisfied by which 

[using Eq. (B13) and making a static nucleon ex¬ 
pansion through terms of order M* “*] takes the 
simple approximate form 


(W + M„)S^ _ (W + M„)S[V 2)b 1 r- r +M y )Im<S ( 1 2/2) ' 

wojgi " wo 1+ I5I + w i/ir L w'oi + |5'| 


1 11 
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1 1 
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J \co / — co 36Af* 9 w'+fjj/* 


(B15) 


oj W M n , 0 1+ = [(Pio + M/f){p 2 o + M/f )] 1/2 . 

Hence we get 


5 (3/2) -<^ 3 ' 2,B 
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IT J u 


du>' 


M„ ( W'+M s ) ImS'y 2) ' 

yr o [ + |q'| 


10 1 11 
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J\0 w' 36M 


(B16) 
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integrating the right-hand side numerically, using 
Eq. (40.22) of Ref. 6 for S'?' 2 ’, gives -0.63 in 
units in which = 1. Finally, as a point of con¬ 
sistency, we note that a simple calculation shows 
that Eq. (B16) makes no contribution to the ampli¬ 
tudes [Aa _) -A. 2 " )b ]| 0 and [A^ -A^"* 3 ]! 0 which 
are determined by the zeroth-order PCAC rela¬ 
tions of Eq. (42). 

APPENDIX C 

We give here the nuclear charge-exchange cor¬ 
rections calculation needed to extract the free- 
nucleon target ratio of Eq. (62) from the measured 
value of Eqs. (60)— (61). We use the averaged 
recipe of Eq. (24) of Ref. 40, as extended 60 to the 
case of nuclei with a neutron excess. In order to 
simultaneously treat all of the nuclei of current 
experimental interest, 61 we have performed the 
calculation outlined in Sec. IIB of Ref. 50 using 
a simple “uniform-well” parameterization of the 


nuclear density, characterized by a well radius 
R , a nucleon density p, and an rms charge radius 
a, given by 52 

A 

P= UR*’ 

R = (\) uz a, (Cl) 

a ~ (0.82i4 1/3 + 0.58) F . 

For each nucleus of interest we have calculated 
two W-averaged charge exchange matrices, one 
(labeled resonant) appropoiate to the (3, 3) domi¬ 
nated BNL cross section for v+p~ p'+/> + 7r + , the 
other (labeled nonresonant) appropriate to the 
dofdW curve labeled “Isoscalar pure in Fig. 7. 
The results are summarized 53 in Table VIII. In 
the case of the/ = 0 nucleus 0 C 12 , the resonant 
matrix of Table VIII implies averaged charge- 
exchange parameters a =0.812, 5 = 0.137, c 
= 0.0392, in satisfactory agreement with the val- 


TABLE VIII. Resonant and nonresonant averaged nuclear charge-exchange matrices for 
low-invariant-mass {W^- 1.4 GeV) weak pion production. Ihe matrix elements are to be read 
according to the scheme 
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ues 3=0.811, d = 0,138, € =0.0450 given in Table 
VII of Ref. 40 and obtained by using a “harmonic- 
well” parameterization of the nuclear density. 
Given the matrices [/ r ] of Table VIE, observed 
pion-production cross sections are related to 
free-nucleon cross sections by the following 
recipe: Let the experimental target contain the 
mass fractions f T of the nuclear species with 
Z=Z Ti A = A t . Then the observed cross section 
per nucleon is given by 


~a (obs; tt + )“ 


• 


cr(obs; O 

- y^/rUr] 

<r(N r ;0 

(C2) 

_o(obs; tt~)_ 

T 

_<r(N r ;0_ 



with N t an effective free-nucleon target given by 

"'-iM'-S)”- (C3 > 

As an illustration, we apply Eq. (C3) in the BNL 
case of a mainly carbon and aluminum target. 
Assuming charged-current pion production to be 
purely resonant, and neutral-current pion produc¬ 
tion to be purely nonresonant, we have 

P / = g(obs;7rV) _ 

0 2o(obs;7rV~) * 

a(obs; ^ / r 2 > 

r*c.Aj js + > o r - 

(C4) 


(r(obs;7rV')= £ f T £ [^r ’ ]«w a ( N r', ** H~) ■ 

r=c.Ai ,/=+.o.— 

Using the BNL target fractions f c = f M = | and 
assuming an isoscalar neutral current, which 
implies 


o(n\ tt'V) = a(p\ ir°u) = %o(n; w~v) 

= ?o(p't v) t (C5) 

Eq. (C4) reduces^ after some simple algebra to 54 
_ o(n + p;ir°v) 

Zi((i • ^ 

a(w;7rV ) 

= 2RJx0.727(l + 0.22r 1 + 0.23r 2 ) (C6) 

with r 1<2 the charged-current it + to tt° ratios 




V * - \ c 7 • t — / y s 

1 o(n; tt°ii ) 9 2 


= q(«; t rV) 
a(w; t rV")' 


(C7) 


Direct measurements of r 12 in the BNL flux are 
unavailable, so we have either to use theoretical 
values for these ratios, or to extrapolate them 
from the ANL measurements, neglecting possible 
variations with neutrino energy. The theoretical 
cross sections tabulated in the fourth and fifth 
columns of Table III give, respectively, 

r 1 = 2.91, r 2 = 0.88 without 0(g) corrections, 

(C8a) 

r 1 = 4.01, r 2 = 1.34 with 0(g) corrections , 


while preliminary ANL data give 
^ = 3.74*0.86, r 2 = 1.14±0.3 


(C8b) 

(C8c) 


Substituting into Eq. (C6), Eqs. (C8a)-(C8c) give, 
respectively, 

2 R 0 = 2R„x 1.59 from (C8a), 

2R 0 = 2R;xl.34 from (C8b), (C9) 


2R 0 = 2R' 0 x 1.52 ±0.15 from (C8c). 

A charge-exchange correction factor of 1.4 has 
been assumed in getting Eq. (62) of the text. 
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evaluation of Eq. (43a) we have used 3.70 and have 
taken the values of the pion nucleon amplitudes 

, (a/3i/ B ) J 4 TrA,(+) at the crossing-symmetric point 
from the tabulation of H. Pilkuhn et al ., Nucl. Phys. 

B65 , 460 (1973). The theoretical analysis leading to 
Eqs. (42) and (43a) is described in detail in Sec. V of 
Ref. 6. [See particularly Eqs. (5A.21), (5A.22), 

(5A.9), and (5A.30).] 

28 F. E. Low, Phys. Rev. 110, 974 (1958); S. L. Adler and 
Y. Dothan, ibid. 151, 1267 (1966). 

2fl The axial-vector form-factor dipole mass is 0.90 
GeV, while the dipole mass appearing in the vector- 
current Sachs form factors is 0.84 GeV. 

30 The experimental points are taken from Fig. 10 of 
B. Musgrave, Argonne National Laboratory Report 
No. ANL/HEP 7453 (unpublished). 
w The ANL result for Af x is Af x =(0.90± 0.10) GeV, and 
we have used the central value of 0.90 GeV in all of the 
numerical work. Increasing M A above the central value 
will bring the theoretical curves in Fig. 1 closer to the 
experimental points. For example, an M A of 1.00 GeV 
gives cross sections 6-9% larger than those in the 
figure. 

K The experimental values were obtained from B. Mus¬ 
grave, private communication. 

33 The histogram was taken from Fig. 2 of P. A. Schreiner 
and F. von Hippel, Ref. 11. 

34 The histograms were taken from Fig. 16 of B. Mus¬ 
grave, Ref. 30. 

35 The induced pseudoscalar from factor h A makes a van¬ 
ishing contribution to neutral-current reactions. 

36 This has been checked in one case, by comparing 
formulas obtained from the X 4 , X 5 terms of Eq. (44) 
with the corresponding formulas which are obtained 
when the nucleon isoscalar electromagnetic form fac¬ 
tors Ff 2 (fc 2 ) are used in the final two terms of Eq. 

(44). 

3? S. Weinberg, Phys. Rev. Lett. 19, 1264 (1967); 27, 

1688 (1972); A. Salam, in Elementary Particle Theory ; 
Relativistic Groups and Analyticity (Nobel Symposium 
No. 8), edited by N. Svartholm (Almqvist and Wiksell, 
Stockholm, 1968), p.367. 

38 The BNL flux table has been furnished to me by W. Y. 

Lee and L. Litt (private communication). 

39 The error ± 0.06 largely represents systematic un¬ 
certainties; the statistical error is considerably 
smaller (W. Y. Lee, private communication). 

40 S. L. Adler, S. Nussinov, and E. A. Paschos, Phys. 
Rev. D 9, 2125 (1974). 

4l For example, in the static limit the cross section for 
(3,3) excitation by 3j x is 0.202/0.263= 0.77 times that 
for (3,3) excitation by fFj 1 ; see S. L. Adler, Ref. 6; 

B. W. Lee, Phys. Lett.40B, 420 (1972). 

^The uncut elastic cross-section coefficients are not 
used in the maximization calculation, but are included 
for completeness. When no cuts are made, o BNL (i' +n 
—\T +p) =0.88 x 10 -38 cm 2 . 

43 D. C. Cundy et al ., Phys. Lett. 31B. 478 (1970). The 
CERN neutrino flux is given in D. H. Perkins, in Pro - 
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ceedings of the Fifth Hawaii Topical Conference in 
Particle Physics, 1973 , edited by p. N. Dobson, Jr., 

V. Z. Peterson, and S. F. Tuan (Univ. of Hawaii Press, 
Honolulu, 1974), Fig. 1.6. Note that the absolute mag¬ 
nitude of the flux is irrelevant in flux averaging—only 
the shape of the spectrum matters. 

44 A preliminary account of this discussion has been given 
in S. L. Adler, in a talk given at the 1975 Coral Gables 
Conference, ^Orbis Scientia II,” 1975 (unpublished), 
and IAS report. As a result of a programming error, 
the curves given in Fig. 1 of the Coral Gables talk are 
too high; the corrected curves appear as Fig. 4 of the 
present paper. 

45 The curve shown for the pure F j case is very similar 
to the curve obtained for S ,P,T coupling mixtures. 

See Ref. 5 for further details. 

4fl Here we are indicating octet isoscalar amplitudes by 
the superscript (tj)j note that in Ref. 6 they were de¬ 
noted by the superscript ( 0 ), while in the text of this 
paper they are denoted by the superscript ( 8 ), with ( 0 ) 
used to indicate singlet isoscalar quantities. 

47 0ur notation follows that of Sec. Ill of Ref. 6 , with 
x = (IF 2 —Af N 2 ) / (2Afjy). Note that writing an unsubtracted 
dispersion relation is not formally justified by a Regge 
analysis of A ^ since the leading Regge trajec¬ 
tory, the Aj trajectory, has too high an intercept for the 
integral in Eq. (B4) to converge. However, a similar 
use of subtracted dispersion relations coupled with 
resonance dominance arguments gives a correct esti¬ 
mate of the magnitude of the isovector correction of 
Eqs. (43) and (Bl), even though in this case also, the 
unsubtracted dispersion relation is formally divergent. 
Hence, our method in the isoscalar case is an heuristic 
one, motivated by methods which work in the isovector 
case. I wish to thank M. L. Goldberger for a helpful 


discussion of the Reggeology of the ir°N-~T}N amplitude. 

4 ®See the Appendix of S. L. Adler, Phys. Rev. 137 , B1022 
(1965). 

B °S. L. Adler, Phys. Rev. D 9 , 2144 (1974). 

51 The CERN Gargamelle group has data in freon (CF 3 Br), 
and a bubble-chamber run at BNL in neon has been 
proposed. I wish to thank P. Musset and C. Baltay 
for raising the question of extending the calculations of 
Ref. 50 to other nuclei. 

52 H. R. Collard, L. R. B. Elton, and R. Hofstadter, in 
Landolt-Bomstein .• Numerical Data and Functional 
Relationships; Nuclear Radii , edited by K.-H. Hellwege 
(Springer, Berlin, 1967), New Series, Group I, Vol. 2. 

5 S B. R. Holstein and M. M. Sternheim are currently 
studying pion production in nuclei induced by incident 
protons, using the multiple scattering model of Ref. 50 
and variants on the model which take nucleon recoil into 
account in a detailed way. This study should lead to an 
improved value of the pion absorption cross section 
<>ab»» which when available will be used to recompute 
the charge-exchange matrices of Table VIE. In cal¬ 
culating Table vm we have in the interim used the 
absorption cross section given in Eq. (27) of Ref. 40. 

B4 Under our assumption of an isoscalar neutral current, 
the simple recipe of Eq. (24) of Ref. (40) gives the 
formula 

2 R = 2 R'„ x (1 -2J) {1 + [<T/(1 — 2 J)](r, + r 2 )} . 

Taking the effective d for the BNL target as S d AL 
+ 0.16, this formula gives 

2 R =2Jy x0.68[l + 0.24(r, + r 2 )], 
a result very similar to Eq. (C 6 ). 
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Renormalization constants for scalar, pseudoscalar, and tensor currents ’ 11 

Stephen L. Adler, E. W. Colglazier, Jr., J. B. Healy, Inga Karliner, 

Judy Lieberman, Yee Jack Ng, and Hung-Sheng Tsao 
Institute for Advanced Study, Princeton, New Jersey 08540 
(Received 27 January 1975) 

We calculate the renormalization constants describing nucleon and pion matrix elements of scalar, 
pseudoscalar, and tensor (5, P, T ) current densities. For certain of the constants, expressions can be 
obtained using standard SU 3 and chiral SU 3 <8> SU 3 methods. To get the remaining constants, we employ 
the quark model with spherically symmetric quark wave functions to relate the S, P, T 
renormalization constants to known parameters of the usual vector and axial-vector (V, A ) currents. 

We also evaluate the renormalization constants using the MIT “bag" model quark wave functions. We 
summarize our results in tabular form, compare the results of the various cakulational methods used, 
and attempt to estimate the accuracy of our predictions. 


I. INTRODUCTION 

A number of recent papers have examined the 
possibility that neutral currents may involve sca¬ 
lar, pseudoscalar, and tensor (S,P, T) weak cou¬ 
plings in addition to or in place of the usually as¬ 
sumed vector and axial-vector ( V,A) Lorentz 
structures. In particular, expressions have been 
given for deep inelastic neutrino nucleon scatter¬ 
ing 1 * 2 (using the quark parton model) and for vari¬ 
ous low-energy nuclear correlations , 1 assuming 
a completely general Lorentz structure for the 
weak neutral current. In order to make phenomen¬ 
ological studies of S,P, T weak neutral couplings 
which simultaneously use deep-inelastic informa¬ 
tion on the one hand, and exclusive channel or low- 
energy nuclear results on the other, it is essen¬ 
tial to know the renormalization constants describ¬ 
ing the nucleon and pion matrix elements of the 
S,P, T current densities. The purpose of this 
paper is to estimate these renormalization con¬ 
stants by using various dynamical models of had¬ 
ron structure. Our results will be applied in a 

{N(p i )\S j \N(p 1 )) = ^ M u(p 2 )Fy\k > )t i u(p l ), 

(NipJ 3*1 N(p l )) = ^ H u(p 2 )Fy\k 2 )Y 5 t j u(p 1 ), 


subsequent publication to a detailed analysis, us¬ 
ing current-algebra techniques, of soft-pion pro¬ 
duction by a weak neutral current of arbitrary 
Lorentz structure. 

Within a general quark-model framework, the 
currents which we study have the form 
(for S,P, V,A, T structures, respectively) 

j ■= ip ip, 

if) = py 5 ^jip, 

= ipy x 2 

35 x = ^ r x r s 2 ^, 

; = 0.....8 

with i p being the quark field, cr Xl1 = (5 i )[y x , y^], 

Ao= (§) 1/a , and with A,. 6 being the usual SU 3 

matrices. For describing AS = 0 neutral current 
effects, only the j =0, 3, 8 components of the above 
nonets are relevant. We write the nucleon matrix 
elements of these components as 3 


<JV( p t )| 3 X \N( /»,)) = u( p 2 )[i*' > (fe a )y x + iFi> > (fe 2 )a x ” k J /, *(/>,), 
(N(p 2 )\ 5) x \N(p 1 ))=VC N u(p i )[g% ) (k 2 )y x y 5 +h l 2 ) (k 2 )k x y s ]t j u(p 1 ), 


(N(p 2 )\ J, Xo |tf (/>,)) = 3lv«(/>,) [ T»>(fe 2 )a 


x. iT¥W) ,x.„ .o 




(y x k°-y°k x ) + 


= 3l W M(/> a )[r}' ) (* 2 )<f XO + * f ll ' )(fe 2 ) (y X fe°-yVU 

L M N 


(pXfe °- p ° feX >] i) 

(o x "k v k°-o°"k v k x )\«,«(/>,), 



r» ) (jfe a ) s r| J) (jfe a ) + 2r» ) (jfe a ), 

Jk-/»,-/»„ P=P2+P X , 
t 3 =ir 3 , / 0 = i(§) 1/a , f 6 = ^) l/a . 


11 3309 
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In the above expression, t 3 is the nucleon Pauli 
isospin matrix and the spinors u(p 2 ), u(p x ) are 
understood to include nucleon isospinors. The 
vector and axial-vector form factors defined 
above are related to the standard nucleon form 
factors by 

^(* 2 )= F r li2 m, 

(3) 

F\^(k 2 ) = 3Fl 2 (k 2 ), AW-fcxtt*). 

The nonvanishing pion matrix elements of the sca¬ 
lar, pseudoscalar, and tensor currents are 

<«*(/»,)1 3, 1 ApJ) = t„ j = 0, 8 

<«*(/»,)! jf 3 Xo I **(/»,»- a, T b —'* ab 3 (P x * e -P°k x ), 

M n 

(4) 


31 


tr 


1 

( 2 /> 10 2 /> 20 ) 1/2 ’ 


C 123 = l . 


Our analysis will give values at k 2 =0 (and, in 
certain cases, first derivatives at f^-O) for the 
various form factors which appear in the above 
expressions. Effectively, the fe 2 =0 values are 
the strong interaction renormalization constants 
describing scalar, pseudoscalar, and tensor den¬ 
sity couplings to nucleons and pions. 

Two principal calculational methods are used 
in what follows. First, values for certain of the 
renormalization constants can be obtained by us¬ 
ing standard SU 3 and chiral SU 3 <S>SU 3 methods. 

For the remaining constants, we use the quark 
model with spherically symmetric quark wave 
functions to relate the S,P, T renormalization 
constants (and certain first derivatives at k 2 =0) 
to known parameters of the usual V,A currents. 

We also give a direct calculation in the quark 
model using the specific quark wave functions 
obtained in the MIT “bag” model. Our calcula¬ 
tional procedures are further briefly described 
in Sec. n below. Results of the computations are 
tabulated in Sec. Ill, while in Sec. IV we compare 
results obtained by the various calculational meth¬ 
ods used and attempt to estimate the accuracy of 
our predictions. 


II. CALCULATIONAL METHODS 


with3C 0 chiral SU 3 ®SU 3 symmetric and with 
tcUF 0 + c$ e ) a symmetry-breaking term. 4 The 
parameter k has the dimension of mass, while 
the parameter c is determined by the pseudo¬ 
scalar meson masses to have the value c«- 1.25. 
Since k is not fixed in the GMOR model, 5 we can 
only determine values of the scalar and pseudo¬ 
scalar renormalization constants relative to any 
one of them, say, relative to f r s 8) (0). 

We begin by getting relations for the scalar den¬ 
sity renormalization constants. Within the scalar 
octet, SU 3 symmetry relates -Fl 3) (0)/-Fs 8) (0) to 
a S s~ - 0.44, the D/(D + F) value of the baryon octet 
semistrong mass splitting, giving 

4 3) (0)/Fl 8) (0) = l/(3-4a ss ). (6) 

The ninth scalar renormalization constant f£ 0 ) (0) 
cannot be calculated by SU 3 symmetry, but can be 
related to the experimentally measurable pion- 
nucleon "a term” parameter 8 o vNN and the nucleon 
SU 3 mass splitting parameter Am defined respec¬ 
tively by 

+c)(N\J2 KJF 0 + K<F 6 |JV> 

» 45 ± 20 MeV , 

A,n={N\K!f e \N) (7) 

= ^ [M n - i(M A + M e )] 

~ 173 MeV, 


giving 

4 o) (0) 1 [ 3a ,,,/A m 

Ff\d) 2 L V2 +c 


(8a) 


We remark that if F% a) (0) and F^ e) ( 0) were equal, 
as is predicted in the quark model, then Eq. (8a) 
would fix o n/iN to have the value 

o TNN ^Am(42 + c) 


»28 MeV . 


(8b) 


Finally, we consider the pion scalar coupling 

which can be evaluated relative to f£ 6 ) (0) 
by noting that 

FjV(0) _ 2M ir {ir\ Kc& a |tt) 

FpW' (AfkcSFglJV) 


A. SU 3 and chiral SU 3 ®SU 3 predictions 

We begin by discussing those renormalization 
constants which can be determined within the 
framework of the Gell-Mann-Oakes -Renner 
(GMOR) model 4 for SU 3 and chiral SU 3 ®SU 3 
breakdown. In this model, the strong interaction 
Hamiltonian has the form 

3C=5C 0 + /c(JF 0 + c£F 6 ), (5) 


= i(M n 2 + M 2 ) -Af „ 2 

^(A/a + Af £ ) —M n 

V2 +c Am * 



where the final equality is obtained by using Eq. 

(7) and the GMOR relation M 2 /M 2 = (42 - c)/ 

(42 ~ + c). 

To get relations for the pseudoscalar density re- 
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normalization constants, we consider axial-vector 
current divergences in the GMOR model. Taking 
first the divergence of we find 

b xSF3 X --*[*1.k(SFo + cSF,)| 


= l 



( 10 ) 


which when sandwiched between nucleon states im¬ 
plies that 





*F9\ 0 ), 


( 11 ) 


Rewriting Eq. (7) for A /// as 




( 12 ) 


and dividing Eq. (11) by Eq. (12) we get 


S ( A a) (0> = * A (3-4a x ) (16) 

[where o; vl =0.66 is the D/(D + F) value of the bary- 
on octet axial-vector vertex] and dividing by Eq. 

( 12 ) gives the second relation 

(7T - c)F5?>( 0 ) + 2cFj,°'(0) = (3 - 4a a)^~ F£»( 0 ) . 

(17) 

A second independent relation for and 

Ej» o) (0) cannot be obtained in the GMOR model. 

We note, however, that if f£> 8 ) (0) and F^P'fO) were 
equal, as in the quark model, then Eq. (17) would 
reduce to 

jr(3)( 0 ) = (3 - 4a A ) F< S B) ( 0) 


* 1 > 3 ) ( 0 ) . g A Ms 
FfW) " 72 + c A m 


Next we take the divergence of giving 
a x JF^--i[F|, k((F 0 + cjf 6 )] 

. CV 2 - c . 72 c . 

Se+ ~JT~ T ° J ■ 


(13) 


(14) 


which when sandwiched between nucleon states 
gives 


2Msg l Jt\0) 

Using 



= (3 -4a A )F£»(0), (18) 

an analog of the SU 3 relation of Eq. (16). We re¬ 
mark finally that standard pion pole dominance 
arguments give for the induced pseudoscalar form 
factor h^H* 2 ) the expression 



2 MhJLa 
M*- k 2 


(19) 


from which we get 


0 ) = 


2 Msg A 

M* 


( 20 ) 


The formulas obtained in this section are listed 
in column 1 of Tables I and II. 


B. Quark model predictions 


We next turn to the quark model, within which we can calculate expressions for all of the scalar, pseudo¬ 
scalar, and tensor renormalization constants, and for certain of the form-factor first derivatives as well. 
We use for the nucleon the standard spin-internal-symmetry wave functions of the nonrelativistic quark 
model , 7 

l/>,s. = ^ i/ = (^) 1/2 [2|<P ♦ 3l*<M)+2|<M<M3l + > + 2|3l + <M<Pt)-|<M<P + 3lt>-l<M3i:t<P+> (21) 

-|<P + 3n<P*)-|3H<P+tfM)-|3HtfM<P+)-|<P+tfM3l*>], etc., 


where p denotes the proton and <P, 31 denote quarks. 
In treating the nucleon spatial wave function, we 
assume three colored quark triplets to be present, 
with the physical nucleon constructed as a color 
singlet . 8 The nucleon states are then completely 
antisymmetric in the color index, and so satisfy 
Fermi statistics with completely symmetric spat¬ 
ial wave functions, which we form from one-par¬ 
ticle quark orbitals. For the quark orbitals in a 
nucleon we assume a spherically symmetric Dirac 
wave-function form: 



. ( J J Of*") )y 

(47 t) 1/2 \-a •rJ l (r )/* ' 


( 22 ) 


with J 0 and arbitrary functions of r , with x 
being the quark Pauli spinor, and with the nor¬ 
malization constant fixed by the condition 


1 =^/dV</i t ( r)^(r) 

= / dV lf[ J ° 2 ( r)+J > 2(r) 1- < 23) 


Renormalization 

constant 


H s) <o) 

Fk 0) <0) 

F< s 3 > (0) 

Fp> > (0) 

Fj, 0) (0) 

Fp > (0) 

F[«>(0), F( 0) (0) 

F\ s) (0) 

F^>(0), F<°>(0) 

Fj 3) (0) 

fS^'io), F<°>'(0) 

f} 3), (o) 

=Mj,[F ( j 3) (0) 4 f‘ 2 ' ’ (0)1 +1 
3F<, 3 > (0) 


r,*B3F\ 3,, (0) + -J 


2 M 


N 




TABLE I. Nucleon parameters. 


SU 3 or chiral 
SU 3 ® SU 3 
prediction 


Quark-model 
prediction, in 
terms oil, , 

1» • m • fu 




-1 


i 2 +c 

Fi a) (0)/(3-4a ss ) 
</2 - c )Fj, 8 > (0) + 2 cF { p °> (0) 


= (3 -4a A )^&*F { s a) 
A Am s 


F ( s e) (0) -fL* ^ JL 
s J2+c Am 


3(1 -2 1 2 ) 

3(1-2 1 2 ) 

1 - 2 1 2 
T-^^3 


20.. r 

3 


3 

1 


ir _ 3 

1 it — 


313 2M n 


10 , 1 

Ail —‘ 


® 3 2M 


* 


2’(^ + 4) oiw ^3 + 


3M* ' 3 ■ m N - 


«"( -~rrr~h + 


m s ' 3 T 4i«r^ 




/ 4 +/ 5 


(Continued on following page) 


CO 
CO 
t—i 

to 


Quark-model 

Numerical value 



phenomenological 

relation 

from column 3 in 
MIT model 

Numerical value 
from column 4 



1.44 

1.86 


3 (“ 2* + To^a) 

1.44 

1.86 



0.48 

0.62 



2.64 

2.79 



2.64 

2.79 

CO 

H 

W 

b 

tVp 

4.41 

4.65 

X 

X 

w 

5S 

3 

3 

3 

r 

• 

1 

1 

1 

> 

0 

r 

(u, - 3)A2M m ) 

-0.36/(2M„) 

-0.21/(2M„) 

w 

X 




ft 

(!/«, - D/W„) 

3.40/ {2M N ) 

3.65/(2 M n ) 

c. 

1 2 Up . 3 

0.25 

0.17 a 


2 ' * 4M n 2 m N 2 

^7 

M t 5 


1.2 Sup , 1 

0.065 

0.035 


6 p 12Af* 2 AM it 1 

AV 

m7~ 



2.64 

Expt: 2.79 


V 

0.50 

m* 1 

0.33 , 

Expt; "^“7 = 0.66 F 2 
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TABLE I (continued) 


Renormal izatio n 

constant 

SU 3 or chiral 
SU 3 XSU 3 
prediction 

Quark-model 
prediction, in 
terms of/i ^ 

Quark-model 

phenomenological 

relation 

Numerical value 
from column 3 in 
MIT model 

Numerical value 
from column 4 

gt-JlUo) 

(3-4 a A )S A 

l-f/2 

U a 

0.65 

0.74 

g^Uo) 


l-K 

&A 

0.65 

0.74 

g^io) 


f(l-f^ 2 ) 

e A 

1.09 

Expt: 1.24 

h { 2 ) ( 0 ) 

mJ 8a 


\M N (^r ^ 2 — 5 ‘g A ) 

°- 037 

M 2 

2 M m A 

7 

^ 3) '(0) 


T <6*^4“T(^5^ 

t 

g A 

0-066 Fxot 

2,48 0.060 


-JfJ- ExPt 

' (0.90 GeV) 2 ’ M ff 2 

gWw/g^Uo), 

^!i 0) '(0)/^ o) (0) 



s' a/S A 



7V>(0), T[ 0) (0) 


i-K 

1 .3 

? + 10 

0.83 

0.87 

7^[ 3 > (0) 


T 

f 

1.38 

1.45 

T f 2 e) (0),1 < 2 0) (0) 


-h M x 2 h 

MA-W + h'2 

-1.98 

-0.88 

7^ 3) (0) 


~r^N 2 h 

3“M^ 2 [ -jT p 2 + tfgj} 

-3.30 

-1.48 



rl -VMsh + T5 M N 2 h + r “T / zl 


-0.78 

-144 

7<3 3) (0) 


?[-T^3 + T^ 6 + i-T/ 2 ] 

fl^-(i,-7^ s) (0)+TJ^ 3) (0)l 

1.34 

0.41 

ffj* >(0), T(, 8) (0) 


- 2M n I 3 + 2~J l 2 

- W? 7 ^ ’(0) 

-3.54 

-3.75 

T< 2 3) (0) 


2 _ 55, 


-0.62 

-0.66 

r*i 3) '(o) 


T<r / 4“^/ 5 ) 

j[u r p 2 + 20^ ji 

0.085/M„ 2 

0.0G8/M n 2b 

> '(O)/!^* >(0) , 

T^wiy^o) 



^ S, '(0)M S) ©) 




1 If ^ (k 2 ) is parameterized in the dipole formF f i 8 * (fe 2 )/fW (0) - ( l-k 2 /M s 2 )~ 2 , then this Value for M 8 ^(0) corresponds to M s = 0.84 GeV. 
’If T\ 3 * (k 2 ) is parameterized in the dipole form 7^ 3 * ( k Z )/T\ 3 * (0) = (l-k 2 /M T 2 )~ 2 , then this value for 7y ^(0) corresponds to M T = 0.91 GeV. 
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TABLE II. Pion parameters. 


Renormalization 

constant 

SU 3 or chiral 

SU 3 ® SU 3 prediction 

Quark-model prediction, 

In terms of/. = 

11 • m • • 9 

Quark-model 

phenomenological relation 

Comment 

F&’ (0) 

/2 + ca/ s <0) 

fM tf 3(l-2 1 2 ) 

><0) 

Equating columns 

2 and 4 =* 

Fj“>(0) 


$M„3a-21 2 ) 

fM/'s' ’(0) 

M w - 0.53 GeV 

7V’(0) 


- m *fh 

3 3 3 

J 

1 

For/= 4) 1 ' 4 , 
columns 3 and 4 
give —1.19 and 
—1.26, respec¬ 
tively. 


The procedure for calculating nucleon renormali¬ 
zation constants is now completely straightfor¬ 
ward. 9 We consider the general quark-model cur¬ 
rent JF r = i£r ip (T is a combination of y and X ma¬ 
trices) with one-nucleon matrix element 


{-Af(/> 2 )l JFrM/’i)) -tt/fU{p 2 )K P2*P\) u {p\) ■ (24) 
Working in the brick-wall frame with 

Pi = - zK P 2 = ^ k 

(25) 

Pio^P2o = Wm 2 + ik 2 ] l/2 , 

and using our independent-orbital construction of 
the nucleon wave function, we get the relation 


31 Af«(/> 2 )^r(/ , 2. PiM Pi> 




with 9Il r a matrix in the quark spin-internal-sym¬ 
metry space given by 


Wl 


r (k) = / 


dV e 


i It • r 


xr 


4;r 

iJ Q (r) 


(- z'J 0 (r), a -rJ^r)) 


-a -rJ^r) }' 


(27) 


Taylor-expanding e { k * r and equating terms of 
zeroth, first, and second order in k on the left- 
and right-hand sides of Eq. (26), we get formulas 
at zero momentum transfer for the form factors 
appearing in K T (p 2t p l ) t expressed in terms of 
integrals over the quark wave function, [in eval¬ 
uating the order k 2 relations we drop nucleon 
recoil terms of order k 2 /(SM N 2 ) on the left-hand 
side of Eq. (26); these terms are relatively small 
and do not represent a well-defined correction 
since a description of nucleon recoil has not been 
built into the quark-model wave functions.] The 
quark wave-function integrals which appear are 
linear combinations of the five basic integrals 


/ *=J dV lr J > ) ’ 

7 3= / dV % rJ o(r)JSr), (28) 

/ « = i dV 

7 * = / dV lr r2j > 2(r) - 

Expressions for the nucleon scalar, pseudoscalar, 
vector, axial-vector, and tensor renormalization 
constants and certain form factor derivatives, in 
terms of I l9 -. -, / 5 , are given in column 2 of 
Table I. Eliminating the integrals 5 in terms 
of the normalization condition of Eq. (23) and four 
experimentally measured parameters of the vec¬ 
tor and axial-vector currents [we take these as 
Sa* Sa s £^(0)» = proton squared charge radius, 10 

\ip/(2M N ) = proton magnetic moment] gives the phe¬ 
nomenological relations listed in column 3 of 
Table I. These relations are valid in any quark 
model with a spherically symmetric wave function 
of the form of Eq. (22); for example, they are 
valid in both the MIT 9 and the SLAC 11 bag models 
and in the Bogoliubov model, 12 even though these 
assign the quarks very different looking wave func¬ 
tions. 

The procedure for calculating pion renormaliza¬ 
tion constants is analogous to that used for the 
nucleon, with a few differences which we briefly 
describe. Just as for the nucleon, we use for the 
pion the usual spin-internal-symmetry wave func¬ 
tions of the nonrelativistic quark model, 7 
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Oo* = ( 2 ) l/2 [ | ? 5*<P*)-|W*<PO], etc. (29) 


For the quark wave function we use an analog of 
Eq. (22), 



' ujr/f) V 
-a-rJ,(r//)/*' 


(30) 


with / being a rescaling factor which reflects the 
fact that quark orbitals in a pion may have a dif¬ 
ferent radius from those in a nucleon. In the MIT 
bag model 9 / has the value (f ) x ^ 4 ~ 0.90, not much 
different from unity. The antiquark wave function 
is the same as Eq. (30), with the antiquark con¬ 
tribution to a current with even (odd) charge con¬ 
jugation equal to +1 (-1) times the corresponding 
quark contribution. The pion analog of Eqs. (24)— 
(27) is evidently 


{*(p2)\ST\*(Px))^K' T (t> Z) t> x ) 


IT 


on 


Vsm/E) 

uuarL 


7H 


QM 


(31) 


91* = 




with 9Il r being the same matrix function as defined 
in Eq. (27). In applying Eq. (31) we only expand 
out to terms of first order in k, since neglect of 
recoil in the case of the pion would be unjustified. 
To order k, the normalization factor is just 
1/(2^). In the case of the tensor density cou¬ 
pling to the pion this factor of is just can¬ 
celled by a corresponding factor of M n coming 
from K f ri giving a formula for r ( ff s) (0) which does 
not involve the pion mass. On the other hand, in 
the scalar density case the factor M n ~ x survives, 
giving the relation 

4V(0)=4M 7f (l-2/ 2 ), (32) 


the quarks in a nucleon are confined to a finite 
spherical region of space of radius R Qt with or¬ 
bitals 


=jo(wr/R 0 ), Ji(r)=j l (ur/R 0 ), r^R 0 
^o(rW,(r) = 0, r>R 0 , 

w =2.04, fi o = 0.97iW n -‘, 

. , . sins . . . sins cos 2 

*(*)■ j.u>- - j—— ■ 


(34) 


Evaluating the integrals l x . 5 we find in the MIT 

model 


/. = 


/.■ 




/- * 


2w-l 

4(w-l) 


*0.740, 


TT~-7T =0.260, 

4(co — 1) ’ 


2^2 4^_?_ _o 304/? 

w 8(«-l) U * J *°’ 


(35) 


= 0.357fl o 2 , 


^ 24w 2 (w-1) 


R £ -(4u> 3 - 10w 2 + 20w 


-15) 


= 0.175* 2 


III. TABULATION OF RESULTS 

In Tables I and II we tabulate our results for the 
form factors defined in Eq. (2). To recapitulate, 
the quantities c, Aw, a$s, a Al and o defined 
above in Sec. HA* have the values 


which explicitly involves the pion mass. Since, 
however, the quark model leads to a degenerate 
meson 35-plet, instead of having a nearly mass¬ 
less pion, we reinterpret the factor M n in Eq. (32) 
as being M Mi a typical quark-model meson mass, 
and thus write 

Fft>(0)«4iH f (l-2/ a ). (33) 

As we will see below in Sec. IV, this interpretation 
of Eq. (32) is in accord with the chiral SU 3 2> SU 3 
formula for F^(0) obtained above. The results of 
our analysis in the pion case are given in column 

2 of Table H (in terms of the integrals I x . 5 ) 

and in column 3 of Table H (in terms of vector 
and axial-vector current parameters). 

We conclude this section by giving expressions 

for the quark orbitals and the integrals I x . 5 in 

the MIT bag model, 9 which gives a fairly satisfac¬ 
tory account of the measurable parameters of the 
vector and axial-vector currents. In this model 


c ~ -1.25 , 

Aw * 173 MeV , 

a***-0.44, (36) 

a A ~ 0.66, 

45 ±20 MeV , 

while the integrals l x . 5 are defined and eval¬ 

uated in Eqs. (28) and (35). The mass Af tf , a typi¬ 
cal quark-model meson mass introduced in Eq. 
(33), is of order 0.6-0.8 GeV while the scale fac¬ 
tor / introduced in Eq. (30) is close to unity, with 
the value (|) 1 ^®0.90 in the MIT model. 13 

IV. DISCUSSION 

We conclude by comparing the results obtained 
by the various calculational methods described 
above and by attempting to estimate the reliabil¬ 
ity of our predictions for the scalar, pseudosca- 
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lar, and tensor current parameters. We turn our 
attention first to the isovector pseudoscalar re¬ 
normalization F$> 3) (0) and the isovector induced 
pseudoscalar amplitude ft*? MO), both of which are 
pion pole dominated. From chiral SU 3 ®SU 3 and 
pion pole dominance we find 


n 3) (0) gA 

4 8> (0) " TTTc 1 

(37) 

A (3)(0)= 


The trouble here is most likely the quark-model 
prediction that Fk o) (0)- Fj?M0), which leads to 
near cancellation of the two terms on the left- 
hand side of Eq. (17) and hence to a large pre¬ 
diction for f£ 8) (0). In actual fact, since there 
is no light ninth pseudoscalar meson associated 
with the SU 3 -singlet axial-vector current it is 
likely that F£°M0)< f£ 8) (0). Rewriting Eq. (17) 
in terms of the ratio 


v- 

f£ a '(o) 


(42) 


_ 2A^ 

Mj 


1.24, 


while the MIT model gives 14 


Ffc»(0) 

W) 


= 3.1, *«,»>(0) = 


2Af, 


M. 


f 0.037 , 


(38) 


both much too small. Evidently, the quark-model 
predictions for pion pole dominated pseudoscalar 
quantities behave as if the effective pion mass 
were 

( = 0.51 GeV from Fj?>(0), 

(39) 

/ I 04 \l/2 

( 0 ^) M * =0 ' 81 GeV from A <?>«», 


not unreasonable values since the quark model 
does not predict an almost massless pion, but 
rather gives a pion degenerate with all other 
pseudoscalar and vector mesons in the 35 re¬ 
presentation of SU e . [in fact, the same MIT mod¬ 
el calculation giving the value / = (|) l/4 used in Eq. 
(30) above leads to a value of the 35 representa¬ 
tion central mass of 8u?/(3 fR 0 ) =0.87 GeV, con¬ 
sistent with the above estimates.] Referring to 
Table n, we see that these values for the effec¬ 
tive quark-model pion mass are compatible with 
the value 0.53 GeV obtained by equating the chiral 
SU 3 S>SU 3 with the quark-model predictions for the 
pion scalar density coupling F^fO). 

We consider next the isoscalar pseudoscalar re¬ 
normalization constants Fj, 8) (0) and Fj?M0). As 
we have seen, chiral SU 3 S>SU 3 gives a single equa¬ 
tion [Eq. (17)] relating these two constants to 
F£ 8 M0), which reduces, when F$> 8) (0) and F$»°M0) 
are equal (as in the quark model), to the simple 
relation 

®=( 3 - 4 ^ ) wS= 15 ' < 4o) 



This prediction is evidently in serious disagree¬ 
ment with the quark-model value 


Fp a) (0) 

WW) 


= 1.5 


(41) 


we find 

J?>(0) (3-4 a A )M NgA 

WW)~ Aw(V2 -c + 2cr) * 


(43) 


which gives the following predictions for r 
= 0.3,0.5, 0.7 respectively; 


r s 0.3: 



F ( P °\ 0) 

WW) 


0.38, 


r - 0.5; 


F^MO) 

4 a, (o) 



jy>(Q) 

WW) 


0 . 86 , 


r =0.7: 


n a> (Q) 

4 a, (0) 


2.65, 


i^ 0, (0) 

4 a, (0) 


= 1.85, 


(44) 


in reasonable agreement with the quark-model 
value of Eq. (41). 

Continuing our comparison of columns 1 and 2 
of Table I, we note that SU 3 predicts 

^ 8) (0)/l : 'i 3) (0) = 3-4a ss = 4.76, (45) 

with an empirical semistrong D/(D + F) ratio a ss 
«-0.44, while the quark model gives 

i^ 8 '(0)/4 3 '(0) = 3. (46) 

Evidently, Eq. (46) represents pure F-type bary- 
on octet semistrong mass splitting, a feature 
which is a well-known shortcoming of the quark 
model. For the corresponding axial-vector cou¬ 
pling ratio SU 3 predicts 

g { 2\ 0)/^ 3> (0) = 3-4a j4 (47) 

with an empirical value a^ K 0.66, while the quark 
model gives 

^ 8> (0)/^ 3) (0) = 3/5, (48) 

corresponding to a value of a A of 0.6. Although 
the quark-model value of a A is quite good in this 
case, the fact that Eq. (47) vanishes for a j4 =0.75 
makes the predicted ^ 8) in the quark model differ 
by more than 60% from the value obtained from 
SU 3 and the empirical a A . Obviously, in doing 
phenomenological calculations the predictions of 
column 1 of Table II should be used (where they 
are available) in preference to the quark-model 
values. 
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For the value of F^ 3) (0) and for all of the tensor 
density parameters, we must rely solely on quark- 
model predictions since no information is furn¬ 
ished by SU 3 or chiral SU 3 ®SU 3 alone. Hence it 
is essential to have some a priori estimate of the 
reliability of the quark-model predictions. 

The following five considerations would appear 
to be important in forming such an estimate. 

1. Consistency of the quark model with SU 3 and 
chiral SU 3 &SU 3 predictions , where available , 

This question has just been discussed in detail 
above. In the case of F^ ) ( 0), the 60% discrepancy 
between Eq. (45) and Eq. (46) suggests an estimate 
of 60-90% for the possible quark model uncertain¬ 
ty- 

2 . Comparison of the quark-model predictions 
for the vector and axial-vector parameters with 
their known experimental values . Referring to 
Table I, we see that the MIT-model predictions 
for g Al g Al fi Pt and Yp all agree with experiment 15 
to within about 30%, suggesting 30-60% as the gen¬ 
eral level of reliability for quark-model predic¬ 
tions when other factors (such as pion pole dom¬ 
inance, sensitive cancellations, nucleon recoil 
corrections, or possible large "glue” contribu¬ 
tions) are not involved. In particular, this esti¬ 
mate of the quark-model uncertainty might be ex¬ 
pected to apply to the tensor renormalization con- 
stant Tf >(<)).“ 

3. Consistency between the predictions in the 
final two columns in Table I 0 Column 5, we re¬ 
call, gives the predictions of the MIT-model wave 
functions, while column 6 gives the predictions 
obtained from the quark-model phenomenological 
relations of column 4, using as input the empirical 
values of g Al g' Al p t1 and r p 2 . Sensitive cancella¬ 
tions are unlikely to be involved in cases in which 
the quark-model predictions are relatively large 
and relatively unvarying from column 5 to column 
6, as for example, for r{ 3) (0). On the other hand, 
when the quark-model predictions are small or 
strongly varying from column 5 to column 6, as 
for T< 3 ’°>(0), ?i 3) (0), and T 3 (3 ’°’ 3 >(0), they may be 
considerably less reliable than the 30-60% esti¬ 
mated above. 

4 . Possible importance of neglected nucleon re¬ 
coil terms . Whereas F$°(0), rj 8,o * 3) (0), and 
^ 8,o,3) (0) are true static quantities which are in¬ 
sensitive to our neglect of nucleon recoil, expres¬ 
sions for the renormalization constants 7 , ( 8 » 0 *a>(0) 
are obtained from the second-order term in k in 
Eq. (27) only when nucleon recoil ambiguities are 
neglected. This introduces an additional source 
of uncertainty in the quark-model determination 
of T< 8 -°- 3) (0) relative to the uncertainties present 
in the quark-model determinations of the other 
renormalization constants. 


5. Possible presence of large “glue” contribu¬ 
tions. In the quark model only quark contributions 
to the various current densities are evaluated, 
while possible contributions from the “glue” which 
binds the quarks together are ignored. One pecu¬ 
liar feature of tensor densities is the possibility 
of induced vector meson couplings of the form 

(49) 

with A being a vector meson field. Such couplings 
can contribute to the induced tensor renormaliza¬ 
tion constants T 2 (0) and T 3 (0), while not affecting 
the value of 7^(0). If all vector gluons carry a 
color quantum number, then terms like Eq. (49) 
will be absent in the color-singlet tensor densit¬ 
ies of Eq. (1). In this case, the quark-model pre¬ 
dictions for T| oj (0) and r£ 3) (0) should, like that 
for r£ 3) (0), be relatively reliable. On the other 
hand, if color-singlet-unitary-singlet gluons are 
present, then the unitary-singlet tensor current 
JFq 71 could receive important “gluon” contributions 
from terms of the form of Eq. (49), introducing 
a possible large uncertainty into the quark-model 
prediction for 7^ o) (0). 

Added note . Applying the method of Eqs. (10)— 
(15) to the ninth axial-vector current gives 
the divergence equation 

3 x JF^=i/c(|) l/2 (JF 5 0 + cJFl), (50) 

which when sandwiched between nucleon states 
gives 

2M^ o) (0)=^[v^ F£»(0) + ci^ 8 >(0)]. (51) 

Dividing Eq. (51) by Eq. (15) then gives the addi¬ 
tional chiral SU 3 ®SU 3 relation 

gi. o) (0) . vgy+c , 

" V2T-c + 2cr ’ ( ’ 

with r - Fi°*(0)/Fj> 8) (0) being the parameter defined 
in Eq. (42). For r = 0.3, 0.5, 0.7, Eq. (52) gives 
the respective predictions for ^ii o) (0)/^Ji 8) (0), 


r 


0.3: 


g ( A 0 H 0 ) _ 

g? W" 


0.43, 


r = 0.5: 



(53) 


-0 7. *i 0> (°)_ 0 28 

- 0 - 7 - j?w~ 0 - 28 ’ 


while for the quark-model value r - 1, Eq. (52) 
reduces to the quark-model prediction that 
g^iOVg^HO) = 1. Equations (50)—(53) are valid 
only when anomalies are not present. When ano¬ 
malies appear, the above equations apply to the 
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axial-vector renormalization ^ o) (0) associated 
with the “symmetry generating” ninth current, 
but this is no longer the same as the axial-vector 
renormalization for the physical ninth axial-vec¬ 
tor current. (See W. A. Bardeen, Ref. 17). 
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We reanalyze the problem of regularization of the stress-energy tensor for massless 
vector particles propagating in a general background metric, using covariant point se¬ 
paration techniques applied to the Hadamard elementary solution. We correct an error, 
pointed out by Wald, in the earlier formulation of Adler, Lieberman, and Ng, and find a 
stress-energy tensor trace anomaly agreeing with that found by other regularization methods. 


1. Introduction 


The problem of regularization of the stress-energy tensor for particles propagating 
in a general background metric has been treated recently by many authors using 
differing calculational methods [1]. In an earlier paper by Adler et. ah [2], the regulari¬ 
zation problem was approached by applying covariant point separation techniques to 
the Hadamard elementary solutions for the vector and scalar wave equations. The 
results of Ref. [2] appeared to contradict those obtained by other methods [1], in that 
no stress-energy tensor trace anomaly was found. Recently, however, Wald [3] has 
found a mistake in the formal arguments of Sec. 5 and Appendix ,B of Ref. [2] which 
accounts for the discrepancy. The mistake consists of the assumption, made in 
Ref. [2], that the local and boundary-condition-dependent parts G {1L) and G {1B) of 
the Hadamard elementary solution G {1) are individually symmetric in their arguments 
x and x' f as is their sum G {1) . Wald [3] has shown that when this assumption is drop¬ 
ped, a reanalysis of the stress-energy tensor in the scalar particle case gives the standard 
result for the trace anomaly. The purpose of the present paper is to give the corrected 
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stress-energy tensor calculation in the vector particle case; again, when the lack of 
symmetry of G&V* and G&® 1 is correctly taken into account, the standard result for 
the trace anomaly is obtained. In order to avoid needless repetition of formulas, this 
paper has been written in the form of a supplement to Ref. [2], with Eq. (N) of 
Ref. [2] indicated as Eq. (2.N) . 


2. The Calculation 


According to Eq. (2.4), the vector particle stress-energy tensor T aS is a sum of 
Maxwell, gauge-breaking, and ghost contributions, 

J ' T-'M I rrBR I 'rGH /1\ 

ocB '— 1 ct(3 ~r 1 ctB ~T 1 ct(3 • W 

Since the argument of Eqs. (2.43)-(2.45) showing that 

<tT/ + <r° 0 H > = o ( 2 ) 

does not depend on splitting the Hadamard elementary solution G a) into (L) and (B) 
parts, it is unaffected by Wald’s observation, and so we still have, as in Eqs. (2.5)- 
( 2 - 6 ), 

<T xe (x)) = CC(x)> 

= (ga“g e A g m ~ lg« B g U Y J ) iU. . 

Puviu = lim, l<FJx) Fjx") + Fjx') Fjx)). (3) 


Expressing the expectation in Eq. (3) in terms of the vector particle Hadamard 
elementary solution Gj , and splitting Gj into (L) and (B) parts, we get [as in 
Eq. (2.48)] 


<TJtx)> = <7i U (*)> + 

<t£ ib \x)> = (g«Vr - i&*s“Y°) P%1? 


p( L/B) 

-* uvAo 

p(L/Bj 


= ^f[uv]ha]) > 


[D u D h -Gll LIB) (x, x% 



with the notation [ ] in the final line denoting the x r -> x coincidence limit. We 
assume that the highly singular coincidence limit is regularized in such a manner that 
<T a sW) is finite and covariantly conserved, which implies that 


D% T^(xj) = -DXT^(x)). (5) 

An explicit calculation of the right-hand side of Eq. (5), to be given below, shows that 

DXT^(x)) = D«C\x), (6) 
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with ^(x) a tensor local in the Riemann curvature and its second covariant deriva¬ 
tives. Thus, from Eq. (5), we have 

D a KT^\x)y + t^\x)} = £“[--< + t%\x)} = 0 , ( 7 ) 

which implies that (T^x)) + ^(x) is a tensor local in the Riemann tensor and its 
covariant derivatives, which furthermore is covariantly conserved. Since no parameters 
with dimension of mass appear in the problem, on dimensional grounds this tensor 
must have the structure 

(T’isV)) + -- Ci4 0 (x) + c-JAx), 

7 <* = 0 =^ f dix( ~ g)1/2 R2 

= 2 gocdR.e 6 + 2 /?. a 0 -f 2 goteR 2 ) (8) 

4 * = 8 ^ J d * X (~£ 1/2 RPTR *r 

~ 2 8*Q R 'Q 6 + 2 £*& R f>o Rpe “1“ R ><*0 ~~ R a0,e° ~ 2R pe R poc dB , (8) 

with c x and c 2 arbitrary coefficients. The regularized stress-energy tensor thus becomes 

< T a0 (x)) = cJJjc) + c 2 J a0 (x) ~ tSe(x) + <rjf(x)>, (9) 

which by Eqs. (7) and (8) is automatically covariantly conserved. To evaluate the 
trace of <T a0 (x)>, we note that from Eq. (4) we have g a0 <T a { ® ) (x)> = 0. Hence we get 

g*\TM)> - - 2 ( 3 c , + c 2 ) R/ ~ g« Q t%\x\ ( 10 ) 

which is the trace anomaly [and, as is evident from Eqs. (43) and (44) below, is 
nonvanishing irrespective of the value of 3 c x + c 2 ]. 

In order to evaluate ^(x) we must carefully calculate D^T^^x)), keeping in mind 
the fact that while G*Jl(x, x') is symmetric under the interchange v, x <-> v\ x', G^V* is 
not symmetric and hence neither is G*J? J . Following the notation of [2, Appendix B], 
we write 



W va '{x, x) = iG!,?(x, x'), 

( 11 ) 

which on 

substitution into Eq. (4) gives 


with 

ryz/T-^Bk _ y 08r\ocn(B) 1 p(B) 

\ ‘ wA / g\ g xJ 1 txQ-yQ 4 g g xJ a0y8 5 

( 12 ) 


f*oc(3y8 ,a y’ H - iy',08’ ^^aS' ,0y' ^^0y' .ad']’ 

(13) 


In evaluating Eq. (12) we can use the equation of motion at x, 

rr vo ,u iy v rr 6o 


(14) 
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but we cannot (as was done in [2, Appendix B]) assume symmetry of W va ' or the 
Lorentz gauge condition of Eq. (2.B3). Beginning with the second term on the right- 
hand side of Eq. (12), and using Synge’s theorem, we get 


n p(B) __ 

ocBy8 


[^V.06'A — faces'. Sv'A + ^V.SS'A' — ^cS'.Sv'A' ~ £)] 

a <-> /3 

f 

W a y’^d'A ~ W a d'.By'A ~ Wya\ss'y + ^Sa'.yflV ~ 


+ 


w. 


yoe ,60V 


^^6a ,yB’ A* 


+ W t 




av ,08 /l 


w. 


ocd'.By'A' 




(b) 

(15) 


Substituting Eq. (15) into the second term in Eq. (12), relabeling dummy indices, and 
combining like terms, we get 


_i<r av a 06 r> 

iS & yd 


1 

2 


l(D k ~ D, )(W 


v 6 
v , 8 



1 

2 





v 

. 8 


' + w 



v 

y 




We next consider the first term on the right-hand side of Eq. (12). Again using Synge’s 
theorem, we have 


nap(B) 

u r oc(3y6 


[W'06\ay'“ + fV a y' i(3S ' a JEaS',0y' a W 8y\a8 ,a 

+ ay' a + W av ' i06 ' a W a6 ' i By'° C ^0y\a6' a ]• 



Using the Ricci identity and Eq. (14) to simplify the first line of Eq. (17), and using the 
cyclic identity, just as is done in [2, Appendix B]; to simplify the second line of 
Eq. (17), we get, on substitution into the first term of Eq. (12), 


g,Y s D a pif vS = - w at ', a V] 


oed 


oed 


+ M(^V,V - ^V.V - ^V.V + w 6 -, v 

Term (d) of Eq. (18) cancels term (b) of Eq. (16), giving the result 


6 y 

t t 


),A']- 


(c) 

(d) 


(18) 


D\rlfy = - M(A - ZVX^V.V - ^V.V)] 


y 8 


r y 6 


+ [^a'.“V - ^0'.“']- 


oed 


(a) 

(b) 


(19) 


If W va '(x, x f ) were symmetric under the interchange vx*-+a'x\ expression (a) in 
Eq. (19) would vanish, and if the gauge condition of Eq. (2.B3) [W vu >* = — Wy with 
W a scalar] were valid, expression (b) in Eq. (19) would banish, giving the result 
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£ w <Twa J > = 0 found in [2, Appendix B], In fact, as we shall now show, expressions 
(a) and (b) in Eq. (19) are both nonvanishing, providing the origin of the tensor 
/<&*(*) appearing in Eq. (6). 

We begin with the evaluation of Eq. (19a). Substituting 

11 / ^ /I 1/2 (b) 1 ., 1 / 2 , (L\ 

(47t) 2 ^ ^va’ (47t) 2 ^ tyvo' ^*»a') (20) 

and noting that (i) w va '(x, x f ) is symmetric, and so makes no contribution to Eq. (19a), 
(ii) in the series w^\ = vv^ f (r(x, x) + w^ f (r(x, x r ) 2 + ***, only the first term contri¬ 
butes to the coincidence limit in Eq. (19a), and (iii) terms with A differentiated make 
no contribution to the coincidence limit in Eq. (19a), we get 


- i [(D, - Dy)(W 


V 6 

y', 



1 l 

2 (4tt) 2 


[0>a - zvxrfvxv 



1 1 
2 (4tt) 2 



D a) wi 


(L)- 


- 2D, 


'wi UV A' 


+ 2D 


Wl A6‘ 


]• 



The next step is to evaluate the coincidence limits appearing in Eq. (21), following the 
procedure used by Wald [3] in the scalar case. Writing the recursion relations (2.20) 
and (2.38) for v loco ' and in the form 


„« , 1 Dvio 

5i " , + 2 i ~sr 


1 " + 2 ds 


- I [A^ l2 D u D^ l2 v 0 V) - BTv 0y A 

- i vfs + \ [A-^D u D u {A^\\-) - r^,], 



with s the arc length along the geodesic joining x r to x, one finds the unique solution 
regular at s — 0 


w 


(L)a 

1 o' 


--^iV — -if g^s, s) v^/ix, x') s ds. 

S dQ 


(23) 


Taking the coincidence limit of Eq. (23) and its first covariant derivative gives 

[h’1 l, V] = (-1 - ^ £* d *) [»iV] = -\ [*1, 

[D^\-] = (-l £ s 2 ds) [D^A = - \ [DM. 


(24) 
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Thus, making use of Synge’s theorem and the fact that a symmetric biscalar 

function of x and x', which implies [D^y ] = \D x [v^ y '\ we get the evaluations 


[dM L \-] = - vDxW./l 
[D^A = - I D X WA, 

[D^A = i[DjvAA - iD«[vA-l 


(25) 


[dX l) a ,<] = ADA^y)] - tAKV]. 


Substituting these into Eq. (21), we get 


1 


\ [Pa - DA{W\'.%’ - ^V,/)] 


(26) 


We trun next to the evaluation of Eq. (19b). We will need, as auxiliary formulas, 
some consequences of the gauge condition of Eq. (2.26), 


v , _<-v 

wju', 1 L^O.a' — 


(27) 


Substituting the Hadamard formulas 

.(i) 2JV2 . 2 g 


(1 ) __ . , . \ 


(47 T 
2JV2 / 2 


(28) 


G ° 1J ~ cm 5- (- + ® ln 


into Eq. (27) and equating to zero the coefficient of ln cr gives 

(ZE/V'),’’ + (A 1/2 v)y = 0, 

while the remainder gives 

2AV\ a . + 2(AV*g v A: + AV*vay + A^v^: + a{A ll2 A,y + o(A 1I2 w v A: 
Substituting the series expansions 


(29) 


0. 

(30) 


co 


CO 


V v u r ^ Vnvo'G i W va * ^ W nva 'tJ , 


w=0 


CO 


n =0 


co 


( 31 ) 


V = Z U " <7 "> 


n =0 


W = Z M '« <T " 

w=0 
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into Eqs. (29) and (30) and equating coefficients order by order in ct, we get the 
recursion relations 


2AV\ a . + 2(JV2£ ro , ); + A^v 0 ay + = 0, 

A 1/2 v n o.<,- + A 1/2 v ma -o* = —(1 ln){{A 1 / 2 v n _ 1 )y + 

AV*w n <,. a . + = (1 ln*){(AV*v n ^ + 


(32a) 


n ^ 1, 


(32b) 


n>l. 


(32c) 


In particular, we will need the coincidence limit of the n = 2 case of Eqs. (32b) and 
(32c), which give 

[tw/ + «!.„'] = 0, 

(33) 

[ww. v + w 1-(/ ] = 0, 

and the n = 1 case of Eqs. (32b), (32c), which combined give 

(J 1/2 w 0 ) i<7 ' + Gd 1/2 WW)/ = — — A 1 l 2 w lva -a_'‘ - — A 1 ! 2 v lva -a\ 

(34) 

We now proceed as follows. Substituting Eq. (20) into Eq. (19b), and keeping only 
those terms in the series expansion of Eq. (31) which make a nonvanishing contribu¬ 
tion in the coincidence limit, we get 


[W a A' *V - w aS ’?\’] 


^ [(A' /2 W £[- + 2P' 2 wS-rV - (A ll2 wZ- + A''\{lW)°\.} 

- (A 1/2 WoxS ’)?\’] (a) 


(35) 


+ p 1 '**®*) “V - (zEV^rV]. (b) 

(4tt) 

where in the next to last line we have replaced w)®). by w 0aA ', which is justified since 
wfc\> = 0- To evaluate term (a) in Eq. (35), we substitute Eq. (34), which gives 

( a ) = [( —(-^ 1/2 W 0 ),A' — ^ 1/2 W 1 (T. a' — J 1 / 2 W la A'(T, a — A'Pt^Oy — 4l 1 / 2 U laA '(T“) i V 

(47Tj 

— (—(J 1 / 2 ^,,)^' — A 1 j — A'PV&'S' ~ A'^V ic6'CT. a ), 6 A'] 

= ,J ~Y 2 [~ (A'Pw-iOX + 4E/ 2 W laA '(T “),V + (J 1 / 2 VV 1 (7 |6 ' + /l 1 ' ! W 1I ,l'ff i “).V] 

(4tt; 

+ TX^F [ —(4l 1/2 Ui<7.A' + “) S 6 ' + (d 1 / 2 !)^ 7.S' + ^ 1/2 ZW CT .“).V] 

(477)“ 
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= - tAt P 1/2 "W)- (^ 1/2 "W).“ - (^ 1/2 w la “'),A' 

(y 7T ) 

+ + 3(4^),.'] 

- 1 P 1/2 VaO- ^ 1/2 *w),“ - (^“'),/ 

(47TJ 

+ (J^abO.® + 3(zJ 1 / 2 z; 1 )_ A ']. (36) 


Substituting Eq. (33), we can eliminate the terms (J 1 / 2 ^) A < and (JVfyj)^', which 
gives finally 



[(^WlaA'X 


4(JV2w laA 0 ,“ - (^/W).A' + (^^WaabO.®] 

- (^ 1/2 u la “'),A' + (^^Afl-). 8 ]. (37) 


Similarly carrying out the differentiations in term (b) of Eq. (35), we get 
(b) = K J1/ * W $0- 4(^ 1 VSaO“ - 04 1/ M B, /).a' + (4l l/2 w^.). 0 ], (38) 


which when added to the expression in Eq. (37) gives 


[wv,“V - fWl = - 


«6 


1 


[D«wU' - 4D«w[ L a l - DM L) / + 




(4") a 

1 

(4rry 


[D*'Vm' - 4D«v lah < - D X 'Vy*' + D*v W ’l (39) 


In writing Eq. (39) we have used the fact, noted above, that derivatives of A do not 
contribute in the coincidence limit. Equation (39) can be reduced to final form by 
using the following relations [some of which were already given in Eq. (25) above]: 

= iAfri] + D a [v^l 

l^^laA'] = — %D h [ Vl l 
[Dl-Vy*’] = WV^'l 


U^iAe'] — \D\\Pi ] + A*[*WL 
[D-w^y] = - fZ) aK] - f DAvfA 
[D«w^,] = lD,[v,l 
[Dyw^/] = - %D,[v lx *'], 

[DVU'] = - iD,[ Vl ] - | D a [ Vl «y], 
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which when substituted into Eq. (39) yield 


ocd 


[ W - j- i /)»[<] + l D x [v*A + d^} 


(41) 


Combining Eqs. (6), (19), (26), and (41), determine the tensor to be 


1(3 

j 4& a b[ V 1v'] + + 8ocbI V i]\ 


(42) 


Using Eqs. (2.33)-(2.34) and the formulas of [2, Appendix D] to evaluate the coin¬ 
cidence limits appearing in Eq. (42), 


2#2a 0 9 [^l] — 2 a 2~°9 

a 2 oi0 ~ hi^gjR Ra)(^ SesR — ^ 00 ) + To 8 a&R.e e — iinga£R pd R 


+ T hg« S R PTKd R>r K e - ~ A* PT0 a R 


PT00 > 


(43) 


02 


^=0 _ l p2 1 l n 0 i 1 nPTKd n 1 noO p 

2 — -f^K - 1 - Tsft.e \ TTo^v K PTK e — TTo^ *'-pd 9 


we get as our final results for the regularized stress-energy tensor and its trace anomaly. 


<TJx)y = cj a0 (x) + c 2 J aB (x) + <T ( x f(x)) 


1 1 


(47 rf 2 


3 

4 SaB a 2 v + 02a0 + £a002 01 


(44) 


8ae( T a e( x )y -2(3c 2 + c 2 ) R^ 6 + 


1 


(0 2 v v - 2a 2 K=0 ). 


(4 ny 


Apart from the undetermined multiple of RJ arising from the undetermined multiples 
of I a0 and J aB in <r a0 (x)>, the trace anomaly given in Eq. (44) agrees with that found 
by other calculational methods. We note, in conclusion, that in the present formula¬ 
tion of the regularization calculation the “curvature-dependent modified averaging” 
prescription of Ref. [2] plays no role, the regularized local part <r a { g , (x)> having been 
identified, by general arguments, to have the value given in Eq. (8). 
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44 No-hair” theorems for the Abelian Higgs and Goldstone models 
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We examine the question of whether black holes can have associated external massive vector and/or scalar 
fields, when the masses are produced by spontaneous symmetry breaking. Working throughout in the spheri¬ 
cally symmetric case, we show that “no-hair" theorems can be proved for the vector field in the Abelian Higgs 
model, for an arbitrary %R term in the Higgs Lagrangian, and for the Goldstone scalar field model with 
t “ 0. We also show that a Minkowski-space analog problem does have nontrivial screened charge solutions, 
indicating that the “no-hair” theorems which we prove are consequences of the stringent conditions at the 
assumed horizon in the general-relativistic case, not of the interacting field or spontaneous-symmetry-breaking 
aspects of the problem. 


I. INTRODUCTION 

One of the striking features of the physics of 
black holes is the existence of “no-hair” theorems, 
which state that the only external attributes of 
a black hole (such as its mass Af, angular momen¬ 
tum J , and electric charge Q) are those associated 
with massless fields admitting conserved flux in¬ 
tegrals. 1 * 2 All other types of fields must decouple, 
under the assumption of a well-behaved geometry 
at the horizon. These theorems have been proved 
for a variety of wave equations, including the 
massless Dirac field, various massive scalar field 
theories, and the massive spin-1 Proca field. Our 
purpose in the present paper is to extend this list 
of equations studied to include classical wave equa¬ 
tions in which masses are generated by sponta¬ 
neous symmetry breaking. This is particularly 
important in the vector-meson case, since it is 
widely believed that if massive spin-1 fields exist, 
they get their masses through a dynamical mech¬ 
anism of spontaneous symmetry breaking, 3 rather 
than kinematically as in the Proca equation. The 
simplest relevant model is the Abelian Higgs mod¬ 
el, 3 and so the main focus of this paper is on the 
question of whether black holes can have Abelian 
Higgs “hair.” We also give some results for the 
closely related Goldstone scalar-meson model. 

For simplicity, we assume spherical symmetry 
throughout, since we expect that if interesting 
violations of the “no-hair” theorems were to oc¬ 
cur, they would be seen in the spherically sym¬ 
metric case. We find, in fact, no evidence for 
such violations, and prove “no-hair” theorems for 
the cases we study. We believe it likely that our 
proofs will generalize to the nonspherical case. 

II. THE ABELIAN HIGGS MODEL 

Before writing down the Abelian Higgs model 
Lagrangian, we begin with some geometric pre¬ 


liminaries.** We assume the general time-indepen¬ 
dent, spherically symmetric line element 

ds 2 = -e 2a dt 2 + e 2& dr 2 + r 2 (d6 2 + sin 2 6d<p 2 ). (1) 

Using a caret to denote components on the or¬ 
thonormal basis 

Q*=e a dt, ^ 

Q ® = rdB, (3^ = rsin0d0 , 

and using a prime to indicate differentiation d/dr, 
the Einstein tensor components for this line ele¬ 
ment are 

CT=j;e- 2B a'-r- 2 (l-e- 2B ), 

d‘=Ze- 2B 0' + r- 2 ( l-e- 2B ), (J}) 

= e' 2fl (o! # + a' 2 - a'/3'+ a'r" 1 - /3'r“*). 

The curvature scalar is 

R = 2r“ 2 (l - e“ 2 fl ) + 4r“V 2fl (^' -a') 

- 2e“ 2fl (a # + a' 2 - a'ft ), (4) 

and the Bianchi identity is 

(G f 0' + a'G t7 -^G S ®+ ^a' + ^G^ r = 0. (5) 

The Abelian Higgs model describes a charged 
scalar field, with a double-well self-interaction, 
coupled to an initially massless Abelian gauge 
field. The Lagrangian density for the model, writ¬ 
ten in generally covariant form, is 

£ = (-g) 1 ' 2 v ?"“ - d*'* -1 R 101 :2 

-fc(|0| 2 -0- 2 ) 2 ], (6) 

with 
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_8A U 

Fuv ~Bx v ’ (7) 

The parameter £ is zero for the usual “minimal” 
scalar wave equation, while £ for the “con¬ 
formal” scalar wave equation which is conformal¬ 
ly invariant in the absence of mass terms. Spon¬ 
taneous symmetry breaking arises because the 
effective potential 


m=H\<p\ 2 -<t>j) 2 ( 8 ) 

has its minimum at 101= 0*, rather than at 0=0. 

Following the analysis of Bekenstein 2 in the 
similar case of charged scalar electrodynamics, 
we use the fact that in the time-independent case 
of interest in black-hole physics we can choose a 
gauge in which 0 is real, A 4 = 0, and A t is time 
independent. 5 In this gauge the field equations of 
motion which follow from the Lagrangian of Eq. 

(6) are 


j 


(e“ la+ 0 V 2 A[Y = r 2 e 0 ' a 2 e 2 A t <f > 2 , 

(e“- fl r 2 0O' = rV +0 [-e- 2 VV0 + 2fe0(0 2 -0« 2 )], ■ “minimal” 

(e““ fl r 2 0') / = r V* + 6 [-e’ 2a e 2 A 2 <t> +±R<t> + 2fc0(0 2 - 0.*)], “conformal" . 

The stress-energy tensor components in the two cases are the following for the “minimal" model: 

7* 7 = ±e ~ 2 ‘“ + 0> (A t ') 2 + e' 2B (0') 2 + e~ 2 ‘ x e % 2 </> 2 + h(<t> 2 - 0 J ) 2 , 

- -$e~ 2 l “ + fl) U t ') 2 + e“ 20 (0') 2 + e- 2a e 2 A 2 </> 2 - h(<f> 2 - <j>Jf , 

T®® = fe“ 2< “ + 0) (A ( ') 2 - e~ 2& (<f> f ) 2 + e~ 2a e 2 A 2 <t> 2 -h(<t> 2 - 0 OO 2 ) 2 , 
and the following for the “conformal” model; 


r=rs=4ft0 oo 2 (0 2 -0 oo 2 ), 


T** = - |T + fe“ 21 ° + B) (A0 2 + fe“ 20 (0') 2 + |e' 2 “e 2 A t 2 0 
+ fa'e" 2fl 00'- f0 2 fl + j0 2 G n - ±0 2 fc(0 2 - 0« 2 ) 

T* = f T - f e' 21 “ + B) (Aj) 2 + e“ 2B (0') 2 +ie“ 2a e 2 A t V 


2 


» 


- Ue~ a (e' V)' + i<*> 2 fl + 3<*> 2 G" + fA(<*> 2 - <*>„ 2 ), 

7^ = £r + |e' 2 (a+ 8) (A,') 2 - ?e' 2 + ^e~ 2a e 2 A 2 <l> 2 

- § ;*' 21 V*'+i<*> 2 * + W- *. 2 ). 


(9a) 

(9b) 


(10a) 


(10b) 


r 


In both cases these components satisfy the equa¬ 
tion of stress-energy conservation 

(T") , + a , T“-^2’ §§ + ^a' + ^T" = 0, (11) 

which determines T 00 given T rr and 7* ( . 
independent Einstein equations are then 

aa aa 

CP = 8irT*. 

We proceed now to prove a “no-hair” theorem 
for the Abelian Higgs model. We assume that the 
coupled system consisting of the vector and the 
Higgs scalar field and the spherically symmetric 
space-time geometry, described by Eqs. (1) and 
(2) above, has a horizon at r~ r H at which all 


The two 

( 12 ) 


physical scalars are finite. We show that these 
assumptions imply that the vector field A t vanish¬ 
es identically outside the horizon. Multiplying 
Eq. (9a) by A t and integrating from r H to °o gives, 
after an integration by parts, 

/" r 2 dr[e~ ‘“ + B >(A t ') 2 + 2 e a - a e 2 A t 2 <j> 2 ] 

r H 

= yl f A t 'rV<“ +8 )|». (13) 

H 

The contribution from °o to the right-hand side 
vanishes, since A t falls off asymptotically at 
least as l/r . The assumption that the physical sca- 
lar Fji v F Uv is bounded at r~ ^implies that 
e - (a+ is bounded at the horizon. Hence if A t 
= £>at r H} the right-hand side of Eq. (13) vanishes, 
and the fact that the left-hand side is non-nega- 
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tive (note that the metric components e 2a and e 20 
are non-negative outside the horizon) then implies 
A t s 0 for all r&r a . So we get a “no-hair” theo¬ 
rem unless 6 A t I H * 0. 

The remainder of the argument consists of 
showing that having A t 1**0 contradicts the as¬ 
sumption that all physical scalars are finite at 
the horizon. 7 We do this by examining the be¬ 
havior of the scalar field equation near the hori¬ 
zon. We note first of aU that in the “minimal” 
model boundedness of 7^ \ Ht and in the “confor¬ 
mal” model boundedness of T \ Ht both imply that 
the scalar field <p is bounded on the horizon. 

Hence when A t 1 a * 0 we have 



e 2a 



x (bounded) 0 > 1(14) 

H 


and the scalar field equation can be approximated 
near the horizon by 

(e a - R r 2 <t>')' + r 2 e“ +0 e-V^ = O. (15) 

It proves convenient at this point to change the in¬ 
dependent variable from r to X, with X the affine 
parameter of an incoming null geodesic. The dif¬ 
ferential equation relating X to r is 



Since t is a cyclic variable for a time-independent 


metric, the conjugate momentum P 0 is a constant 
of the motion,* and so after rescaling x to make 
P 0 = 1, the second line of Eq. (16) gives 


dr 

dx 


= (a+ 0) 


(17) 


Since the horizon must be a finite affine distance 
away from any r>r H , the value \ H of x at the hori¬ 
zon is finite. In terms of x, and making the defi¬ 
nitions 


_ „2a 

q -e 


= .-2 (a+0) 


p =e 


( 18 ) 


so that dr/dx =p 1/2 , the approximated scalar field 
equation becomes 



To proceed, we need some information on the 
behavior of q and its derivatives near the horizon. 
This can be obtained by rearranging Eq. (3) for 
the Einstein tensor components into the form 



(20a) 

& 

> 



(20b) 


(20c) 


From the boundedness at the horizon of the left- 
hand sides of these equations, and the fact that 
both terms on the right-hand side of Eq. (20a) are 
non-negative, we deduce the following: 


Eqs. (20a), (20b) 1/2 Jj 


a A 1 / 2 


=bounded *=> ~rr(p 1/2 q) 
h a A 


Eq. (20c)=> 


dcL 


= bounded , 
h dx 


- bounded. 


H 


= bounded =* p 1/2 q | H - bounded, 


H 


( 21 ) 


Hence writing 
6^qr 2 , 
we have 


de 

dX 

d 2 e 

dx 2 


( 22 ) 


I ~ r H~?\ + 2r ff qp 1/2 \ IJ =bo\inded and 3*0, 
I H «X|// 


H 


* dx 2 


dcL 


S^dX 


+ 2 r 


hQ 


dp 


1/2 


dX 


H 


+ 2pq I h- bounded, (23) 


H 


and in terms of Q the scalar field equation near 
the horizon takes the compact form 

d 2 ^ dOd^ K& 

dx 2 dxdx Q ’ 

K=e 2 r 4 A 2 1„>0. 


(24) 
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The final ingredient needed for the argument is 
the fact that boundedness of T** \ H requires 

Q~ 1( t> Z I h = bounded (25a) 

in the “minimal” model (since in this model all 
terms in 7^* are non-negative), and 


+ If] ^bounded 

in the “conformal” model, with 

K, = [q/(5e 2 A 2 )] a , K 2 = [(, dq/dX)/(5e 2 A t 2 )] 


(25b) 


H 


(25c) 


two bounded constants. The strategy of the argu¬ 
ment now is to show that Eqs. (23)-(25) are in¬ 
consistent. We consider separately the two cases 
where dQ/dX | ff >0 and where dQ/dX l* = 0. 

When dQ/dx l// = C>0, we can approximate 6 
-C(X~X H ) near the horizon, and Eq. (24) takes the 
form 


d 2 (j> 1 d<f> K <f> 

dX 2 + X-X H dX + C 2 (X-X H ) 2 “ ’ 

which has the general solution 

K 1/2 

<t>=‘(p Q cos(x+6)f * = ln(x — X H ). 


(26) 


(27) 


Hence in this case we find near the horizon 


*"V + *i 


2 

I +k 2 <t> 


d(p 

dX 


oc (x-X ff )' 1 (cos 2 x+C 1 sin 2 # 

+ C 2 sin# cos#), (28) 


which is unbounded at x H for all values of the con¬ 
stants C 12 . In the second case, when dd/dx 1* = 0, 
we make an exponential substitution <f>=e f in Eq. 
(24), giving 


e \£L + (j£Y\ + dld[ + K 

[dx 2 \dx)} dx dx 6 


(29) 


which vanishes at the horizon, justifying the as¬ 
sumption of Eq. (30). So we find in the second 
case that the two linearly independent solutions of 
Eq. (24) have the following approximate form near 
the horizon, 

t^xe^iK^fdx/e). (33) 

Both solutions are singular at the horizon, again 
giving a contradiction with our initial assump¬ 
tions. The conclusion of this somewhat lengthy 
analysis is that A t I a *0 is not allowed, and thus 
by our earlier arguments, A t must vanish identi¬ 
cally outside the horizon. That is, a black hole 
cannot support an exterior massive vector-meson 
field, even when the mass is generated by spon¬ 
taneous symmetry breaking. 

III. THE GOLDSTONE MODEL ["MINIMAL” (£=0)CASE1 

WithA t s0, Eqs. (1)-(12) of Sec. II describe the 
Goldstone model of a self-interacting scalar field, 
as generalized to curved space-time. We will 
now show that for this model in the “minimal” 

(I =0) case, a further “no-hair” theorem can be 
proved, stating that </> = for all r^r H . That is, 
outside the horizon the scalar field reduces to an 
unobservable constant, and [cf. Eq. (10a)] the sca¬ 
lar field stress-energy tensor vanishes identical¬ 
ly. Our argument does not apply to the “confor- 
mal” (| =|t) case, where the scalar field stress- 
energy tensor has a considerably more compli¬ 
cated structure than in the “minimal” case. 

The argument proceeds from the scalar field 
equation, which with A t = 0 takes the form 

(/> 1 W)' = r 2 /’- 1/2 HW, (34) 

V(<P)=H<p 2 -^ 2 ) 2 , 

and from the Einstein equations, which with A t =0 
may be rearranged to give 


Assuming 


d 2 f/dx 2 

(df/dx) 2 


= 0 , 
K 


(30) 


then Eq. (29) is simply a quadratic equation for 
df/dx , which can be solved to give 


P'-™*rpWY, (35) 

(p 1/2 qr)’ =p~ 1/2 - 8irr 2 p- 1/2 V(<l>) . 

Multiplying Eq. (34) by <p' and integrating frcm r„ 
to oo gives, after use of Eq. (35) and an integration 
by parts, 


dx e 



From Eq. (31) we get 


(31) °=/ dr[\(<y) 2 p 1/2 qr + U<l>') 2 p- 1/2 r 

r H 

*rp- l ' 2 V(<p)] 


d 2 f/dx 2 ide/dx 1 dd 2 S/dx 2 — (dd/dx) 2 
(df/dx) 2 ~ K 1/2 + 2 K ’ 

(32) 


+ k[r 2 p- U2 V(<p)] H - [/.‘V 2 ^') 2 ]* • (36) 

Since all terms in Eq. (36) are non-negative ex-, 
cept for the final one, we see that if [p 1/2 q(<t>') 2 ] H 
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= 0, then we can conclude that <p * </>„ for r ^r a> 
and the desired “no-hair" theorem follows. 

To complete the proof, we must exclude the pos¬ 
sibility [/> 1/f2 g , (0') 2 ]/f^O* Just as in the preceding 
section, this is done by a local analysis in the 
vicinity of the horizon. We begin by noting that 
since dq/dx | ff 2>0, Eq. (20a) implies 


cr+^ 

r 2 


_P 


bounded 
1/2 




r \ua rj r~ q 
q xboundedss (p 1/2 q) 2 



(37) 


Furthermore, since \ H is bounded, and since 
both terms in 7^ are positive semidefinite, we 
have that p 1/2 q 1/2 <f / \ H is bounded. Since the first 
equation in Eq. (35) implies that dp/d(-r)^ 0, 
and since/>(<>o) = l, we have/) >1, which puts the 
boundedness of p 1/2 q l/2 <p f into the form 

a 1/2^1/2 | 

p 1, Y /2 <p'\„ = P X/*y H ~ bounded. (38) 

P I H 

Suppose now that p 1/4 q 1/2 <p ' \ H = K * 0. Then 


(p 1 h~ bounded 


/ 


H 


dr^~ = bounded 
dr 



= convergent, 


(39) 


v fl(r) 

=* lim ---bounded 

f dr ^ 

=* / ® divergent, 

J r H (4 

in contradiction with Eq. (39). Hence we must 
have ^=0, which completes the proof. 

IV. AN ABELIAN HIGGS ANALOG MODEL IN 
MINKOWSKI SPACE-TIME 

As our final topic we briefly investigate a Min¬ 
kowski space-time analog of the Abelian Higgs 
model analyzed in Sec. n. We consider a sphere 
of radius r H , impenetrable to the Higgs field, and 
carrying charge Q, surrounded by the Higgs sca¬ 
lar medium. The differential equations and bound¬ 
ary conditions describing the time-independent be¬ 
havior of this system are 

(r 2 A' t )' = r 2 2e 2 A t </> 2 , 

(r V)' = r 2 [-e 2 A, 2 <p + 2h<p(<t> 2 - <t>J)] , (43) 

<Hr u ) = 0, /!«>*) = -A, 

r H 

which apart from the absence of the metric fac¬ 
tors e“, e 0 have essentially the same structure as 
the system of equations analyzed in Sec. H. How¬ 
ever, unlike the situation found in the general 
relativistic case, the Minkowski model of Eq. (43) 
has a nontrivial screened-charge solution. 9 To 
prove this, we consider the energy functional 


with 

a(r)9p u * q u *r 2 , a|„ = 0. 


(40) 


E (r„,Q) = 4ir f r 2 dr[i<A',) 2 + e 2 A t 2 <t> 2 

T H 

+ *(* 2 -*„ 2 ) 2 ] 


(44) 


But on the other hand, the differential equation 
for </> in Eq. (34) gives 

(p 1/2 q r2 <t> f )'\u~ bounded f (41) 

which on substituting <f> f *K/(p 1/4 q 1/2 ) gives 
= bounded 


and use the differential equation for A t (the charge 
conservation constraint equation) and its associa¬ 
ted boundary condition to write 

A ‘ = ’p[I r d2 ''r' 2 2e 2 A t (r')4> 2 (r')-Q^, (45) 

which when substituted into Eq. (45) gives the new 
functional, 


j 


*E{r H ,Q) = ^T, £ r2dr ^*^j r d r'r' 2 2e 2 A t (r')<p 2 (r')~Q J +e 2 A t 2 <f> 2 +h(<p 2 - <pJ) 2 J . (46) 


Extremizing *E with respect to variations in A t 
and <t> [with an endpoint condition 6<f>(r u ) =0] is 
easily verified to lead to the differential equations 
of Eq. (43). Hence these equations describe the 
field configuration which minimizes the field en- 


i- 

ergy, subject to the constraint that the inaccessible 
region r^r H contains total charge Q. 

Since the functional *E is positive semidefinite, 
and since there is a nonempty class of functions 
A t , <f> for which *E is bounded from above, func- 
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tions A t> <t> which minimize *E must exist, and 
thus the coupled equations in Eq. (43) have a solu¬ 
tion. 10 Near r = <c, the solution has the behavior 

(47) 

A, =r- 1 exp[~r/(2e 2 <l>J) 1/2 ], 

and as expected, the Higgs mechanism results in 
screening of the charge Q from view at infinity. 

The conclusion from this analysis is that the ab¬ 
sence of screened-charge black-hole solutions in 


THE ABELIAN HIGGS AND... 2803 

the general-relativistic case is a result of the 
stringent conditions for the existence of a horizon, 
not of the interacting field or spontaneous-sym¬ 
metry-breaking aspects of the problem. 
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with Spontaneous Scale Breaking 
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It is shown that in unified theories containing only fermions and gauge fields and in 
which scale invariance is spontaneously broken, radiative corrections induce an order- 
R term in the vacuum action functional which is uniquely determined by the flat-space¬ 
time theory. 


PACS numbers: 04.60.+n, ll.30.Qc, 12.20.Hx 

In recent papers Minkowski, 1 Zee, 1 and Smolin 2 have suggested that spontaneous scale-invariance 
breaking may play an important role in a fundamental theory of gravitation. The basic mechanism con¬ 
sidered by both involves an action 3 

S= f d*x{-gY n [kt<P 2 R - V(<p 2 ) +kinetic terms +other fields], (1) 


with <p a scalar field. The potential V(<p ) is as¬ 
sumed to have a minimum away from <p = 0, 

V'(k 2 ) = 0, (2) 

V n u 2 )>0, 

so that spontaneous symmetry breaking induces 
an effective gravitational action 

S^=fd' X (- g Y^R, (3) 

with € a free parameter. In this note I examine 
the analog of the above mechanism in unified the¬ 
ories which contain no fundamental scalar fields 
and in which scale invariance is spontaneously 
broken, 4 I show that in such theories the vacuum 
action functional contains an order-# term which 
is explicitly calculable in terms of the flat-space¬ 
time parameters of the theory. This result is 
basically an extension, to curved space-times, 
of the known fact that in such theories all mass 
ratios are explicitly calculable. 

Consider a theory based on a scale-invariant 


classical Lagrangian, constructed from spin-i 
fermion and spin-1 gauge fields, with the gener¬ 
ally covariant renormalized matter action 

S= f d 4 x (-g ) 1/2 (£ manEr + counter terms). (4) 

Because quantum effects induce nonlocal inter¬ 
actions with the space-time curvature, the vac¬ 
uum action functional (3)„ cannot be related to its 
flat-space-time value by the equivalence prin¬ 
ciple. Instead, we have a formal decomposition 

<S> 0 = /d 4 *(-^) 1/2 <£> 0 , 

<£>o = i> 4 ‘ 2 "<£>o (2 " ) 

n- 0 

= K%C} 0 toi +/c 2 (£>o t2i + (£)o t4i + • • ■ , (5) 

with k the unification mass of the flat-space-time 
theory and with <£> 0 (2n) homogeneous of degree 
2 n in derivatives 8* acting on the metric. Be¬ 
cause the curvature scalar R is the only Lorentz 
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scalar of order (9J 2 , the second term in Eq. (5) 
has the form 5 

<£>o (2) =M, (6) 

with £ in general nonzero. According to a cri¬ 
terion of Weinberg, 6 the coefficient will be cal¬ 
culable in the flat-space-time field theory, pro¬ 
vided that there are no possible Lagrangian count¬ 
er terms which contribute to this term. The only 
relevant counter terms of the general form 

A£ = 0 2 R, (7) 

with © 2 a gauge-invariant operator with canonical 
dimension 2. However, in a theory with no fun¬ 
damental scalars, and with spontaneous breaking 
of scale invariance (and hence no bare-mass pa¬ 
rameters), the only dimension-2 operators are 
of the form with b/ a gauge potential. 

But such operators are not gauge invariant, and 
hence no counter terms of the form of Eq. (7) are 
possible. Therefore, in the theories under con¬ 
sideration, 0 is finite and calculable (as opposed 
to merely renormalizable). 

Following 7 Sakharov 8 and Klein, 9 it is tempting 
to regard the k 2 0R term in Eq. (5) as the entire 
gravitational action, rather than as just an addi¬ 
tional finite contribution to the gravitational ac¬ 
tion. This interpretation is clearly justified if 
the unified matter theory predicts the correct 
sign and magnitude of the Einstein action and if 
the virtual integrations contributing to 0 are dy¬ 
namically cut off at energies well below the 
Planck mass. If the virtual integrations extend 
beyond the Planck mass, then use of the semi- 
classical, background metric analysis given 
above requires further justification or correc¬ 
tions, involving an analysis of possible quantum 
gravity effects. 10 

Added notes .—Calculations by Hasslacher and 
Mottola, 11 Mottola, 11 and Zee 11 in models obeying 
the premises of this note all give a nonvanishing 
induced order-I? term, and show that the sign 
can correspond to attractive gravity, 

Guo has brought to my attention a number of 
further references on I? 2 -type gravity Lagran- 
gians. 12 In particular, it is known that a C llvXa 
x qiiv\o g rav ity theory is renormalizable, but 
has a dipole ghost. Hence the extended matter- 
gravity theories discussed in Ref. 10 of this note 
are renormalizable; they could also be unitary 
(by the Lee-Wick 13 mechanism) if scale-sym¬ 
metry breakdown causes the dipole ghost to split 
into a single positive-residue graviton pole at k 2 
= 0, and a pair of complex-conjugate unstable 


ghost poles at k 2 =M±iT (with M and T of order 
the unification mass). Detailed dynamical studies 
of the extended matter-gravity theories will be 
needed to settle the unitarity issue. Tomboulis 14 
has given an interesting model with a dynamical 
Lee-Wick mechanism, and I wish to thank him 
for a discussion about this point. 

Because dimension-4 and dimension-0 opera¬ 
tors are always available (e.g., £ matter and the 
gauge-field bare coupling, respectively), the 
arguments of this note do not apply to the order- 
(0) or -(4) terms in Eq. (5). Hence, even in scalar- 
free theories with spontaneous scale-invariance 
breaking, the cosmological constant contains re¬ 
normalizable infinities. An additional symmetry, 
very likely relating the boson and fermion sec¬ 
tors of the theory, will be needed to give a calcu¬ 
lable cosmological constant. L. S. Brown and 
J, C. Collins point out that because dimension-0 
operators are available, the induced R 2 term in 
the vacuum action can become divergent in high 
loop order; if this happens, a quadratic gravita¬ 
tional Lagrangian must include an R 2 term in 
addition to the term C flvXa C flI/Xo discussed above. 
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Gross, H.-Y. Guo, B. Hasslacher, E. Mottola, 

M, J. Perry, and T. Tomboulis for helpful com¬ 
ments. This work was supported by the U. S. 
Department of Energy under Grant No. DE-AC02- 
76ER02220. 
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I derive a formula for the induced gravitational constant in unified theories with dynamical scale-invariance breakdown. 


In a recent note [1] I gave a simple argument showing that in unified theories with dynamical scale-invariance 
breakdown, radiative corrections in curved spacetime induce an order-/? term in the vacuum action functional 
which is uniquely determined by the flat spacetime theory. Subsequent calculations by Hasslacher and Mottola 
[2], Zee [3] and Mottola [4] in models obeying the premises of ref. [1] all give a non-vanishing induced gravita¬ 
tional constant, and show [2] that its sign can correspond to attractive gravity. Their calculations also suggest 
that it should be possible to do the curved spacetime manipulations in a formal way, giving an explicit formula 
for the induced gravitational constant involving flat spacetime quantities only. I derive such a formula below; it 
leads to a less abstract derivation of the basic finiteness theorem of ref. [1 ], and should provide a useful starting 
point for dynamical calculations. 

Consider a theory based on a scale-invariant classical lagrangian, constructed from spin-1/2 fermions and spin-1 
gauge fields, with the generally covariant renormalized matter action 

S-fd^xV^g ^matter. Matter = -^matter + counter-terms . (1) 


Because quantum effects are non-local, the vacuum action functional (S)q in curved spacetime cannot be related 
to its flat spacetime value by the equivalence principle. Instead, we have a formal decomposition +1 

<£>0 ~ J d 4 x \f^g {£) 0 , {£) 0 = (-£/g at s P acetime + ^ 67rG ind )“ 1 R + terms involving higher metric derivatives, 

( 2 ) 

with the induced gravitational constant. I use sign conventions * 2 in which a positive induced gravitational 
constant corresponds to attractive gravity. Defining the renormalized matter stress—energy tensor 

= 2(-g)-l« rf - K-*)V2 1 , (3) 

Eq. (2) can be rewritten as a formula for (T ^)o ,' 

(7^> 0 = s P acetime + (8 ttG^^R + terms involving higher metric derivatives. (4) 

To get a formula for G^, it suffices to calculate the change in <7^> 0 induced by spacetime curvature, in the 
special case of a conformally flat, constant curvature spacetime. For a general lagrangian variation, we have * 3 


* 1 In ref, [1], I denoted (lb^G^)" 1 by) 3k 2 . 

* 2 I use the conventions of Misner et al. [5], 

Fq, (5) is obtained from eq. (17.22) of Bjorken and Drell [6] by making the replacements 9t\{t) -+ / d 3 x 

Y. 8 £, and is independent of metric conventions. Neglect of the intrinsic metric dependence [I take 5 7^(0) = 0] is allowed in 

a calculation to order-7?, since this terms first contributes in order-/? 2 . 
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«<V(0)> 0 = i f d4x W [(-g) 112 £(*)] M( 0))> 0 - (5 [(- g yi 2 £(*)] ) 0 (r M m (0 )>} 

= i/d4*<7X6[(^)l/2£(*)]r/(0))> 0jCOimected . (5) 

Taking the metric to have the form near x of 4=4 

*„„(*) = - uRx 2 + •••) , (6a) 

and taking 5 £ to be the lagrangian change induced by the metric variation 

Sg^(x) = “ V • 57 Rx 2 , (6b) 

we immediately get [by a second use of eq. (3)] 

(SrrG^ylR = 5 (f^(0)) Q = i f d*x (-g)^ 2 (-^Rx 2 )(T0 x \x)T^ (0))> 0> connected . (7) 

Dividing by R , and taking the limit of flat spacetime, gives the desired formula 

(16^)-!= f 6 fd4 X [(*0)2 _ *2] (T(f x Hx)%m)yZonntTZ e ■ (8) 

To study the ultraviolet convergence properties of eq. (8), it is necessary to regard eq. (8) as the dimensional 
continuation limit 

(lens*,,)-! = ^ to fd"x [(*0)2 -*2 ] {T (f x H X )%Hm■ (9) 

From a perturbative point of view [7], the only way poles at n = 4 can appear is from terms of the form 
(nr x H*)r/(0))> JJSSr = ... + <0 2 > JX (* 2 )~ 3 X logarithms + ..., (10) 


in the perturbative operator product expansion of the T-product, with 0 2 a gauge-invariant operator of canonical 
dimension 2. But the hypothesis of dynamical scale invariance breakdown (vanishing bare masses, no scalar fields) 
excludes the presence of such operators, and so the limit of eq. (9) as n -► 4 is finite. From a nonperturbative 
point of view, the (x 2 )“ 3 term in the expansion of eq. ( 10 ) is altered, in theories with dynamical scale breaking * 5 , 
to either of the forms 

(x 2 )~ 3+ y , 7 >0 ; (* 2 )- 3 (log* 2 )- s , 5>1, ( 11 ) 

for both of which the limit of eq. (9) exists as n -► 4. Note that although the classical lagrangian of eq. (4) is con¬ 
formally invariant, dynamical scale breaking introduces a mass scale into the theory, and so low energy matrix 
elements of T^(x) will be non-vanishing. Hence eq. ( 8 ) gives an ultraviolet-convergent, and in general non-vanish¬ 
ing, induced inverse gravitational constant. 

I have assumed up to this point that eq. ( 8 ) is infrared finite, as is true in the calculation of Zee [3]. In the in- 
stanton gas model examined by Hasslacher and Mottola [2], the leading curvature term in (£> 0 is of the form 
R log 0/R), indicating that eq. ( 8 ) is logarithmically divergent at x = °°. The divergence arises from expanding the 

* 4 The local expansion of a general conformally flat metric is 

~ T l/j,^(l + ...)j ~ + * 

Eq. (6a) results from making the specialization 
* 5 See Pagels [8] for a review of models of dynamically broken gauge theories. 
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conformal factor in a power series in x inside the x integral [cf. eq. (6a)]; the legality of this expansion is not 
guaranteed by the ultraviolet-finiteness criteria of ref. [1 ]. 

One can attempt to carry the argument one step further, by using dispersion relations to put eq. (8) in spectral 
form. Defining 


H9 2 ) = /d 4 * eW• *(-0 (T(T X * (x)T^ m(0))> C , 



and assuming that i//( q 2 ) — i//(0) satisfies an unsubtracted dispersion relation, a simple calculation * 6 gives 


( 16 J rG ind )-l=-L 


j dcr 2 p(cr 2 )/cr 4 , 
0 


p(-q 2 ) = (27T)3£54(p n _ (? )|( 0 |r M M( 0 )|«>| 2 

n 



Since in canonical gauges the Hilbert space metric is positive definite, eq. (13) implies that is negative. How¬ 
ever, both eq. (13) and this conclusion about the sign of are false. The reason is that in gauge theories T v 
contains [9] a trace anomaly term proportional to j3(g 2 )Ar(F^^^), which makes the spectral function behave 
asymptotically as p ^ a 4 X logarithms, invalidating the unsubtracted dispersion relation assumption needed to de¬ 
rive eq. (13). Although the trace anomaly gives rise to a singular term proportional to (3 2 (x 2 )“ 4 X logarithms in 
the operator product expansion of eq. (10), the contribution of this term to eq. (9) is well behaved in the n -► 4 
dimensional limit. 


I wish to thank B. Hasslacher, L. Brown, E. Mottola and A. Zee for numerous discussions, and E. Mottola for 
checking the calculation. A. Zee has independently obtained some of the formulas given above, and suggested the 
role of a 4 terms in p in the breakdown of the spectral analysis. This work was supported by the Department of 
Energy under Grant No. DE-AC02-76ER02220. 

Eqs. (14) and (15) follow immediately from eqs. (16.33), (16.27) and appendix C of Bjorken and Drell [6]. 
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Einstein gravity as a symmetry-breaking effect 
in quantum field theory' 

Stephen L. Adler 

The Institute for Advanced Study, Princeton, New Jersey 08540 

This article gives a pedagogical review of recent work in which the Einstein-Hilbert gravitational action 
is obtained as a symmetry-breaking effect in quantum field theory. Particular emphasis is placed on the 
case of renormalizable field theories with dynamical scale-invariance breaking, in which the induced 
gravitational effective action is finite and calculable. A functional integral formulation is used 
throughout, and a detailed analysis is given of the role of dimensional regularization in extracting finite 
answers from formally quadratically divergent integrals. Expressions are derived for the induced gravita¬ 
tional constant and for the induced cosmological constant in quantized matter theories on a background 
manifold, and a strategy is outlined for computing the induced constants in the case of an SU(«) gauge 
theory. By use. of the background field method, the formalism is extended to the case in which the 
metric is also quantized, yielding a derivation of the semiclassical Einstein equations as an approximation 
to quantum gravity, as well as general formulas for the induced (or renormalized) gravitational and 
cosmological constants. 
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I. INTRODUCTION 

In the conventional formulation of general relativity, 
gravitation is described by rewriting the matter action in 
generally covariant form, and by adding to it the 
Einstein-Hilbert gravitational action 

Vv=-^JV* v/3 i ( *- 2A) , (in 

with G Newton's constant and R the curvature scalar, 
and with the cosmological constant A taken to be zero. 
The total action is then treated as a classical variational 
principle, to be extremized with respect to variations of 
the c-number metric g MV . As discussed in the survey ar¬ 
ticles in Hawking and Israel (1979), the theory in this 
form accounts very well for all astronomical gravitation¬ 
al phenomena and has a structure which is understood in 
considerable theoretical detail. On the other hand, when 
treated as a fundamental quantum action, Eq. (1.1) leads 
to a nonrenormalizable quantum field theory. This prob¬ 
lem has long been known, and has stimulated much 
theoretical effort aimed at achieving a satisfactory quan¬ 
tization of the Einstein-Hilbert action or its supergravity 
extensions. [For reviews of the current status of these 
approaches, see Hawking and Israel (1979) and Van 
Nieuwenhuizen (1981).] 

An entirely different approach to quantum gravity 
derives from work by Zel’dovich (1967) and Sakharov 
(1967) on induced quantum effects. Zel’dovich studied 
the effect of vacuum quantum fluctuations on the 
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cosmological constant; extending this idea, Sakharov pro¬ 
posed that Eq. (1.1) is not a fundamental microscopic ac¬ 
tion, but rather is an effective action induced by vacuum 
quantum structure (see also Klein, 1974). To quote the 
two key sentences from Sakharov’s paper, “The presence 
of the action (1) [Eq. (1.1)] leads to a metrical elasticity 
of space, i.e., to generalized forces which oppose the 
curving of space. (TT) Here we consider the hypothesis 
which identifies the action (1) with the change in the ac¬ 
tion of quantum fluctuations of the vacuum if space is 
curved.” Sakharov’s proposal attracted attention from 
the outset (see Misner et aL, 1970), but further progress 
was hampered by the fact that in the free field models 
for which he made his estimates, the induced gravita¬ 
tional constant is given by integrals which contain 
both quadratic and logarithmic divergences. It is only in 
the last few years that the technology of quantum field 
theory has advanced to the point where one can sys¬ 
tematically study induced quantum effects in interacting 
field theories. These advances, and their application to 
induced Einstein gravity, are the subject matter of this 
review. 

Since the topics discussed below span the areas of 
high-energy physics and relativity, in which different no- 
tational conventions are generally used, I have adopted 
the following compromise with respect to notation. I use 
microscopic units throughout, 

¥ l=c = 1, (1.2) 

so the only dimensional quantity is mass = (length)” 1 . 
The coordinates are taken to have the dimension 
(length) 1 , making the metric g^ dimensionless. In all 
flat space-time examples and discussions, I use the 

A -signature convention of Bjorken and Drell 

(1965), while in all expressions which involve a curved 
manifold I follow the —h + + convention of Misner 
et aL, (1970). In the few places where it is necessary to 
change from one convention to the other, I will explicitly 
call attention to the transition. 


II- FIELD THEORY PRELIMINARIES 

A. Actions and canonical dimension accounting 

The functional integral formulation of quantum field 
theory (see Abers and Lee, 1973) expresses transition am¬ 
plitudes in the form 

Z= , 

S[{+]\=Jd*x se[{+]\ , ( 2 .D 

with (^} the set of fields present, S' the action, and J? 
the Lagrangian (or action) density. Since the argument 
of an exponential or a logarithm must be dimensionless, 
in the conventional accounting of canonical dimension in 
which 


dimflength] = — 1, dimfmass] = + 1 , (2.2) 

we have 

dim[S]=0, dim[d 4 Jt] = — 4 

=*>dim[^]=4 . (2.3) 

From Eq. (2.3) we can infer the canonical dimensionality 
of the fields and parameters from which elementary re- 
normalizable matter theories are constructed. For a 
scalar cp 4 field theory we have 

■ (2.4) 

with m 0 the bare mass and k 0 the bare coupling, and 
with 

dim[3^ = d/dx^]^= 1 =>dim[<p] = l , 
dim[m 0 ]= 1 , 

dim[A 0 ] = 0. (2.5) 

For a spin-1 Abelian gauge field (the photon) we have 
„ i. p F 

— 4 1 fAV* > 

F JJlv = d v A JJl '-d JJl A v , (2.6) 

with 

dim[F JUV ] = 2 , 

dim[,4 JU ] = l, (2.7) 

while for a spin-y Dirac field we have 

^ = , (2.8) 

with 

dim[t/f] = y . (2.9) 

Minimal coupling of the photon to a Dirac field or a 
complex scalar field with bare charge e 0 yields the La¬ 
grangian densities for quantum electrodynamics, 

~^ qedi / 2 = m 0 )tp , 

^QED0= - T v* v + tI^VpI 2 - Wo I <P\ 2 - 7*0 | <p \ 4 , 

^ ~ "I” > 

=>dim[e 0 ]=0 . (2.10) 

For a spin-1 non-Abelian gauge field (the massless gauge 
gluon) we have 

9’z=— — F i F l l* v 

4 f /»V 1 > 

f; v = a v A ; - vi i +gof ,Jk A (2.ii) 

with i the internal symmetry index, f IJk the group struc¬ 
ture constants, and g 0 the bare coupling constant. 
Minimal coupling of the gauge field to a Dirac field in 
the fundamental representation (with representation ma¬ 
trices \k!) gives the basic Lagrangian density for quan¬ 
tum chromodynamics, 
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-^qcd= - T F i ltv F i » v +if,(ir'*D lt —m 0 )xp , 

D v.= dp + igoT^K > ( 2 . 12 ) 

from which we infer the dimensional assignments 
dim[F]| (v ] = 2 , 

dim[^J,] = l, dim[^i] = \ , 

dim[g 0 ] =0 . (2.13) 

All of the field theory models currently under study as 
candidates for unified matter theories [for reviews see 
Marciano and Pagels (1978); Fritzsch and Minkowski 
(1981)] are combinations of scalar, Dirac, and gauge field 
action building blocks of the basic types enumerated 
above. The characterizing feature of all such renormal- 
izable actions is that their coupling constants (A 0 ,e 0 ,g 0 
above) have canonical dimension zero. 

B. Effective actions 

Consider now a renormalizable field theory with ac¬ 
tion 

S[[^M^)]= $d\ J?[\<j> L },\<l> H }] , (2.14) 

where \(j> L } are “light” field components whose dynam¬ 
ics we directly observe, while are “heavy” field 

components which influence the dynamics of the light 
components but are not directly observable. The \<f> H } 
can in general include fields with high physical masses 
and high-momentum 1 components of fields with light 
physical masses. Since the are hidden from view, 

it is convenient to rewrite the functional integral of Eq. 
(2.1) in the following form, 

z= f d\<t> L }d\<t> M }e is[ ^-^ i 

= f d{<l> L }e iSe!fl{ * L]] , (2.15) 

where the effective action S cff [ ] for the light fields is 
defined through 

J d ^« ]e isU* L )A* H U . (2.16) 

Clearly, the effective action, if exactly known, would 
give a complete description of the dynamics of the fields 
{ (j > L }. In practice, one usually works with only an ap¬ 
proximation to S eff , obtained by keeping leading terms in 
an expansion in a small parameter. Some examples of 
commonly used effective actions are as follows. 

1. The Heisenberg-Euler (1936) effective action in quantum 
electrodynamics (see also Schwinger, 1951). 

Integrating out the electron fields in quantum electro¬ 
dynamics gives an effective action describing the non- 

J I wish to thank B. Holdom for suggesting the inclusion of a 
momentum criterion in the definition, as a way of automatical¬ 
ly including renormalization effects arising from overlapping 
divergences. For recent discussions of effective actions, see 
Weinberg (1980a) and Ovrut and Schnitzer (1980,1981). 


linear interactions of photons, 

e ' S eff [ V' = J d [^j e iS QEDI/2nV'^l ) (2.17) 

For field strengths which are slowly varying over an 
electron Compton wavelength, S eff can be approximated 
by taking F liV = constant, which gives a problem which 
can be solved in closed form. For weak, slowly varying 
fields (on a scale of an electron Compton wavelength), 
Sgff can be approximated by the first two terms in a 
series expansion 

S e[ dF MV l= f d 4 x Jf'ff , 

Jf ea = f (E 2 -tf 2 )+ ( E 2 -H 2 ) 2 

45m +7 (E-H) z ]+ ■ ■ ■ , | (2.18) 

with E,H the electric and magnetic fields, a the fine- 
structure constant, and m the electron mass. If inter¬ 
preted as a fundamental action and used (or, rather, 
misused) beyond the tree-approximation level, Eq. (2.18) 
would yield a nonrenormalizable perturbation expansion 
in powers of the dimensional effective coupling 
2a 2 /45m 4 . 


2. The four-fermion effective action approximation 
to the Weinberg (1967)-Salam (1968) weak interaction 
theory 

At center-of-mass energies well below 100 GeV, the 
weak interactions are described by a current-current 
four-fermion effective action 

S eff [(fermions)]= / d 4 x( -?T n ') . 

^eff arged = +AxA hk +JA ) , 

^“=^G F (jAjnXjnX + Jnt.) , 

7ch =ey k (l-y s )v e +fi,T terms , 

A = «rx( 1 — YiKd cos 8 C +s sin 6 C ) 

+charm terms , 

jn = -\eY\l-y 5 )e + -jv e rHl-r 5 )v e 
+ 2sin 2 6wey k e +[i,t terms , 

J„= \uyHl — y 5 )u — ~dy\l—y 5 )d 

— 2 sin 2 0^( \uy k u — \dy k d ) 

+ strange, charm terms , (2.19) 


with e,v e ,u,d y s the electron, electron neutrino, and up, 
down, and strange quark fields, respectively, and with d c 
and 6w the Cabibbo and Weinberg angles. 2 Since the 
fermion fields have dimension y, the Fermi constant Gp 
has dimension —2, and has the empirical value 



1.023 X 10“ 5 


m 


2 

pro! on 


( 2 . 20 ) 


2 For a recent review of the phenomenology of the Weinberg- 
Salam model, see Kim et al. (1981). 
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As expected for a theory with a dimensional coupling 
constant, the use of Eq. (2.19) as a fundamental action 
leads to a nonrenormalizable perturbation expansion in 
Gf. This difficulty is resolved in the Weinberg-Salam 
gauge theory, in which in addition to the fermions, the 
fundamental action contains gauge and Higgs boson 
fields, and which has a renormalizable perturbation ex¬ 
pansion in the gauge boson couplings g,g'. When the 
boson fields are integrated out, according to 


a _ 17 _ 

3 A i u> r juv—^3 r y,v > 

t p=z\ /2 y, 

e a =Z) / 2 e, m 0 = Z m m ; (2.24a) 

F liv F< iV =Z 3 F r llv F r ' iv , 

b ,li e 0 A IJI if>=Z l Vr^eA ' x[> r =Z 2 Z e ' n Ztp r , 


^/S e ff[{ fermions) \ 

= /rf|bosons| e ' S “*- Satam[|femions) ’ |b “° ns)1 . (2.21) 

the effective action of Eq. (2.19) is obtained as a leading 
approximation, with the Fermi constant related to the 
electric charge e, the charged gauge boson mass M w , and 
sin0jp by 

1 r g 2 

V2 F mlr ’ 

( 2 . 22 ) 

_ 6 
^ sin 0jy 

Let us now return to gravitation. The action of Eq. 
(1.1) contains dimensional couplings G~ l and A, 


dim[G 1 ]=dim[A] = 2 , 

^ ~ 1/2 = m Planck — 1*22 X 10 19 GeV — /f t * nck , 

(2.23) 

/pianck = 1.62X 10” 33 cm , 

| A I <10~ 57 cm- 2 , 

and, as expected for the case when the couplings are not 
dimensionless, leads to a nonrenormalizable quantum 
field theory. The viewpoint of this article will be that 
the gravitational action is not a fundamental microscopic 
action, but rather is a long-wavelength effective action 
similar to the ones discussed above. The fundamental 
action will be assumed to be renormalizable, and condi¬ 
tions on it will be formulated which guarantee that the 
effective gravitational action is calculable in terms of 
parameters of the microscopic theory. 

C. Renormalizabflfty and the dimensional algorithm 

In quantum field theory, one in general encounters 
divergences when evaluating radiative corrections. In re¬ 
normalizable field theories, all divergences can be elim¬ 
inated by making divergent rescalings, or renormaliza¬ 
tions, of a finite number of parameters of the theory, 
which cannot be calculated from first principles but are 
replaced by measured values at the end of the calcula¬ 
tion. 

For example, in spin-y quantum electrodynamics, 
working for simplicity to one-loop order, one introduces 
renormalization constants Z it2 ,3,m,e > renormalized fields 
Ap, F' v , xp r , and renormalized charge and mass parame¬ 
ters e,m by writing 


tpm 0 ip=Z 2 Z m tp r nup r . (2.24b) 

From Eq. (2.24b) and the Ward identity (which is de¬ 
rived from current conservation) one learns that 

Zl n Z\ n = ^-=\ , (2.25) 

leaving as the independent renormalization constants Z e , 
Z m , and Z 2 . Thus the renormalization procedure calls 
for the bare e 0 , m 0 , and xp to be adjusted to absorb all 
divergences, leaving finite e, m, and xp r to be identified 
with the measured values. To understand why e, for ex¬ 
ample, cannot be calculated, let us recall that in one-loop 
order, the divergence in Z e has the form 

2 

Z e = ] + ~logM 2 +0{al), a 0 =^~ , (2.26) 

37T 47T 

with M a massive regulator. Under rescalings M—►J'M 
of the regulator mass, Z e changes to 


Z,->l + -?-logM 2 +^-Iog| 2 + 0(a^) , (2.27) 

37r 37T 


and hence the finite part of Z e is regulator-scheme 
dependent. As a result, the finite quantity e extracted 
from the divergent bare charge e 0 remains a free param¬ 
eter of the renormalized theory. In general in a renor¬ 
malizable field theory, we expect to find one free renor¬ 
malized coupling or mass parameter for each bare cou¬ 
pling or mass appearing in the unrenormalized Lagrang- 
ian density. 

Continuing for the moment to work to one-loop order, 
the renormalization procedure given in Eq. (2.24b) for 
the various dimension-four terms in the action density 
can be rewritten in a compact matrix notation, 

m=[z][v r ]. 




F F& v 

1 pv 1 





tyy**iif 

ipyPeoApip 

. m= 


xfrm 0 xp 


4> r m xp r 


Z 3 0 0 0 

0 Z 2 0 0 

0 0 Z x 0 

0 0 0 Z 2 Z m 


(2.28a) 


Beyond one-loop order, the renormalization constants as¬ 
sociated with the action density terms F /IV F /IV ,... are no 
longer simply products of the renormalization constants 
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for the individual field, charge, and mass factors intro¬ 
duced in Eq. (2.24a), and the action density terms them¬ 
selves will mix under renormalization. The appropriate 
generalization of Eq. (2.28a) then takes the form 







(tpm 0 x(;) r 


(2.28b) 


with ['P] as in Eq. (2.28a), with CF JUV F /iV ) r , . . . , the re¬ 
normalized action density terms, and with [Z] a nondi¬ 
agonal renormalization matrix. 

As we have seen from the above example, in the gen¬ 
eric case multiplicative renormalization takes the more 
general form of matrix multiplicative renormalization. 
The set of operators which can mix under this renormal¬ 
ization process is characterized by the following rule. 

The dimensional algorithm [see Weinberg (1957), Zim- 
mermann (1970), and Brown (1980)]. A composite 
operator in quantum field theory is defined (up to a con¬ 
stant factor) as the product of any number of fields or 
field derivatives at the same space-time point. The di¬ 
mensional algorithm states: (i) The most general basis 
set of composite operators which can mix under renor¬ 
malization are the polynomials of the same canonical di¬ 
mension, and of the same symmetry type (spatial and 
internal) formed from the bare fields, the bare masses, 
and 0/0JC**. (ii) The Lagrangian density for a renormal- 
izable field theory must contain a complete basis set 
(apart from total derivatives) of Lorentz- and internal 
symmetry-invariant composite operators of canonical di¬ 
mension four. 


Let us illustrate the dimensional algorithm in the flat 
space-time cases of scalar <p 4 theory, QEDy, and QCD, 
and then use it to deduce additional Lagrangian counter¬ 
terms which must be added to assure renormalizability 
when these theories are embedded in a curved back¬ 
ground manifold. 


1. Scalar <p A theory in flat space-time 

Excluding total derivatives, the only dimension-four 
composites even under q>~*■— q> (the internal symmetry 
of the model) are 

dpqpdPq?, m^q) 2 , cp 4 , (2.29a) 

mt . (2.29b) 

The operators of Eq. (2.29a) are just the ones appearing 
in the Lagrangian density of Eq. (2.4), while in flat 
space-time Eq. (2.29b) is an irrelevant constant which 
can be dropped. 

2. QED j and QCD in flat space-time 

For QED y, the only dimension-four composites (ex¬ 
cluding total derivatives) are 


F liV F> ,v , , 

(2.30a) 

mo , 

(2.30b) 

mlA^A*, A^A", (d M A M ) 2 . 

(2.30c) 


The operators of Eq. (2.30a) are just the ones appearing 
in the Lagrangian density of Eq. (2.10), while in flat 
space-time Eq. (2.30b) is an irrelevant constant. The 
operators of Eq. (2.30c) are Lorentz scalars, but are not 
invariant under the internal symmetry (or gauge) 
transformation 

, (2.31) 

and hence do not appear in the Lagrangian density. For 
QCD the classification of gauge-invariant Lorentz scalar 
operators constructed from the bare fields is 
analogous—one simply adds an internal symmetry index 
i, and changes the definition of the covariant derivative 
as in Eq. (2.12). A careful proof that A m ,-A m/ is not an 
internal symmetry invariant in the non-Abelian case, tak¬ 
ing account of the complexities introduced by gauge¬ 
fixing and ghost terms, is given in Appendix A. 

3. Additional Lagrangian density terms 
in a background curved space-time 
(Brown and Collins, 1980) 

When spin-0, spin-y, or gauge spin-1 matter fields are 
quantized on a curved background manifold with metric 
gjuv, the action takes the form 

WU„*] = / d 4 xv^jr[ft},g MV ], (2.32) 

with d 4 x V —g the invariant volume element, and with 
J? a scalar with respect to general-coordinate transfor¬ 
mations. According to the dimensional algorithm, J? 
must contain all scalar dimension-four polynomials 
which can be formed from the bare fields (including now 
gjuv), the bare masses, and d/dx**, and which are invari¬ 
ant under the internal symmetries of the matter fields. 
The terms which can thus appear in <¥ are easily 
enumerated, and may be conveniently grouped into the 
following four classes: (i) The generally covariant tran¬ 
scriptions of the Lagrangian densities of Eqs. 
(2.4) —(2.13), obtained in the usual manner by replacing 
ordinary derivatives 0^ by covariant derivatives with 
respect to the background metric, (ii) The bare mass 
terms m% of Eqs. (2.29b) and (2.30b), which contribute to 
the cosmological constant on a curved manifold, 3 as well 


3 The terms and mlR, which appear in ¥ multiplied by 
independent renormalization constants, may be considered, 
respectively, as the bare cosmological constant Aq/Go and the 
bare order-Lagrangian density R/Gq. Prior to the discus¬ 
sion of the cosmological constant in Sec. VI.C, we shall not in¬ 
troduce bare parameters Aq,Go when not required to do so by 
the presence of dimensional parameters in the microscopic 
matter action. 
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as corresponding regulator mass terms M 4 if massive re¬ 
gulators are employed, (iii) Terms of first degree in the 
Riemann curvature tensor, 4 

<? 2 R , (2-33) 

with $2 a. general-coordinate — scalar and internal sym¬ 
metry-invariant operator of canonical dimension two. 
The allowed forms for & 2 are 3 

ml, M 2 , <p 2 , (2.34) 

since as shown in Appendix A, A M ,A MI is excluded by 
gauge invariance. The differential ope rator & 2 =V is 
omitted from the list because V r —g is a total 

derivative, and # 2 =V /£ A M , with A M an Abelian gauge po¬ 
tential, is omitted because it is not gauge invariant. 
Moreover, as is also shown in Appendix A, the operator 
tp 2 is excluded by supersymmetry invariance when <p is a 
spin-0 partner of a massless supermultiplet. 5 (iv) Terms 
of second degree in the Riemann curvature tensor, 

&=R liV i J ,R livKa -4R a ( 1 R aP +R 2 , 

#*=C llvKa C l,vKa , JT=R 2 , (2.35) 

with & the Gauss-Bonnet density and the Weyl 

conformal tensor, which in four dimensions has the form 

^'juvXo- = RixvXa T^Sfik^vtT Sfur^vk Svk^jur 4" SvffRjik ) 

^^(SjikSva SjiaSvk^ * (2.36) 

The results of this enumeration can be summarized in 
the following lemmas: 

Lemma 1. For a general renormalizable matter field 
theory (spin-0+spin-y + gauge spin-1 fields) in curved 
space-time, quantized in a manner which respects all 
gauge and supersymmetry internal symmetries, the La- 
grangian density terms proportional to R are of the fol¬ 
lowing types, 

A 

m^R, mo =a bare mass , 

M 2 R, M=a massive regulator , (2.37) 

tp 2 R, tp= a spin—0 field not a member 
of a massless supermultiplet . 

Lemma 2. If there are no bare masses or massive regu¬ 
lators and if all spin-0 fields belong to massless super- 


4 No additional counterterms of first order in the curvature 
tensor can be formed by using the Ricci tensor R Mv , since these 
must have the form with a rank-two symmetric 

tensor of canonical dimension two. The only possibilities are 
#% v =V tt A v +V v A tt , which can be reduced to by in¬ 

tegration by parts and use of the Bianchi identity 
V /i R /iV = yV v R, and ^S v =^ 2 g # * v , which is equivalent to Eq. 

(2.33) of the text. Similarly, no additional counterterms can be 
formed by using the Weyl conformal tensor C^j^, and so the 
enumeration given in the text is complete. 

5 See Fayet and Ferrara (1977) for a discussion of supersym¬ 

metry field representations. 


multiplets, then there are no terms in proportional to 
R —that is, terms (iii) above are absent. Moreover, when 
these conditions are satisfied, terms (ii) above are also ab¬ 
sent, and the structure of reduces to 

— -^maitcrf { 4*} >£juvl "b-^gravfSjuv] » 
-^nav^o^+^o^+CoJT , (2.38) 

with .^niaitert W’Sjuv] the generally covariant transcrip¬ 
tion of the flat space-time matter Lagrangian density 
The splitting of Jf into the “matter” and 
“gravitational” parts given in Eq. (2.38) is unique, since 
in the absence of dimensional constants ^ mat ter an d 
Jf grav satisfy 

-^maitcrt (0] >£juvl = 0 j 
-^gravt^juv] =0 > -ST matter 

(2.39) 


D. Conditions for calculability 
of the gravitational effective action 

We are now ready to return to a discussion of the 
gravitational effective action induced by quantized 
matter fields on a curved background. Following Eq. 
(2.16), we define the gravitational effective action by 

e ' s effl* /i v) = J*e'^'W’V 1 . (2.40) 

Since S eff is a scalar under general-coordinate transfor¬ 
mations, it may be represented as the integral over the 
manifold of a scalar density, which for slowly varying 
metrics can be formally developed in a series expansion 
in powers of dkKnv* 6 

S'efftg/.v] = / d 4 xV^gjf' f{ [g IJlv ] , 

-^eff [*„ v I= -^eff[*„v ]+■**«§W ] + O[ () 4 ] , 

-2drte„v]= I6lr G ind (-2Aind) ’ 167rG . nd R ■ 

(2.41) 

What are the conditions for GjUj and A ind to be uniquely 
calculable in terms of the renormalized parameters of the 
flat space-time matter theory? Clearly, if the fundamen¬ 
tal action contains terms proportional to R , 

then the finite renormalization ambiguities arising from 
these terms will produce an undetermined finite contri¬ 
bution to Gjnd 1 ; in this case the induced gravitational con- 


6 In renormalizable theories, massless particle loops in general 
give rise to logarithms of dxgfiv in the (d*g MV > 4 terms (that is, in 
the curvature-squared terms) of Eq. (2.41). For example, the 
existence of a conformal trace anomaly proportional to in¬ 
dicates t he pr esence of an effective action term proportional to 
f d 4 x V^g 
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stant is renormalizable, but not calculable. On the other 
hand, if no terms proportional to R appear in 
•S'tW.g/.vL then CiJ will be calculable, since there will 
now be no source of ambiguity proportional to R. 1 In 
this case the theory will yield a uniquely determined fin¬ 
ite value for G^ 1 . So we have the following result: 

Theorem [Adler (1980a)]. Under the conditions of lem¬ 
ma 2, a quantized matter theory in a curved background 
produces a calculable induced gravitational constant 

Gj. 

Consider next the induced cosmological constant A ind , 
which appears in the effective action in the dimension- 
four combination Aj nd /Gj nd . Ambiguities in A illd can 
arise only from dimension-four terms in the flat space- 
time limit of which are not determined by 

the renormalization conditions on the flat space-time 
matter theory. The decomposition of Eqs. (2.38)—(2.39) 
guarantees that no such additional dimension-four terms 
are present, and so we can conclude: 

Theorem. Under the conditions of lemma 2, a quantized 
matter theory in a curved background produces a calcul¬ 
able induced cosmological constant A^, and so the en¬ 
tire effective Einstein-Hilbert gravitational action is cal¬ 
culable. 

The basic theorems just stated give sufficient condi¬ 
tions for the finiteness of the induced gravitational ac¬ 
tion. Of the three conditions in lemma 1, two—the ab¬ 
sence of bare masses, and of scalar fields not in massless 
supermultiplets—are also necessary conditions. Howev¬ 
er, the exclusion of massive regulators is not necessary, 
and in Appendix A the analysis is generalized to the case 
where massive regulators are employed. As discussed in 
Sec. 4.4 of Fadde’ev and Slavnov (1980), massive regula¬ 
tors have useful formal properties, but they are awkward 
to use in explicit calculations. A superior method for di¬ 
agram evaluations is the technique of dimensional regu¬ 
larization, which is discussed in Sec. Ill below. The sub¬ 
sequent sections of this review contain elaborations on 
the theorems of this section. For the theorems to have a 
nontrivial content, we must have a way of generating a 
nonzero scale for physical masses even when bare masses 
are zero (otherwise we get G,nd=0» which is calculable 
but trivial); this requires dynamical breaking of scale in¬ 
variance, as discussed in detail in Sec. IV. In Sec. V, we 
derive explicit, formal expressions for G ; ~ d and A ind in 
terms of expectations of operators in the flat space-time 
matter vacuum. Finally, in Sec. VI, I extend the discus¬ 
sion to include the effects of quantization of the metric. 

III. DIMENSIONAL REGULARIZATION 
A. Survey 

The regularization of quantum field theory without in¬ 
troducing massive regulators can be accomplished by an- 

7 This type of argument was first used in connection with the 
calculability of mass relations by Weinberg (1972). 


alytic regularization methods, in which divergent in¬ 
tegrals are defined by analytic continuation in a dimen¬ 
sionless parameter (for a review, see Leibbrandt, 1975). 
It will suffice to limit the discussion to regularization 
methods for flat space-time, since we will see below (in 
Sec. V) that after doing the curvature arithmetic needed 
to extract expressions for G^ 1 and A ind , we can explicit¬ 
ly take the flat space-time limit in the resulting formu¬ 
las. The most widely used form of analytic regulariza¬ 
tion for flat space-time calculations is dimensional regu¬ 
larization, in which the dimension of the space-time 
manifold is continued from 4 to 2 m by the coordinate 
and momentum space replacements 

JVx — / d^x 

fd*p-+ f d^p , (3.1) 

while keeping the formal structure of the action, in 
terms of fields and field derivatives, the same as in di¬ 
mension four. After Wick rotation to 2o)-dimensional 
Euclidean space, 8 Feynman integrands in the continued 
theory are evaluated by using the following simple rules. 
The Kronecker delta 8£ obeys the usual composition law 

5%=5g , (3.2a) 

but its trace is modified to 


S£ = 2(0 


(3.2b) 


From Eq. (3.2) the symmetric average of momentum fac¬ 
tors can be uniquely deduced, giving, for example, 


n & 

OVPv)&ymmeiric = ~ ^juv * 


average 


(3.3) 


The Dirac y matrices continue to obey a Clifford algebra 


lr JU >rv)=26 Juv i, 

and are trace normalized so that 


(3.4a) 


Tr(r Ju rv)=2 fl, 5 JUV , Tr( 1)=2" , (3.4b) 

permitting one to deduce unique values for all spinor 
loops not containing an odd number of factors y 5 . 9 Us¬ 
ing Eqs. (3.2)—(3.4) and rotational covariance, all pertur¬ 
bation theory calculations can be reduced to multiple in¬ 
tegrals of scalar-valued integrands over the momentum 
space of dimension 2m. 

The basic momentum space integral which appears is 
fd^pf(p), (3.5) 

and is uniquely specified, up to an overall normalization, 
by the following three conditions given by Wilson (1973), 


8 I will use the notation to denote a Euclidean integral, 
and will consistently use a 4 - 4 - 4- 4 - metric convention in Eu¬ 
clidean space formulas. 

9 For recent discussions of the dimensional regularization 
treatment of 75 , see Gottlieb and Donohue (1979) and Ovrut 


(1981). 
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linearity: 

f E d 2a ’p[af l (p) + bf 2 (p)] = af E d 2 °’pf l (p) 

+ bf E d 2 “pf 2 (p), 

translation invariance: 

J^Pfip +g)=f E d^pf(p), 
scaling law: 

f E d 2 “pf( S p)=s- 2 *’f E d 2 °‘pf(p) . (3.6) 

The normalization which is conventionally used is 


f E d 2a pe-P 2 =iT, (3.7) 

but is fixed only at co =2; Collins (1975) shows that am¬ 
biguities in normalization away from co =2 can always be 
absorbed into the ambiguities of the renormalization con¬ 
stants discussed in Sec. II.C. Hence dimensional regular¬ 
ization gives a well-defined procedure for regularizing 
the ultraviolet divergences of quantum field theory. 10 

Using the rules of Eqs. (3.5)—(3.7), we find, for exam¬ 
ple, that 

f d 2a p(p 2 + m 2 )~ a =v ja ^~-~-(m 2 )‘°~ a . (3.8) 

j e r(a) 

For co — a <0, the integral on the left is convergent in the 
ultraviolet and yields the expression on the right, which 
is meromorphic (analytic apart from isolated poles) in co 
and a. The integral can then be defined by analytic con¬ 
tinuation for 6) —a > 0, except at points where it develops 
poles. For example, when a = 1 we have 



2 to 


1 


P- 2 , 2 

/> 2 + m 2 


=^ni-a> )(m 2 r* 


rr 


1 —co 
7T 2 

2 — 6 ) 


+ finite , near co — 1 


+ finite 


m 2 , near co=2 , 


(3.9) 


showing that the pole at co = 1 is associated with the 
two-dimensional logarithmically diveigent integral 



TT 

1 - 6 ) * 



(3.10a) 


while the pole at co — 2 is associated with the four¬ 
dimensional logarithmically divergent integral 

1 7T 2 




2-co’ 


CO 


>2 . 


(3.10b; 


between the representation of a four-dimensional loga¬ 
rithmic divergence in the massive regulator and the di¬ 
mensional regularization schemes. In N -loop order in di¬ 
mensional regularization, one in general encounters 
higher powers of logarithmic divergences 
1/(2 —co), . . . , 1/(2 —co) N near co=2; these divergences 
must be cancelled against corresponding poles in the re¬ 
normalization constants Z in order to extract finite phys¬ 
ical amplitudes at dimension four. 

B. Vanishing of quadratic divergences 


The formally quadratically divergent (and /^-inde¬ 
pendent) integral 



(3.12) 


is assigned the value 0 by dimensional regularization, 
since the right-hand side of Eq. (3.9) is proportional to 
m 2 at co=2. A more precise statement of this fact is 
given by the following: 


Lemma. The only evaluation of the ultraviolet diver¬ 
gent, infrared convergent massless integral 

I a,a = $ E d 2a p(p 2 )- a , co-a>0 , (3.13) 

which is meromorphic in co and a and which agrees with 
the m—► 0 limit of Eq. (3.8), is 0. The proof fol¬ 
lows immediately from the observations that: (i) when 
co — a> 0 is not a positive integer, the limit as m—► 0 of 
Eq. (3.8) exists, and is 0; and (ii) the only meromorphic 
extension (to co — a=positive integer) of 0 is 0. 11 Work¬ 
ing from J“ ,a =0, we can now prove the vanishing of 

f d 2a p(p 2 )~ a (logp 2 )- p , co-a> 0 (3.14) 

by repeated differentiation of with respect to a, and 
by repeated application of the Weyl transform (Erdelyi, 
1954) 

w e r , a ,v = _A^ f°°dy(Y-af-'I a ’ r ’ p ' 

Lip) 

= r d 2<a p[W p (p 2 )- a ](\ogp 2 )- l, ‘ . (3.15) 

Since 

W e (p 2 )~ a =-j^(p 2 )- a dbb p -\p 2 )~ B 

=(p 2 )~ a (logp 2 )~ p , 0</?< I , (3.16) 


The integral of Eq. (3.10b) would be represented by 
tt 2 log M 2 using a conventional massive regulator, giving 
the useful correspondence 

IogAfV.—— (3. II) 

2—co 


l0 A detailed axiomatization of the rules of dimensional regu¬ 
larization, along the lines sketched by Wilson (1973), has been 
given by Collins (unpublished). 


we have 


11 Any nonzero evaluation of J w * a (such as the one given by 
Leibbrandt, 1975) is thus necessarily not a meromorphic func¬ 
tion of co . Such evaluations violate the basic philosophy of an¬ 
alytic regularization, which is essentially a calculus of mero¬ 
morphic functions. The vanishing of I** a in dimensional regu¬ 
larization was first noted by’t Hooft and Veltman (1972), and 
is deduced as a theorem in the axiomatization of Collins (un¬ 
published). 
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w p 1 0 1 , a ,p- =1 0 1 , a ,p-+p ) o</?< I , (2.24)-(2.26) and Eq. (3.11) are used to write 

(-3/3, (3.17) F ka F ka =Z~'F r ka F rka , 


and so by repeated operations any value of P can be 
reached, starting from P=0, where we have 
/to,a,o_joj 7 a_Q gy continuing this procedure with 
respect to the index ft we can generate powers of 
loglogp 2 , etc., giving finally: 

Lemma. In dimensional regularization, for ft? — a>0 we 
have 


an 

Z e = l + — logAf 2 + 0(ao) 


1 + 


«o 1 


z,-' = i- a ° 


3 7T 2—ft? 
1 


+ 0(<*o) , 


(3.23) 


3tt 2 — 


CO 


+ 


«0 

3tt 


1 


( 2 —ft ?) 2 


+ • * 


ja,a,P,Y, • ■ ■ _ / £ rf 2 ^(^2)~a (logp 2 ) -^ (loglogp 2 ) -y. . . 

=0 . (3.18) 

In particular, the generalized quadratically divergent in¬ 
tegral vanishes, 

j2,\,p,Y,- .. _ f ^-^(log/J 2 )“^(loglogp 2 )” y * * * —0 . 
J E p* 

(3.19) 

This result shows that computing radiative corrections to 
the basic quadratically divergent integral of Eq. (3.12) al¬ 
ways gives 0, independently of whether one proceeds or¬ 
der by order in perturbation theory, which gives only 
positive powers of log/? 2 (corresponding to I 2,l ’~ n ), or 
whether one uses the renormalization group to sum 
powers of logp 2 into running coupling constant factors 
(see Sec. IV.C below), giving the more general integral of 
Eq. (3.18). 


C. An application: the stress tensor trace anomaly 
in gauge theories 


As an application of dimensional regularization, let us 
derive, to one-loop order, the flat space-time stress tensor 
trace anomaly in QED -j. In spinor quantum electro¬ 
dynamics, the symmetrized stress energy tensor is given 
by 


Tpv= TVuvFkaF*^ ~ 


+ ^[VKrv-f> JU +r JU -f>v)^-V f (-^ JU rv+-D~vr JU )^] > 

+ 0^=3^ — ieoAp . (3.20) 

Contracting with rf v and using Eq. (3.2b) and the spinor 
equation of motion 


/y^ = (3 . 2 !) 

we get 

rjj = -2(2-<o)±F Ka F Ka + $m 0 rl> . (3.22) 

Although the first term on the right-hand side of Eq. 
(3.22) is proportional to 2 — co, it cannot be dropped as 
co—>2 because the factor F^ a F k(T contains a pole series in 
(2 —co) -1 . To exhibit these poles explicitly, Eqs. 


where we have worked to one-loop order in the photon 
proper self-energy, and to iterated one-loop order in 
Z~ l . Substituting Eq. (3.23) into the first term of Eq. 


— 2(2 -co)Z~ x = 


2 a 


o 


3tt 


1- 


+ 


^o 7 -i 
3rr e 


as co- 

-2) 

«o 

1 

3 7T 2 

.—co 

«0 

2 

1 

3 7T 

. 

(2 —is?) 2 

2 a 

e 2 

3tt * 47r 




• 4 • 


(3.24) 


Hence to one-loop order the stress energy tensor trace is 

(3.25) 


Tf.=~^F r Xa F r ^ + 1 (,morl> . 


3tt 


The first term on the right-hand side, found by Coleman 
and Jackiw (1971), Crewther (1972), and Chanowitz and 
Ellis (1972, 1973), would be lost if one naively used the 
equations of motion without attention to regularization, 
and is called the trace anomaly. The derivation given 
above can be generalized to all orders in perturbation 
theory (Adler, Collins, and Duncan, 1977; Nielsen, 1977) 
and yields 

n = I (F^F^Y+i I + S(e)](^m 0 t/-) r , (3.26) 

e 


with P and 5 finite functions of e which are defined 
through the renormalization group, and with the split¬ 
ting of into the two terms on the right-hand side 
made unique by the specification of certain zero 
momentum-transfer matrix elements of the composite 
operators (F^F^Y and (tpntoxpY. The analogous formu¬ 
la for QCD (obtained by Collins, Duncan, and Joglekar, 
1977 and Nielsen, 1977) reads 12 

T^ = 2^± (F i K ^ay +[l+8(g )]^ molfl) r 

8 

(3.27) 

For a pure SU(«) gauge theory with no quarks, the 
second term on the right-hand side is absent, and the 
trace anomaly formula simplifies to 


12 I have dropped equation of motion terms, which both van¬ 
ish at nonzero momentum transfer and have vanishing zero- 
momentum—transfer vacuum expectation values. 
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8 


(3.28) 


Equation (3.28) will play an important role in the 
analysis, given in Sec. V.D below, of and A ind in an 
SU(h) gauge theory. 


IV. SYMMETRY BREAKDOWN 


A. Models with elementary scalars 


at 0 becomes unstable, and V develops a pair of 
stable minima at 

* =± *' (4.3) 

?=(-mjA 0 ) 1/2 , 

as shown in Fig. Kb). Either the minimum at cp=q> or 
the minimum at <p= — ^ can be used as the zeroth-order 
approximation in a perturbation expansion, by making a 
shift 

<P— ±<P+<p' (4.4) 


Spontaneous symmetry breaking plays a crucial role in 
constructing gauge theory models, since it permits gen¬ 
eration of the gauge boson masses needed to get realistic 
low-energy effective actions, while preserving the ultra¬ 
violet cancellations which guarantee renormalizability. 
The simplest model exhibiting spontaneous symmetry 
breaking is the scalar q> A field theory of Eq. (2.4), 

JSf = T~V 9 

T=±(d of p) 2 , 

V=^(d i tp) 2 +^ml<p 2 +{h < fp 4 . (4.1) 

For constant <p , the potential V reduces to 

F(<p)=ymo^ 2 +T^ 4 > (4.2) 

and has the behavior sketched in Fig. 1. In the conven¬ 
tional case m o>0, the potential has a single stable 
minimum at 0, as shown in Fig. 1(a). However, 
when the sign of mo is reversed to mo <0, the extremum 




(b) 


FIG. 1. (a) Potential V of Eq. (4.2), for mj>0. (b) Potential 
V of Eq. (4.2), for m\ <0. 


and taking cp ' as the new field variable. Mixing between 
the two configurations is not possible, because in the 
limit of an infinite space-time volume, they are separated 
by an infinite quantum-mechanical tunneling barrier. 
Thus the discrete qx^—<p symmetry of the Lagrangian is 
broken by the choice of one of the two classical minima 
as the quantum mechanical vacuum state. 

The simplest field theory model in which a continuous 
symmetry is broken is obtained by making <p a complex 
scalar field 

<P = <Pl + f<?>2 > ( 4 -5) 

with Lagrangian density 

X = |0 „<p3V- (<p*<p) 2 . (4.6) 

* 

When mo<0 the potential V has the behavior sketched 
in Fig. 2, and a suitable quantum-mechanical vacuum is 
obtained by making the shift 

, (4.7a) 

with ^ any complex constant scalar satisfying 

\v\ 2 =—ml/k 0 . (4.7b) 

In this case a continuous symmetry is broken, and the 
excitation <p' which generates an infinitesimal rotation of 
^ is a zero-mass Goldstone mode. When the complex 
scalar field of Eqs. (4.5) —(4.6) is minimally coupled to a 
spin-1 gauge field, the zero-mass mode decouples from 
the physical degrees of freedom, and the spin-1 field be- 



FIG. 2. Potential V in the complex scalar model, obtained by 
substituting <p 2 —*<p*<p in Eq. (4.2). The curve shows an arbi¬ 
trary section lm(<pe 10 )— 0 of the potential surface, O^0<2tt, 
plotted vs R e(<pe i0 ). 
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conies massive. This is the so-called Higgs mechanism, 
which is used in the Weinberg-Salam model to generate 
intermediate vector boson masses. (For a detailed 
pedagogical review of these ideas and full references, see 
Bernstein, 1974). 

The suggestion of linking spontaneous scale symmetry 
breaking with generation of the gravitational constant 
first appeared in the context of scalar meson models [see 
Fujii (1974), Englert, Truffin, and Gastmans (1976), 
Minkowski (1977), Chudnovsky (1978), Matsuki (1978), 
Smolin (1979), Zee (1979), Linde (1979, 1980) and Nieh 
(1982)]. The basic mechanism of the above-cited papers 
is to start from a Lagrangian density of the form 

jf = eq> 2 R + T — V(<p 2 ) , (4.8) 


with V a symmetry-breaking potential as in Eq. (4.2). In 
the unstable symmetric phase ^>=0 there is no order-/? 
term in Jf, but in the stable broken-symmetry phase 
with ^ 2 = -mjA 0 an induced gravitational action is 
generated with 


1 

l6rrG ind 



(4.9) 


In such models, since both scalar fields and dimensional 
parameters (m 0 =7^0) appear, the induced gravitational 
constant is not calculable* 3 ; e is an additional curved 
space-time parameter of the theory which is not deter¬ 
mined by the flat space-time renormalized parameters 
(Brown and Collins, 1980). 


B. Dynamical symmetry breaking: 
the renormalization group 
in asymptotically free gauge theories 

In order to get a calculable and nonvanishing induced 
gravitational constant, we must turn our attention to 
field theory models with dynamical scale-invariance 
breaking. Such theories, by definition, are formally scale 
invariant at the classical Lagrangian or tree- 
approximation level, but exhibit spontaneous scale- 
invariance breaking as a result of quantum corrections in 
one- or higher-loop order. There are two reasonably well 
understood mechanisms by which dynamical scale- 
invariance breaking can occur. The first, which will be 
discussed in this section, is through the renormalization 
process itself, in infrared-singular theories such as unbro¬ 
ken non-Abelian gauge theories. The second, which will 
be described in Sec. IV.C below, is through the genera¬ 
tion of a mass gap and a fermion pair condensate in rela¬ 
tivistic versions of the Bardeen-Cooper-Schrieffer (BCS, 
1957) theory of superconductivity. The two mechanisms 
are not really disjoint, and both are believed to be opera¬ 
tive in non-Abelian gauge theories. This fact and some 


13 Strictly speaking, to get a renormalizable model an addi¬ 
tional term 6mj>R must be included in the spontaneous 
symmetry breaking then generates a change in the constant 

A A T 

factor multiplying R from 6m o to 6m o + • 


further gauge theory-superconductor analogies are dis¬ 
cussed briefly in Sec. IV.D. The material which follows 
has been organized so that the reader who wishes to 
proceed most directly to the gravitational applications of 
Secs. V and VI can do so after reading Sec. IV.B alone. 

The most important class of field theory models exhi¬ 
biting dynamical spontaneous scale-invariance breaking 
are asymptotically free gauge theories [see *t Hooft (un¬ 
published), Gross and Wilczek (1973), and Politzer 
(1973)]. Consider an SU(n) non-Abelian gauge field cou¬ 
pled to Nf massless fermions in the fundamental repre¬ 
sentation, as is described, for example, by -5?qcd of Eq. 
(2.12) with m 0 =0 and with xp replicated Nf times. In 
tree approximation this theory contains no dimensional 
parameters, and so scale invariance is unbroken; more¬ 
over, since there are no scalar fields, all of the conditions 
of the theorems of Sec. II are satisfied. Let us now con¬ 
sider the effect of quantum corrections to the tree- 
approximation theory. When radiative corrections are 
included, the coupling constant g appears in calculations 
through the running coupling constant 



_ shah _ t 

l + \b 0 g 2 (fi 2 )log(—q 2 /ti 2 ) + 


(4.10) 


with q 2 the four-momentum squared, fi 2 an arbitrary 
subtraction point, and g 2 (^ 2 ) the value of the coupling 
constant at —q 2 —fi 2 . The appearance of the subtraction 
mass fi 2 is necessitated by the fact that radiative correc¬ 
tions to massless gauge theories are highly infrared 
divergent, making it impossible to introduce a renormal¬ 
ized coupling parameter by specifying the value of g 2 at 
q 2 =0 , as is done in the more familiar case of quantum 
electrodynamics. The parameter b 0 is determined by 
one-loop radiative corrections to be 




(4.11) 


and is positive, provided that Nf is not too large. When 
b 0 is positive, Eq. (4.10) shows that the running coupling 
vanishes at large four-momentum squared, and the 
theory in this case is said to be asymptotically free. 

Let us examine the structure of Eq. (4.10) in the ap¬ 
proximation in which only one-loop radiative corrections 
are retained, while the higher-loop contributions to the 
running coupling constant, denoted by . .., are neglected. 
(As discussed in Appendix B.I, there is a well-defined 
sense in which a one-loop analysis is exact.) Evaluating 
Eq. (4.10) at —q 2 =ii 2 , we get 




tMo gn\ 


= -lT77-TMog^ 2 , (4.12) 

g V) 

showing that the scale mass ^(g(/i),/i) defined by 
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^(g(^),^)=^e 1/[bo8 (fl )! (4.13) 

is subtraction-point independent. In technical terminolo¬ 
gy, the scale mass is said to be renormaliza¬ 

tion group 14 invariant to one-loop order, since it is left 
unchanged to this order by transformations of the renor¬ 
malization point fi 2 and the renormalized coupling con¬ 
stant g 2 (/t 2 ). When radiative corrections to all orders are 
kept, Eq. (4.13) generalizes to (Gross and Neveu, 1974; 
Lane, 1974a) 

ds/Pig \ (4.14) 

with 

^(g)--|6og 3 +0(g 5 ) (4.15) 


the function appearing in the trace anomaly formula of 
Eq. (3.27), and again ^(g,^) is said to be renormaliza¬ 
tion group invariant. An alternative, and frequently 
used, way of specifying that has the functional form 
of Eq. (4.14) is obtained by requiring that satisfy the 
Callan (I970)-Symanzik (1970) differential equation 




_d_ 

dfi 





(4.16) 


Let us now apply the above analysis to determine the 
structure of physically observable parameters, such as ef¬ 
fective action parameters. Since observables must be 
subtraction-point independent, they can depend on p, 
only through the scale mass and so we get the 

following important result: 


theories, characterized by their values of the dimension- 
one scale mass ^(g,^). This change in dimensionality 
of the effective parameter, when radiative corrections are 
included, clearly implies that there has been a dynamical 
breaking of scale invariance. The general phenomenon is 
called dimensional transmutation, after Coleman and 
Weinberg (1973), who discovered similar behavior in 
massless QED 0 (a theory which, like the massless non- 
Abelian gauge theory, is highly infrared divergent.) 

C. Dynamical symmetry breaking: relativistic 
generalizations of the superconductor gap equation 

Historically, the earliest suggestion that dynamical 
symmetry breaking plays an important role in particle 
physics was contained in the classic paper of Nambu and 
Jona-Lasinio (1961), who proposed a model for nucleon 
mass generation 15 based on an analogy with the BCS 
theory of superconductivity. 16 The Nambu—Jona- 
Lasinio model starts from a Lagrangian containing mass¬ 
less, interacting fermions, and then sets up a self- 
consistent equation for the dynamically generated fer¬ 
mion mass in analogy with the “gap equation” of super¬ 
conductivity. In this section, I give a very schematic ac¬ 
count of the basic approximation method used in the 
BCS and Nambu—Jona-Lasinio models, and show that it 
gives a dynamical version of the tree-approximation 
model for symmetry breaking described in Sec. IV.A. 

Let us consider a fermion with bare propagator Go* 1 * 
proper self-energy part 2, and full propagator G -1 , re¬ 
lated to one another as usual by 


Theorem [Gross and Neveu (1974)]. Any physical 
parameter P(g,fi) which has canonical dimension d P in 
the accounting of Sec. II.A must be equal to [^(g f ^)] p 
up to a calculable number, 

P (g,M) = calculable number X l^(g,fi) ] dp . (4.17) 

Equivalently, P(g,/t) must satisfy the homogeneous re¬ 
normalization group equation 

P(g,li)=0, (4.18) 

which for a quantity of canonical dimension d P implies 
Eq. (4.17). 

According to this theorem, it is the dimensional scale 
mass rather than the dimensionless (but subtraction- 
point dependent) renormalized coupling g 2 (^ 2 ), which in 
asymptotically free gauge theories plays a role analogous 
to that played by the renormalized fine-structure con¬ 
stant a in quantum electrodynamics. In other words, the 
renormalization process has replaced a one-parameter 
family of unrenormalized theories, characterized by their 
values of the dimensionless unrenormalized gauge cou¬ 
pling go, by a one-parameter family of renormalized 


,4 For a pedagogical discussion of the renormalization group 
structure of non-Abelian gauge theories, see Stevenson (1981). 


n^-+P(g) 9 


dp, 


dg 


G“ J = G o' -2 - (4.19) 

Assuming the fermions interact through a potential F, a 
simple self-consistent approximation for the proper self- 
energy is obtained by truncating the Dyson equation for 
2 to include only the lowest-order skeleton diagram il¬ 
lustrated in Fig. 3. This gives 

x=/kg 

= jV(G 0 “'+2)[G^ 2 -2 2 ]-\ (4.20) 

where f indicates symbolically a summation or integra- 
tion over intermediate state (closed loop) variables. In 
models with dynamical symmetry breaking, the unbroken 
symmetry of the classical Lagrangian can be shown to 


,5 A very important aspect of the Nambu —Jona-Lasinio 
model, which is not dealt with in this review, is the generation 
of the pion as a zero-mass bound state. There has been recent 
interest in analogs of this phenomenon in which the Higgs 
scalars or pseudoscalars in unified theories are dynamically 
generated composites of more fundamental fields; see Englert 
and Brout (1964); Jackiw and Johnson (1973); Cornwall and 
Norton (1973); Weinberg (1976); and Susskind (1979). 

l6 For texts on the BCS theory, see Schrieffer (1964) and 
Fetter and Walecka (1971). The Ginzburg-Landau phe¬ 
nomenological theory is also described in these books. 
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FIG. 3. Truncated Dyson equation for the self-energy part. 
The dashed line and dots denote the potential V in the BCS 
case, or the photon propagator and emission and absorption 
vertices in the JBW model case. The heavy line denotes a full 
electron propagator G = (Gq 1 — 2)"', giving a nonlinear in¬ 
tegral equation (the gap equation) for 2. 

imply that 

fVGo'[Go 2 -Z 2 ]~ l =0. (4.21) 

Substituting Eq. (4.21) into Eq. (4.20) then gives the gen¬ 
eral form of the “gap equation” for 2, 

2= f VS.lGo 2 -^, 2 ]- 1 . (4.22) 

Equation (4.22) always has a trivial solution 2=0, analo¬ 
gous to the trivial root <p=0 of the equation 

0 =F'(<p)=< ? >(mS + V 2 ) . (4.23) 

which governs the vacuum structure of the scalar meson 
model discussed in Sec. IV.A. However, when Fhas the 
(attractive) sign for which dynamical symmetry breaking 
occurs, there is also a nontrivial solution to Eq. (4.22), 
corresponding symbolically to the root of 

1 = f F[Go" 2 —2 2 ] -1 , (4.24) 

and analogous to the symmetry-breaking roots cp^±cp of 
Eq. (4.23). 

To solve Eq. (4.24) explicitly in the case of the BCS 
model, we make substitutions appropriate to the nonrela- 
tivistic kinematics of the superconductor problem [see 
Schrieffer (1964)], 

Go 2 =ko — (k 2 ~-k£) 2 +iE , (4.25) 

2 2 —A 2 , 


with N the density of states at the Fermi surface and c a 
numerical factor of order unity. Solving Eq. (4.28) for A 
gives 


A = — co D exp 
c 


1 

NV 


(4.29) 


showing that the energy gap has a nonperturbative 
dependence on the interaction strength F, with an essen¬ 
tial singularity at F=0. The detailed analysis of the 
BCS model shows that the energy gap A is proportional 
to the ground-state expectation value of a product of 
creation (or annihilation) operators for two electrons, 
with opposite momenta lying near the Fermi surface and 
opposite spins, 

A a , I k | ~k F . (4.30) 

Thus, the presence of a nonvanishing energy gap in a 
superconductor implies the existence of a ground-state 
condensate of correlated electron pairs. 

An analogous reduction of Eq. (4.22) (now using rela¬ 
tivistic kinematics) can be carried out for the 
Nambu—Jona-Lasinio model and for its more recent 
gauge-theoretic extensions, in which the nonrenormaliz- 
able local four-fermion interaction used by Nambu and 
Jona-Lasinio is replaced by a renormalizable interaction 
mediated by vector meson exchange. [See Johnson, Bak¬ 
er, and Willey (1964), Jackiw and Johnson (1973), 
Cornwall and Norton (1973), and Lane (1974b).] For de¬ 
finiteness, let us consider the case of the Johnson-Baker- 
Willey (JBW, 1964) model for fermion mass generation 
in Abelian electrodynamics. These authors consider 
zero-bare mass spinor electrodynamics [that is, & QED 1/2 
of Eq. (2.10), with m 0 =0] in the approximation in 
which all photon self-energy parts are neglected. The 
dashed line in Fig. 3 then represents a bare photon pro¬ 
pagator; thus to leading order of perturbation theory for 
the vertex parts, the analog of Eq. (4.22) is 

2(p)~«a 0 / d A k^2.(p -k ) 

K 


where k F is the Fermi momentum, and we carry out the 
k 0 integration. Equation (4.24) then yields an algebraic 
equation for the energy gap characterizing the low-lying 
electronic excitations in a superconductor, 


l = V l |* 2 _* 2|=0 


d 3 k 


1 


[(/c 2 —/c |) 2 +A 2 ] 1/2 


(4.26) 


X[(p-fc) 2 -2(p-fc) 2 r' , (4.31) 

where — indicates that numerical constants of order uni¬ 
ty have been omitted. In addition to the trivial solution 
2=0, Eq. (4.31) has a nonperturbative solution in which 
2 has the asymptotic behavior 


with o) D the Debye frequency, which serves as an effec¬ 
tive ultraviolet cutoff in the BCS model. Because phase 
space in the neighborhood of the Fermi momentum is ef¬ 
fectively one dimensional. 


2(p)~m 



6 


, 8~a 0 . 


Equation (4.32) gives self-consistency because 


d 3 kai4vkfdk , 


(4.27) 


Eq. (4.26) is logarithmically divergent at the lower limit 
when A = 0, and for small A can be approximated by 




m 


k 2 ( p-kY 


m 


— {p —k) : 


! =NVf 


Q) 


*> dco 


cA a) 


—NVlog 


• 

1 

m 2 1 

Ct ) j ) 

cA 

i 

, (4.28) ~ 8 m 

1 

K> 


<*o 


2<P>, 


(4.32) 


(4.33) 
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which follows from angular averaging and the elementa¬ 
ry integral 

f “ . (4.34) 

J a B 5 

The parameter m in Eq. (4.32) is an arbitrary integration 
constant introduced by the boundary condition 

2(/> 2 = —m 2 )~m , (4.35) 

and clearly corresponds to an electron physical mass. 
We see that as a result of dynamical symmetry breaking 
a mass scale has appeared in the solution to Eq. (4.31), 
even though no mass scale appears in the integral equa¬ 
tion itself or in the fundamental Lagrangian from which 
it was derived. The vanishing of m 0 is mirrored in the 
fact that tip) has a softer ultraviolet behavior 


2(/>) -► 0 (4.36) 

p 2 —► » 

than would be found if a mass scale were introduced 
kinematically into the Lagrangian. Such ultraviolet soft¬ 
ness (seen also in the discussion of asymptotically free 
gauge theories in Sec. IV.B above) is a very general 
feature of field theory models where the mass scale is in¬ 
troduced through dynamical scale-invariance breaking. 
The detailed analysis of the JBW and other 
Nambu—Jona-Lasinio type models shows that, associat¬ 
ed with the generation of a nonvanishing fermion physi¬ 
cal mass, the ground state contains a fermionic conden¬ 
sate, this time involving a nonvanishing fermion- 
antifermion expectation value of the form q. 


D. Gauge theory-superconductor analogies 


Comparing Eq. (4.13) with Eq. (4.29), we see that 
there is a close similarity between the nonpertuibative 
structure of the gauge theory one-loop scale mass 
^(g,/x) and that of the superconductor energy gap A. 
As was noted in connection with Eqs. (4.26)—(4.28) 
above, the e~ x/NV form in the superconductor case arises 
from the effectively one-dimensional phase space near 
the Fermi surface, which produces a logarithmically 
divergent one-loop perturbation theory contribution 


/ d 3 k |» ^max dk 

k 2 —kp k F k — k F 


(4.37) 


Similarly, the e~ ,/g form in the gauge theory case arises 
from the logarithmic divergence of the one-loop contri¬ 
bution to g 2 ( —q 2 )~ l at q 2 — 0, which in turn comes from 
the nonvanishing and effectively one-dimensional phase 
space for a massless particle to decay into two massless 
particles, as expressed in the identity 


I k, | | k 2 1 8 3 (k-k, —k 2 )8( | k | - | k, | - | k 2 1 ) 

=2tt/ 0 'd^8 3 (k 2 -kx)8 3 [k,-k(I-x)] . (4.38) 

To see the effect of Eq. (4.38), let us recall that the S- 
wave phase space for a pair of particles of mass m, at 
center of mass energy V'T, is 



s —4 m 2 
s 


1/2 


9 


(4.39) 


and vanishes at threshold for m > 0. However, when 
m =0, Eq. (4.39) reduces to p(s) = l, which is nonvanish¬ 
ing at threshold as required by Eq. (4.38). Consequently, 
the one-loop perturbation-theory integral 

r s max ds'p(s') (4 40) 

s'—q 2 

is logarithmically divergent at q 2 = 0. 

As suggested by this phase-space analysis, and as dis¬ 
cussed in more detail by Gross and Neveu (1974) and 
Lane (1974a, 1974b), the renormalization group mechan¬ 
ism for dynamical symmetry breaking on the one hand, 
and the superconductor gap equation mechanism on the 
other, are really two complementary aspects of the 
dynamical symmetry breaking which occurs in non- 
Abelian gauge theories. The gauge theory-super¬ 
conductor analogy can be carried considerably further. 
Just as a superconductor contains an electron pair con¬ 
densate proportional to the energy gap A, quantum chro¬ 
modynamics contains a fermionic condensate (ifnlr) 0 pro¬ 
portional to the third power of the gauge theory 
scale mass and very likely 17 contains a gluonic con¬ 
densate (F r ka F ikir )o proportional to ^ 4 . When a super¬ 
conductor and its energy gap are perturbed by a weakly 
varying electromagnetic field, the resulting dynamics is 
described by the induced effective action of the 
Ginzburg-Landau theory. 16 Correspondingly, when a 
non-Abelian gauge theory and its scale mass are per¬ 
turbed by a weakly varying metric, the resulting dynam¬ 
ics, as we will see in detail below, is described by an in¬ 
duced effective action of the Einstein-Hilbert form. 18 


V. INDUCED GRAVITATIONAL 
AND COSMOLOGICAL CONSTANTS 
FOR MATTER THEORIES 
ON A BACKGROUND MANIFOLD 

A. Path-integral derivation of formulas for Gj and Aj nd 

From the viewpoint of the theorem of Gross and 
Neveu discussed in Sec. IV.B, the induced gravitational 


l7 For discussions of gluon pairing see Batalin, Matinyan, and 
Savvidl (1977); Savvidy (1977); Pagels and Tomboulis (1978); 
Valnstein, Zakharov, and Shifman (1978); Ambjorn and Olesen 
(1980); Fukuda and Kazama (1980); Kazama (1980); and Mil- 
ton (1981). See also Sec. V.D below. 

,8 The superconductor phase space analogy is discussed briefly 
in the “photon pairing” paper of Adler et al. (1976). One con¬ 
clusion of their paper, that photon ladders cannot generate a 
graviton in flat space-time, is a special case of a recent general 
theorem of Witten and Weinberg (1980). The remainder of 
their paper and a subsequent paper of Adler (1976) attempted, 
unsuccessfully, to generate a gap equation as a curvature effect 
in a model which has no gap equation in the absence of curva¬ 
ture. 
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constant G and cosmological constant A ind of a gauge 
field theory are simply physical parameters of canonical 
dimension two, defined through the response of the 
gauge field system to local perturbations in the space- 
time metric. This suggests that it should be possible to 
take formal derivatives with respect to deviations of the 
metric g ^ from the Minkowski metric 77 ^ thereby ex¬ 
tracting expressions for and A ind in terms of flat 
space-time vacuum expectation values. Such an analysis 
will be carried out in this section, using the metric and 
curvature conventions of Misner et al. (1970). 

The starting point of the derivation is the basic defini¬ 
tion of the gravitational effective action given in Eq. 
(2.40) above. 




= fd{H 


«[(*). 


(5.1) 


with 


fd 4 xv^g 


1 


16ir G ind 


( R -2A ind ) 


+o[ o^v) 4 ] 


(5.2a) 


<WU„v] = fd 4 xJf[{</,}, gltv ], 

■¥[{<!>],gn V \ ■ (5.2b) 

I will assume that the microscopic action density 1? is a 
function of the metric and its first and second deriva¬ 
tives, 

(^} »gjuv] — {^Ug/iv’dxg/iv’dxdog/iv) » (5.3) 

making the derivation general enough to encompass the 
case, discussed in Sec. VI below, where the metric itself 
(and not just the matter fields (^}) is path integral quan¬ 
tized. To proceed, let us calculate the conformal varia¬ 
tion of Eq. (5.1) around a general background metric. 
This is done by acting on the left- and right-hand sides 
with the differential operator 2 g JUV (y) 8 / 8 g JUV (y), where y 
is an arbitrary space-time point which will shortly be 
chosen as the origin, and then dividing by iexpUS ef f). 
Inserting the expansion of Eq. (5.2a) in the left-hand side 
gives 




>M V 


1 


167tG j nd 


iR -2A ind ) + 0[0^ v ) 4 ] 


fdlH 


«S[{^),g uv | 


" v, 2 g MV (y) 


Sg MV (y) 


I* 




fdlH 




(5.4a) 


where the quantities g^, Jzf, R inside the x-integral are 
values at space-time point x, and where the functional 
integral f d ( <f >} is still an integration over the values of 
the matter fields at all space-time points, 

fd{<t>} = Hfd{<l>(z)} . (5.4b) 

2 

Equation (5.4a) can be evaluated using standard formulas 
for the first variations (with as before, the renor¬ 
malized matter stress-energy tensor), 

8^=? = . 


8(V^R )= -V~=i{R^-\g^R)b gllv 


+ total derivatives , 

(5.5) 

8 ^ = \ T^bg'jv , 


T^—V—g T^ v 

(5.6) 


dJ? dJf BJf 

ag„v x 9(9a*„ v > + * 


Substituting these, and defining the point y to be the ori¬ 
gin 0 in order to simplify the subsequent formulas, we 
get 


8 t tG, 


ind 


[/{(0)-4A ind ]+O[O^ uv ) 4 ] 

fdMe—^T[g MV , 


>JUV 

*S[{^),g uv |,= ( 


0] 


fd{+l 




(5.7) 


with Tfg^vjx] the stress-energy tensor trace functional 
defined by 


T[g,j V ,x] = V^gT* 



djf 3 a.y 
3 g„ v 1 90xS„v) 




dJ? 


(5.8a) 


Taking the flat space-time limit of Eq. (5.7) and intro 
ducing the abbreviated notation 


T(x)=T[ ViiV ,x] = T» 


&flV VflV 


(5.8b) 


we obtain a formula for the induced cosmological term. 


i A ind _ Pw^’V'Vq)) 
2^ G ind fdWe^* '’V' 1 


(5.9) 


In order to extract a formula for the induced gravita¬ 
tional constant, we must take a further metric variation 
of Eq. (5.7). Since the left-hand side of Eq. (5.7) has no 
tensor structure, it suffices to specialize 19 to a metric 
which around x =0 has the conformally flat, constant- 
curvature form 


19 For a derivation which does not make this specialization, 
but instead proceeds from the general Riemann normal expan¬ 
sion g tt y = rj ttv —(\)R tiav pX a x p + • ■ • , see Adler (1980c). See 
also Brown and Zee (1982). 
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8v V 1 (x)='Vjuv[ 1 — TT-K ( 0 2 + °<Vi? ,R 2 )] 
= 7 7 Ju v+5g JU v » 

8g MV (x)=— (0)x 2 , x 2 =(x') 2 —(x 0 ) 2 . 


(5.10) 


Varying Eq. (5.7) around a Minkowski background, and 
dropping terms which are higher than second order in 
the expansion in powers of a^g^, we get 


8t tG 


ind 


•[*(0)—4A ind ] 


8t tG 


R( 0) 


ind 


/d[^Je B[W,vl 6r[«„ v .O] 

fdWe iSlW ’^' 


+ 


/rf(^}e'' Sn0) ’ Vl r(O)i / d\h& 

Jd w ~ lslw ^' (ii 

fd[(f>}e‘ 


«[(«!, Vvl r(0 ) fd{</>] 


(ii) 


f d 4 x 


. (iii) 


(5.11) 


(5.13) 


[ / d{0}e tS[| * 1, ’ Vvl 

Terms (ii) and (iii) on the right-hand side can be evaluated by using Eqs. (5.6), (5.8), and (5.10), which give 

t f d 4 x8^=--^R{ 0) / d 4 xx 2 T(x) . (5.12) 

To evaluate term (i), we note that since Sg^v vanishes as x 2 at x =0, the only terms which contribute to 5T[g JUV ,0] are 
those in which 8g MV is acted on by two derivatives. After a certain amount of algebra, we find 

5T[g JUV ,0]=2/? (0)t7(0) , 

with U(x) the functional defined by 

a 2 ^ a 2 ^ a a 2 ^ 

gM ao^v)a (Begat) *** dg MV d(dgdtg a/} ) +8e * K ao^vjao^a^) 

a 2 ^ 0 2 ^ 

_28KI>da did^g^didgg^) _ 8e *^° d(d K d a g fiV )d(d g d l fg a p) 

Inserting Eqs. (5.12) —(5.14) into Eq. (5.11) and dividing by 2 R (0) gives the desired formula for G^}, 

/ d(Me'' !?[m ’ V ' l t/(0) 


t/(x)=- [r g /( vg a p 


(5.14) 


SflY V 


1 

16irGi nc | 




fd 4 xx 
96 J 


/ d{^}e SmU ^T(Q)T{x) 

/ d(^}e WI| ^ , ’’ , '‘ vl r(0) 


/ d^e'^’^Tix) 


f d{4>\e lSm ’ v ^ 

2 


(5.15) 


If we define the subtracted functional T by 


r(x)=r(x)- 


/ dWe^'^Tix) 
f d{<t>]e‘ 




(5.16) 


and note that the second term on the right-hand side is a 
constant, we can rewrite Eq. (5.15) as 


1 


16ir G ind 


/d(<He' ,!?im ’ V ' 1 t/(0) 




f d * xx 

96 J 


/ d{<t>}e lsli * ] ’ 7 ,i ‘ v 'nomx) 


fd[+]' 


,«[(4|.V ] 


(5.17) 


Finally, recalling the correspondence (Abers and Lee, 
1973) between expectations of functionals and vacuum 
expectations of time-ordered products of the correspond¬ 
ing operators, 

(A(0 „ /rf[»I« gtw,vl ^( 0) 

<*«»• r , «U4],^ V ] 

J d W e * (5,18) 

f dit}e iSll * ],v i‘ r, A(x)BW) 

{ruum o)»o= J —^-iFuiv-i-> 

we can rewrite Eqs. (5.9) and (5.15)—(5.17) in the com¬ 
pact form 

1 Aind =(r(0)) o , (5.19a) 


2?r Gind 
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1 

16irG ind 


= <t/(0)> 0 - 


i 

96 


x 2 [ <^-(T(x)T( 0))>o 
-<r(0)>o] 


= (£/( 0)> o -^r f d 4 x x 2 (3 r (T(x)T(0))) o , 

f(x)=r(x)-<r(x)>o. (5.19b) 

As noted above, we have so far carried along some ex¬ 
tra generality, which will be needed to discuss the case 
when the metric is a quantum variable. When the metric 
is not quantized, the trace functional F[g^ v ,0] depends 
on derivatives of the metric only 20 through terms of or¬ 
der R 2 , which come directly from the Lagrangian terms 
of Eq. (2.35). The variations of these terms 
vanish in flat space-time, and so when the metric is not 
quantized, the functional U vanishes. Hence for matter 
theories on a background manifold, Eq. (5.19b) reduces 
to the form 

- L -- = -^± fd 4 xx 2 (jr(¥{x)¥{ 0))>o (5.20) 

16t rG ind 96 J 

given by Adler (1980b) and Zee (1981). 


B. Convergence and spectral analysis 


From the explicit formula of Eq. (5.20), we can again 
analyze the conditions for G^J to be calculable. Since 
Eq. (5.20) is a flat space-time formula, it will be con¬ 
venient at this point to switch to the Bjorken-Drell 
(1965) signature convention, in which Eq. (5.20) becomes 


1 

167rG ind 


-±r f d*xx 2 (srmx)n 0))> o , 


X 2 =(x°) 2 -(x') 2 . (5.21) 

As discussed in Sec. Ill above, we define the flat space- 
time matter theory by a renormalization procedure based 
on dimensional regularization, and so Eq. (5.21) is to be 
interpreted as a dimensional continuation limit 


lim fd^xx 2 (jr(T(xmO)))% , 
167rGi nd 96 o>—► 2 J 

(5.22) 

where ( >o denotes the vacuum expectation in the 2 co- 
dimensional theory. Equation (5.22) will give a calcul- 
able G~ d * if the integral on the right-hand side is regular 
at & = 2, and as we have seen, the singularity structure in 
the & plane is directly determined by the ultraviolet 
divergence structure of the dimension-four integral of 
Eq. (5.21). This can be studied by using the Wilson 
(1968) operator product expansion of the time-ordered 
product, 21 

(^-(f(x)f(0))) o = X logs 

(x z r 


{# 2^0 

(X 2 ) 3 


Xlogs+O 


1 

U 2 ) 2 


(5.23) 


20 The Lagrangian density J? also contains metric derivatives 
in the spin connections used in constructing the spinor kinetic 
terms, but these do not appear in the trace functional T. 


where “X logs” indicates the presence of power series in 
logx 2 , and where <? 0t2 are Lorentz-scalar, internal 
symmetry-invariant operators of canonical dimension 0 
and 2, respectively [corresponding to the fact that T has 
canonical dimension four, and hence the left-hand side of 
Eq. (5.23) has canonical dimension eight]. When Eq. 
(5.23) is inserted in Eq. (5.21) the order (x 2 )“ 4 terms give 
formally quadratically divergent integrals, which vanish 
by the lemma of Eq. (3.18) above, while the order 0c 2 )“ 2 
and higher terms are ultraviolet convergent. However, 
the order (x 2 )~ 3 terms give logarithmically divergent in¬ 
tegrals and thus generate poles at m = 2 in the dimension¬ 
al continuation, unless no operators are present in the 
theory, in which case Gi„ d is calculable. We have there¬ 
fore recovered the same calculability criterion as was ob¬ 
tained from the dimensional algorithm in Sec. H.D 
above. 

Let us next attempt to put Eq. (5.21) into spectral 
form, which if possible, would yield information about 
the sign of G ind . From the standard 22 spectral analysis 
for a scalar operator cp 9 we have 

— i(3?~{q>{x)q>(0)))o— f Q da 2 p(cr 2 )A F (x,a) , (5.24) 

with p the spectral function defined by 23 
p(q 2 )^(27r) 3 ^8 4 (p n —q) \ <0|<p(0)|n> | 2 >0 , (5.25) 


and with A F the scalar Feynman propagator, 


A F (x,a) 


r d*k c _, k . x 1 

J (2tt) 4 k 2 -a 2 + ie ' 


(5.26) 


Ignoringjbr the moment questions of convergence, let us 
set <p—T in the above formulas and substitute into Eq. 
(5.21), giving 


^= ^ f 0 "dcr 2 p(a 2 ) [-/ d 4 xx 2 A F (.x,a) 

(5.27) 


16 irG ind 

A simple calculation then shows that 


— f d 4 x x 2 A F (x,a)= 


a 2 


1 


0^0** (k 2 -cr 2 ) 


-8 




and so Eq. (5.27) yields 

j_J_ r°° j ? p(g 2 ) 

167rG ind 12 Jo a 4 ’ 


(5.28) 


(5.29) 


which if correct would imply that Gi~ d has manifestly 
the wrong sign to give attractive gravitation. However, 
Eq. (5.29) is valid only if the integral on the right-hand 
side converges, which requires the vanishing of p(a 2 )/a 2 
as a becomes infinite. But in gauge theories, we have 
seen in Sec. III.C above that T contains a trace anomaly 


2l For a proof of the operator product expansion in perturba¬ 
tion theory and a detailed discussion, see Zimmermann (1970). 

22 See Bjorken and Drell (1965), pp. 138—139 and pp. 
387-390. 

23 Since p is gauge invariant, it can be evaluated in a canonical 
gauge to establish positivity. 
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term proportional to the hard operator [(Fj^ ) 2 ] r , as 
a result of which p(a 2 ) behaves asymptotically as a 4 
X logs, invalidating the spectral representation of Eq. 
(5.29). The failure of the spectral representation, as indi¬ 
cated by the quadratic divergence of Eq. (5.29), is just a 
reflection of the formal quadratic divergence of Eq. 
(5.22), arising from the leading (x 2 )~ 4 term in the opera¬ 
tor product expansion of Eq. (5.23). 

The breakdown of Eq. (5.29) can also be rephrased in 
the language of dispersion relations, by defining 

X0t 2 )= /d 4 xe , * x (-!)<^'(f(x)f(0))>o . (5.30) 

If X(k 2 )— X(0) obeyed an unsubtracted dispersion rela¬ 
tion, then Eq. (5.29) could be derived, but in fact one 
must make an additional subtraction, as in 
X(k 2 )— X(0)— k 2 X'(0), before getting a quantity which 
obeys an unsubtracted dispersion relation. Substituting 
this dispersion relation into Eq. (5.21) then yields 
G ind -r(0), which furnishes no a priori information 
about the sign of G ind . The calculations discussed in the 
next two sections suggest, in fact, that the sign of G ind is 
sensitive to details of the infrared behavior of the matter 
theory. 

C. Early model calculations of Gf* 

According to Eq. (5.21), the leading perturbative con¬ 
tributions to G are those in which two insertions of 
the stress-energy tensor trace T are made in connected 
matter diagrams of low-loop order, as shown in Fig. 4. 
In theories with dynamical spontaneous symmetry break¬ 
ing, such as SU (n) gauge theories, the diagrams of Fig. 
4(a) and 4(b) are typically absent and the leading contri¬ 
butions to Gj“ d begin at three-loop order. However, one 
way of simulating the ultraviolet softening produced by 
dynamical scale-invariance breaking is to consider a mas- 



T 



T 



T 


FIG. 4. Typical diagrams contributing to G,~J in (a) one-, (b) 
two-, and (c) three-loop order, respectively, with the solid lines 
indicating matter field propagators. In an SU(n) gauge theory, 
the contributions of one- and two-loop order vanish, and the 
perturbation series for Gj^J begins at three-loop order, with a 
leading term proportional to g 4 . 


sive fermion or scalar meson theory, in which the lead¬ 
ing contribution is the one-loop diagram of Fig. 4(a), and 
to include explicit, finite-mass Pauli-Villars regulators to 
control the ultraviolet divergences. This calculation has 
been performed by Sakharov (1975), Akama et al. 1A 

(1978), and Zee (1981), and Zee’s results in particular 
were important in motivating the general derivation lead¬ 
ing to Eq. (5.21). Zee considers a fermion loop of mass 
and by including two Pauli-Villars regulators 
with mass m u2 , finds 

1 _ 2-7T 2 j 

167rG ind " 3(27r) 4 * 

2 2 2 

/= 2 cimflogm 2 , with2 c i=0» 2 c *' w2== 0 . (5.31) 

1=0 o /=0 


By some simple algebra, Eq. (5.31) can be rewritten as 


J^=m 2 


w 2 _ 2 
m j —tn 

~~2 ~2 

tn j — tn 2 


, 2 , 

log—Z-— m z log—r , 

tn 2 tn 2 


(5.32) 


an expression which is positive as long as m 2 <m 2 t2 , but 
which can change sign when the regulator masses are 
smaller than m , illustrating the sensitivity of the sign of 
Gind t0 dynamical details. In order to give the observed 
magnitude of G^ d , Eq. (5.31) requires 
m~m Y \anck = I-22X I0 19 GeV, suggesting more generally 
that to get a realistic theory of Einstein gravitation as an 
induced quantum effect, the physics of dynamical scale- 
invariance breaking must take place at energies near the 
Planck mass. 

According to the discussion of Sec. IV.B above, the 
simplest field theory model which has calculable induced 
gravitational and cosmological constants is a pure SU(2) 
gauge theory. A direct evaluation of Eq. (5.7) has been 
given in this case by Hasslacher and Mottola (1980), us¬ 
ing the approximation of saturating the Euclidean con¬ 
tinuation of the functional integral by a dilute gas of in- 
stantons. 25 Their result can be written as 


-Ar ~<* -4A ind )+CK* 2 ) 

87rG ind 

= f 0 ^f[C\+C 2 p 2 R + * * * ]D(pp) , (5.33) 

where the integral is over the instanton size parameter p, 
and where p max (F) symbolically indicates a cutoff on this 
integration, of unknown form at present,' produced by 
the infrared vacuum structure of the gauge theory. The 
instanton gas calculation gives a definite expression for 
the integrand of Eq. (5.33), written as a series expansion 
in R times the flat space-time instanton density 25 D{pp ), 


24 See also Terazawa et al (1977a,b) for related earlier work 
by this group. 

25 For a pedagogical review of instanton gas methods, see 
Coleman (1979). A simplified derivation of the instanton den¬ 
sity D(pp) (with p the subtraction mass discussed in Sec. 
IV.C) is given by Bernard (1979). 
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C 

C 



j(a 2 -\~a p -\~a g — jP) , 


«2 


I 

7» 


a *=T Io 8 







(5.34) 


In Eq. (5.34), C\ gives the contribution to the cosmologi¬ 
cal constant arising from the instanton gas expectation 
of the trace anomaly of Eq. (3.28), while C 2 gives the 
corresponding contribution to the induced gravitational 
constant, obtained by summing contributions from the 
various small fluctuation modes around an instanton. 
Specifically, a 2 , a py and a g are, respectively, the contri¬ 
butions from the translational, dilatational and gauge 
zero modes, while J3 is the contribution from the nonzero 
modes. The log R terms in a p and a g arise because these 
zero modes make a contribution to Eq. (5.21) which is 
infrared divergent. Since an exact evaluation of the Eu¬ 
clidean continuation of the correlation function 
r(x)r(0))> 0 is expected to show an exponential de¬ 
cay law for large separations x (see Sec. V.D below), Eq. 
(5.21) should in fact be strongly convergent in the in¬ 
frared. Thus the divergence leading to the presence of 
Iogft in a p and a g appears to be an artifact of the dilute 
instanton gas approximation, and one expects the R log R 
terms in the integrand of Eq. (5.33) to be cancelled by 
corresponding terms in the integration cutoff p max (/?) 
and/or in corrections to the instanton picture, leaving a 
remainder of order R which is determined by the de¬ 
tailed dynamics of the infrared region. This means that 
the dilute instanton gas calculation, while demonstrating 
the existence and ultraviolet finiteness of the induced 
gravitational action in the gauge theory case, does not 
yield a quantitative calculation of G^ d . 


D. A strategy for calculating G in d and A m d 
In an SU(n) gauge theory 

Because a pure Yang-Mills theory is the simplest field 
theory model with dynamical scale-invariance breaking, 
it would clearly be desirable to carry out quantitative 
calculations of G^} and A ind in this case. I shall outline 
below a general strategy for doing this, assuming that 
one can, in principle, make arbitrarily good Monte Car¬ 
lo 26 evaluations of the various gluon field vacuum expec¬ 
tations which are needed, together with calculations to 
any finite order of perturbation theory. 

Let us begin with the induced cosmological term 
A ind /G ind . Substituting Eq. (5.8b) into Eq. (5.19a) and 
converting to the Bjorken-Drell metric convention (which 
was used in the derivation of Sec. III.C), we get 


26 For a review of statistical physics applications of Monte 
Carlo methods. See Binder (1976). Lattice gauge theories were 
introduced by Wilson (1974); see also Kogut and Susskind 
(1975) and the review by Creutz (1978). The application of 
Monte Carlo methods to lattice gauge theories was initiated by 
Creutz, Jacobs, and Rebbi (1979) and Creutz (1980). 


1 A; nd 

27r G ind 


= <rj> 0 - 


(5.35) 


The vacuum expectation on the right can be expressed in 
terms of the gluon field strength by using the trace 
anomaly relation of Eq. (3.28), giving 



At this point it is convenient to choose a definition of 
the coupling constant (see Appendix B.I for details) for 
which the one-loop renormalization group structure of 
Eqs. (4.11) —(4.13) is exact. 27 Combining Eq. (5.36) with 
Eqs. (4.11) and (4.15), we then find 


I A 


ind 


27r G ind 


= <^>0 


8 


a,= 


g 2 


II 


n — 


t n ' 




(5.37) 


47r 


where for a pure SU(n) Yang-Mills theory one would set 
Nf=- 0. Equation (5.37) expresses the induced cosmologi¬ 
cal term as a multiple of the extensively studied 17 gluon 
pairing amplitude ((a 5 /7r)((i r j lo .) 2 ) r > 0 . Since the gluon 
pairing amplitude has canonical dimension four, it is 
proportional to the fourth power of the renormalization- 
group-invariant scale mass introduced in Sec. IV. B 
above. 





-i /(V 2 > 


9 


(5.38) 


with c a numerical constant of order unity. According 
to Eq. (5.38), the pairing amplitude has an essential 

singularity of the form e 4Ab ° g ) a t g 2 =0, and vanishes 
identically in perturbation theory. This agrees with what 
would be found by making a Feynman diagram expan¬ 
sion of the left-hand side of Eq. (5.38) and evaluating the 
formally quartically divergent momentum space integrals 
by using the lemma of Eq. (3.18). 

In order to express Eq. (5.38) directly in terms of an 
observable quantity, it is customary to introduce the 
string tension a, defined as the coefficient of the asymp¬ 
totic linear term in the heavy quark-antiquark static po¬ 
tential, 

K stalic (R) = crR+0(l). (5.39) 

oo 


Since the string tension has canonical dimension two, it 
is proportional to the square of 

a=c '^ 2 , (5.40) 


with c* a second numerical constant of order unity. 
Eliminating from Eqs. (5.38) and (5.40), we get 


27 If the transformation of Appendix B.I is not made, the gen 
era! definition of the gluon pairing amplitude which corre 
sponds to that of Eq. (5.38) is (— 2p/b 0 g 3 )((a s /'irH(F l Ktr ) 2 ) r )o- 
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(~(^L^' X<r )jo=c"o- 2 , 

c"=c/(c') 2 , (5.41) 


which when substituted into Eq. (5.37) gives a relation 
between the induced cosmological term and the string 
tension, 

-^=-2tt \(±n-\N f )c"o 2 . (5.42) 

Gi n d 


Methods for making a Monte Carlo estimate of c" in 
pure SU(2) and SU(3) gauge theories (w=2,3; N/=0) 
have been discussed by Kripfganz (1981), by Banks et al. 
(1981), and by Di Giacomo and Paffuti (1982). 

Let us consider next the expression for the induced 
gravitational constant G^ 1 given in Eq. (5.21), which, we 
have seen, must be interpreted as a dimensional con¬ 
tinuation limit. Again substituting the trace anomaly 
equation, and defining the coupling constant so that the 
one-loop renormalization group is exact, we get 


= ■77 f d 4 xx 2 ^(-x 2 ) , 

Q f\ J 


167rG ind 96 

^(-x 2 )=(^(r(x)r(o)))o-<r)S, 


(5.43) 


To evaluate Eq. (5.43) it is convenient to make a Wick 
rotation to the Euclidean section, which is formally ac¬ 
complished by making the substitutions d A x —►— i d 4 x, 
x 2 -> -x 2 , giving 


I 

16 irG ind 


±f s d<xx*V(x 2 ) 


(5.44) 


In order to devise a practical method for implementing 
the dimensional continuation limit implicit in Eq. 
(5.44), 28 we shall split the integration over the variable 
x 2 =t into an ultraviolet (UV) part 0<f <f 0 , and an in¬ 
frared (IR) part *o <* < oo. 


16- 96 (/uv+/ir) ’ 

I V y=f\tt 2 mt), (5.45) 

/ir= . 

J r 0 


Let us suppose that the correlation function WU) has 


been determined to high accuracy by Monte Carlo stud¬ 
ies. In the infrared region, behaves for large t as 

-m f 1/2 

¥(f) ~ e * , (5.46) 

t —► OO 


with m g a parameter, called the glueball mass, which is 
related to the string tension by 

=c g <T l/2 =c g (c')' /2 ^ , (5.47) 


with c g a numerical constant. [Numerical Monte Carlo 
estimates of c g for an SU(2) gauge theory, obtained by 
studying the plaquette-plaquette correlation function, 
have been given recently by Berg (1981) and by Bhanot 
and Rebbi (1981).] As a result of the good asymptotic 
behavior of Eq. (5.46), the infrared integral J IR of Eq. 
(5.45) is convergent at f=oo, and can be evaluated by 
numerical integration. Turning next to the ultraviolet 
integral J uv » let us write it in the form 


I\ jv—Tuv+ATuv , 

/ c uv= f\tt 2 v e (t) , 

A/ uv = f^dttUVW-Vclt)] , (5.48) 

with 'I'cU) a comparison function chosen so that: (i) the 
integral AJ UV converges at t= 0, and hence can be 
evaluated by numerical integration; and (ii) the dimen¬ 
sional continuation needed to evaluate I u V can be carried 
out explicitly, leaving a convergent integral which can 
again be done numerically. The motivation behind the 
introduction of is the evident fact that, while discrete 
methods can be used to evaluate convergent integrals, 
they cannot be used to make analytic continuations. 

The general form required for the comparison func¬ 
tion ^(f) can be inferred from the operator product ex¬ 
pansion of Eq. (5.23). This expansion can be “im¬ 
proved” by using the renormalization group and asymp¬ 
totic freedom, which permit a partial resummation of 
the power series of logarithms in the leading term of Eq. 
(5.23) into a joint power series in the running coupling 
constant g\t) and (since we have made the transforma¬ 
tion of Appendix B.I) in its logarithm Iogg 2 (f). Defin¬ 
ing the coordinate space running coupling by 29 

2 


g l (t) = 


I 


g 


- Tb 0 Iog(^ 2 f) I — 7*og 2 Iog(^ 2 0 


(5.49) 


we have 30 


28 Use of a coordinate space formalism is not necessary in or¬ 
der to implement the dimensional continuation limit. For ex¬ 
ample, one could equally well rewrite the spectral representa. 
tion of Eq. (5.29) as 




UV 




00 


4 


d & 


p(a 2 ) 


IR 






cr - u cr 

and evaluate J v v by dimensional continuation. However, it is 
likely to be easier to extract the coordinate space function 
*(x 2 ) than the spectral function p(cr 2 ) from Monte Carlo stud¬ 
ies of the infrared region. 


29 The use of the same scale mass in Eq. (5.49) as in the one- 
loop version of Eq. (4.10) is a matter of convenience; redefin¬ 
ing ^ by a constant factor simply redefines the expansion 
coefficients appearing in Eq. (5.50). 

30 ln general, such renormalization-group-improved operator 
product expansions contain an additional fractional power 
[Iog(uf 2 f)] fi , with the exponent 6 proportional to the difference 
in anomalous dimensions of the operators on the left- and 
right-hand sides. Since and &o cc 1 both have zero 
anomalous dimension, this fractional power is absent from the 
leading term in the expansion. See Gross and Wilczek (1974), 
p. 982, for a detailed discussion of this point. 
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V(f)=C ,y 


I 

t\ —lo&s# 2 t) 2 


1 + T~~r~iA^S a 10 +a '' io g io g(-^ 2 ') -1 ]+ 

(—log ^t) 


+ 0(f~ 2 ) . 


(5.50) 


The leading term in Eq. (5.50) is proportional to 


I 

( — Iog^ 2 *) 2 



(5.51) 


because, as seen from Eq. (5.43), the perturbation expan¬ 
sion for 'P begins in order g 4 ; the constant C* is com¬ 
puted from Iowest-order perturbation theory in Appendix 
B.2, with the result 


C 


v — 



(5.52) 


[The two-loop contribution to the glueball propagator, 
which gives the coefficients a, 0 ,a M in the series of Eq. 
(5.50), has recently been calculated by Kataev et al. 
(1982).] No order term is present in the expansion 
of Eq. (5.50) because of the absence of dimension-two 
operators while the order t~ 2 and higher terms make 
contributions to / uv which are convergent at f=0. 
Hence it suffices to take as the comparison function 
the leading t ~ 4 part of 'P(f), 


VcU)=C* — 


I 


2*\2 


X 


f (— Iog^f) 

,, * 4 [iogiog 

1 ' u nm , , „l,\n 

.tuto (-log ^t l t) n 


and to restrict t 0 by the condition 

~# 2 t 0 < I , 


(5.53) 

(5.54) 


so that the logarithm Iog(~^ 2 f) does not vanish in the in¬ 
tegration range 0<f <f 0 of Substituting Eq. (5.53) 
into /uv and making the change of variable u=~# 2 t 
gives 

jc _ r ^2 r u ° du G(u) « 2 

/ uv -C^j o M 2 (IogM) 2» **0, 


©(m)= I + 


oo ft 

n = I m =0 


(log log u T 
(log u 1 ) n 


(5.55) 


The evaluation of this integral by dimensional continua¬ 
tion is carried out in Appendix B.3, with the result 


/ uv = C\ 




l 00 


log(^ 2 r 0 ) 1 


dv^rO(e- v ) 


V 


(5.56) 


where Re indicates the real part, and where the integra¬ 
tion contour is shown in Fig. 5. [As discussed in Ap¬ 
pendix B.3, the need to take a real part in Eq. (5.56), re¬ 
flecting the existence of a cut in the co plane, arises from 
the fact that the running coupling constant variable g 2 (t ) 
used in the “improved” expansion sums an infinite num¬ 
ber of Feynman diagrams. The dimensional continua¬ 
tion of individual Feynman diagrams remains mero- 
morphic in <y.] The integral of Eq. (5.56) can be done by 
numerical integration, and so the problem of evaluating 


Eq. (5.43) has been reduced to a sequence of steps which 
can each be implemented by discrete methods. 

Up to this point in the discussion I have used the 
one-loop exact running coupling constant defined in Eq. 
(5.49), which transforms the renormalization group to its 
minimal, exponential form. However, in doing an actual 
calculation it is not advantageous to make the nonanalyt- 
ic transformation of Appendix B.I; instead, it is better to 
work with a two-loop exact or more general definition of 
the running coupling constant g\t ), in terms of which 
'Pc(f) takes the form of a simple power-series expansion 




00 


1+ 2>„[g 2 U)] n 


n *= I 


. (5.57) 


Corresponding to this, Eqs. (5.55) and (5.56) take the 
form 


00 


0(m)= 1 + £ c n [g 2 (u/^ 2 )] n , 

n = I 


/lv = j*oCV^Re 


/ . I 00 

, dve v 

lOgC.^fn )~ 1 


X[g 2 (e- v /^r 2 )] 2 0(e- v ) 


(5.58) 


with the coefficient c, known from the above-cited work 
of Kataev et al ., and with the higher coefficients yet to 
be computed. In doing a calculation it is of course 
necessary to make an explicit choice both for the divid¬ 
ing point f 0 , and for the accuracy to which the perturba¬ 
tion expansion is to be computed. A reasonable stra¬ 
tegy for doing this, I believe, is as follows: 

(i) Choose t 0 far enough into the ultraviolet so that 
perturbation theory is valid at t 0 , and so that 
I A/uv //jr I is small. Such a choice is always possible, 
since the fact that A/uv is a convergent integral implies 
that 



FIG. 5. Contour of integration C to be used in-evaluating Eq. 
(5.56). The contour begins at v — Iogi/fT 1 — 10g(^ 2 fo)~' and 
must avoid the singularity at v =0. 
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lim A/ uv =0 . (5.59) 

r 0 —*-0 


(ii) Then, keeping t 0 fixed, compute a large enough 
number N of perturbation-theory coefficients c n so that 
Juv is well approximated by 


Ivv 


= r °dt 

l Jo 


o 

I • 2 


= ^boCq,^ 2 Re 


dimensionally regularized 

/ i00 

, ,dve v 

Og ^ 2 t 0 )~ ] 


X[ g 2 (e- v /^ 2 )] 2 e N (e~ v ) 


(5.60) 


with ^(f) and O n (u), respectively, the truncations of 
the series of Eq. (5.57) and Eq. (5.58) to the first N 
terms. Such an approximation is possible because 31 

lim (5.61) 

N —► oo 

implies that 

lim /Jft=/uv • (5.62) 

N—+ 00 

(iii) According to Eqs. (5.59) and (5.62), the total in¬ 
tegral which we are calculating is given by the double 
limit 


I=Iir +Tuv + ATuv 

— lim lim (/jr+Tuv) » (5.63) 

fp — >0N —► oo 

which with t 0 and N chosen according to (i) and (ii), is 
well approximated by 

/ss/jr+Juv ■ (5.64) 

However, for fixed N we must be careful not to let t 0 be¬ 
come arbitrarily small in the approximated expression of 
Eq. (5.64), because as a result of the mismatch between 
Jj R and Juv and the quadratic divergence of the unregu¬ 
larized integral, we find 

lim (/i R +/uv)—oo . (5.65) 

r 0 —*-0 

In other words, the order of the limiting operations in 
Eq. (5.63) is significant, and is reflected in the procedure 
for choosing t 0 and N given in (i) and (ii) above. 

In a recent paper. Zee (1982a) has given a model in 
which the infrared region is explicitly known, permitting 
the complete integral /uv+Tir to be evaluated explicitly 
by dimensional regularization, and thus giving a simple 
illustration of the methods outlined above. Zee’s model 
is a gauge theory in which the one-loop /^-function coef¬ 
ficient b 0 is positive and small, while the two-loop /?- 
function coefficient b\ is negative [cf. Appendix B, Eq. 

31 If the series for ¥ c (0 is only an asymptotic series, a sum¬ 
mation procedure [such as Pade approximants or Borel sum¬ 
mation; see Simon (1981)] is needed to extract, from the per¬ 
turbation coefficients c n , a sequence of approximants which 
satisfy Eq. (5.61). 


(Bl)], as happens, for instance, in QCD with 16 quark 
flavors. Such a theory is still asymptotically free, but 
has a nontrivial infrared stable fixed point at a small 
coupling constant g* = — b 0 /(2b{). In the approxima¬ 
tion of retaining only the leading term in an expansion in 
powers of g*, one finds 


»(rt=* e (rt=x*oC*-r 

g\t) 

f, g 2 u) 

1 2 


r 


g* 

L j 



(5.66) 


with the two factors g 2 (f)[l — g 2 (0/g*] arising directly 
from the two factors j8(g)/g, which appear in 'P when 
the trace anomaly formula of Eq. (3.28) is used. Hence, 
in this model the entire answer is given by the power- 
series expansion of Eq. (5.57), and the series terminates 
after only a finite number of terms. The explicit calcula¬ 
tion shows that the sign of G ind in this model depends 
strongly on the values of the ^-function coefficients bo 
and b,, and thus again is sensitive to infrared details. 
[For a further discussion, in the context of a survey of 
induced gravitation generally, see Zee (1982c).] 


VI. EXTENSION TO A QUANTIZED METRIC 

A. The general-coordinate invariant effective action, 
and derivation of the background metric Einstein 
equations 

Up to this point the metric g^ v has been treated as a 
purely classical variable, which determines the back¬ 
ground geometry and thereby influences the dynamics of 
the quantized matter fields, but which is not itself quan¬ 
tized. While this classical metric formulation is useful 
as a model, there are a number of arguments indicating 
that it is not a satisfactory starting point for a funda¬ 
mental theory. For example. Duff (1981) has pointed 
out that if the metric is not quantized, then the system 
of equations comprising the quantized matter fields and 
the classical Einstein equations for the metric is not in¬ 
variant under metric-dependent redefinitions of the 
matter fields. Such redefinitions should be allowed in a 
completely consistent formulation, and Duff shows that 
they are in fact permitted if the metric is quantized. A 
second argument is simply that if the metric is treated as 
a classical variable, then the Einstein equations or the 
equivalent Einstein-Hilbert action principle must be pos¬ 
tulated on an ad hoc basis. As we will see below, when 
the metric is quantized, the Einstein equations for the 
background metric emerge automatically as the leading 
long-wavelength approximation to the effective action 
formalism. 

In discussing the dynamics of a quantized matter- 
metric system, it is necessary to give a procedure for 
identifying that part g^ v of the metric which we observe 
as the “classical” metric and a method for computing its 
effective action functional. I do this by using the back¬ 
ground field method of DeWitt (1965), in which the total 
quantum metric g ^ is split, in a self-consistent fashion, 
into the sum of a background metric g^ v and a quantum 
fluctuation A Vf 
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8jxv 8fiiv hfiiv • ( 6 . 1 ) 

Elaborating on earlier work by ’t Hooft (1975), recently 
Boulware (1981) and DeWitt (1981) have given an exten¬ 
sion 32 of the background field method which preserves 
manifest general-coordinate covariance with respect to 
the background metric, and hence is an ideal vehicle for 
the discussion which follows. 

To introduce the general-coordinate invariant effective 
action formalism, let us consider first the case in which 
no matter fields are present, so that the total microscopic 
action density consists solely of the term ^gravtSjuv] in- 
troduced in Eq. (2.38). The partition function is then 
given formally by 

Z= f d[ gliv ]e iS ^ lg ^ ] , 

S tmv [g^ v ]= f d 4 x , (6.2) 


in the following discussion. The gauge fixing of Eqs. 
(6.4) and (6.5) completely breaks the invariance of the 
gravitational action under the group of general- 
coordinate transformations g^^g^y, which has the in¬ 
finitesimal form 34 

+ , ( 6 . 6 ) 

with 60* an arbitrary infinitesimal contravariant vector. 
The Fadde’ev-Popov compensating determinant for the 
gauge-fixing action of Eq. (6.3) is defined by 35 

1 = f d[d]e !S ^ ls ^’ s ^[g^, gltv ] , (6.7) 

with d[6] the invariant measure on the manifold of the 
general-coordinate transformation group. Since the in¬ 
variant measure satisfies 

d{W\=d{&e\=d{e\ ( 6 . 8 ) 


but this expression is divergent because of the general- 
coordinate invariance of the action. To get a useful ex¬ 
pression for Z, a gauge-fixing term and a compensating 
Fadde’ev-Popov (1967) determinant must be introduced 
into Eq. (6.2). Let us choose the gauge-fixing term in 
the action to have the form 

•Vteo£>S/lv]= / d * x V S R gf\.gap>8 » V ] . (6-3) 

with g^p an arbitrary fixed reference metric (which for 
the time being is distinct from g^ v \ and with Jfgf con¬ 
structed so as to transform formally as a general- 
coordinate scalar with respect to g%p, when the total 
quantum metric g ^ is treated as a tensor with respect to 
gap. A suitable gauge fixing for quantizing the 
curvature-squared action of Eq. (2.38) would be 33 

^gflg^g^h^g^R^v^v . < 6 - 4 > 

n 

with V* the covariant derivative with respect to g^v, and 
with G v formally a covariant vector with respect to g£v, 
given by 

G v =Vftg /lv -|g^V J , v g M . (6.5) 

Equations (6.4) and (6.5) are a natural generalization of 
the usual harmonic coordinate condition; however, the 
precise form of (beyond the fact that it depends ex¬ 
plicitly on the auxiliary metric g*p) will not play a role 


32 See also Fradkin and Vilkovisky (1976), who use the gauge 
fixing 


-^ = TS''" v <V< ? v| 




to quantize the Einstein theory formally, and who suggest that 
it gives a generally covariant effective action for g MV . For the 
use of the gauge-invariant background field method to com¬ 
pute two-loop counter terms, see Abbott (1981) and Ichinose 
and Omote (1982). 

33 For a discussion of the complexities involved in representing 
higher-derivative gauge fixings in terms of a local “ghost” ac¬ 
tion density, see Kallosh (1978) and Nielsen (1978). 


for any fixed general-coordinate transformation g MV 
-+g£v, we learn from Eqs. (6.7) and (6.8) that the com¬ 
pensating determinant is invariant under general- 
coordinate transformations on g MV9 

A[gap,g / , v ] = A[g„p,g®v] • (6-9) 

According to the Fadde’ev-Popov ansatz, 35 a convergent 
path-integral representation for the partition function is 
then given by 

( 6 . 10 ) 


34 As pointed out by Fradkin and Vilkovisky (1975) and re¬ 
viewed by Batalin and Fradkin (1979), the presence of a term 
in degfty leads to a nonvanishing variation of the integra¬ 
tion measure under general coordinate transformations, 

cc f d*x a>.s\omHx). 

The d^S^O) term vanishes in covariant calculations using di¬ 
mensional regularization, and is ignored in the discussion of 
the text, where d[g MV ] is treated as being general-coordinate in¬ 
variant. The variation of the integration measure cannot be ig¬ 
nored in setting up a canonical, Hamiltonian formalism using 
a massive regulator scheme; in this case it leads to an extra 
Jacobian factor in the path-integral formulas, which can be 
represented by a quartic local “ghost” action density. For a 
related analysis of the connection between Jacobian factors in 
the path-integral measure and chiral and conformal anomalies, 
see Fujikawa (1981). 

35 The discussion of Eqs. (6.7) —(6.14) is based On Sec. 3.3 of 
Fadde’ev and Slavnov (1980). Strictly speaking, the Lagrang- 
ian form of the path-integral formula given in Eq. (6.10) must 
be derived from the more fundamental Hamiltonian form, and 
the standard textbook discussions describe this step only for 
second-order actions. The derivation of Eq. (6.10) from the 
Hamiltonian formalism in the case of fourth-order, curvature- 
squared gravitational actions has been carried out by Boulware 
(1982). 
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To verify that Z is independent of the choice of the 
reference metric gap, let us multiply the integrand of Eq. 
(6.10) by unity in the form 

1 = /c?[0]e iV[ *“ g ’*" vl A[g^,g® v ] , (6.11) 

giving 

Z=f c?[0]c?[g /lv ]A[g^,g /lv ]A[g^,g® v ] 

g ^grav \-Sftv 1 + ^gf[&afj<8pv 1 + ^gfig a# >Xpv 1 (6 12) 


Making the substitution g^v-'+g^v , and using the fact 
that the action SgravfgjuvL the compensating determinants 
A and the integration measure 34 d[g are all general- 
coordinate invariant, and also using the invariance prop¬ 
erty d[d]=d[d~ l ] t Eq. (6.12) becomes 

Z= f d[0- 1 ]c?[g /1 v]A[g^,g®v']A[g^,g /1 v] 

^ ■*" ^gf[g a p,gp V X + iSgflgaP'SpvX (613) 

But now applying Eq. (6.7) once more (with 0 replaced 
by 0” 1 ) we get 

Z= f ^[gjuv ] A[ga/J tgjtv ]e ,5grav[ *('v 1 + f (6 14) 


which differs from the original form in Eq. (6.10) by the 
replacement of gap by g' a p- 

Let us now introduce an external source J** coupled 
to the metric gj^, so that the path-integral formula of 
Eq. (6.10) is modified to read 


e 




^Z[J k °,g*p] 


J * d[g I i y\&\g ap,S pv\ 

x e ^gravt1 + *Sg/[ 8 2/5* 1 “ * / 1***^*" 


(6.15) 


Both J ktT and gap are indicated as arguments of Z in Eq. 
(6.15) because the source term breaks the general- 
coordinate invariance of the action. As a result, when 
J ka =£0 the argument of Eqs. (6,11)—(6.14) cannot be ap¬ 
plied, and hence the previously derived zero-source in¬ 
variance, 


0=~ 


6 

Sgtxp 


Z[0,8a P ] = 


6 

&ga P 


WIG,ga P ] , 


(6.16) 


cannot be extended to the case when a source is present. 
From the functional W y we can calculate the expectation 
value gxa of the metric in the presence of the source J ka 
by using the formula 


gkAJ^fg%p] = (gka)j=--jrjte > (617) 

which can be inverted to determine J ktr implicitly as a 
functional ofg^ (and ofg^), 

J ka =J ka [g r& ,g*p] . (6.18) 

Let us now introduce the Legendre-transformed effective 


action functional T defined by 

fd+xgiaJ*" . (6.19) 

Varying Eq. (6.19) (for fixed g*p) and using Eq. (6.17), 
we get 

8r=8ir+ / d'xig^bj**+bg^m 

= / d 4 x Sg^J^ , (6.20) 

which shows that r is a functional only of g ka and g„p, 
r=r[g A V^] , (6.21a) 

and satisfies 

-^=/ Xo \ (6.21b) 

SgAcr 

The partition function Z[J ktr f gapl can be reexpressed in 
terms of the effective action T through the formula 


e '"^*V =ext 

8 


‘Tlg^gigl-l f d*x gluj J^ 

> 

/ 


( 6 . 22 ) 


where ext **( ) indicates that one is to take the ex¬ 
tremum of the parenthesis over all values of g*". Equa¬ 
tion (6.22) is verified by noting that the exponent on the 
right-hand side is extremized 36 at the metric g^ r =g k<r 
for which Eq. (6.21b) is satisfied, and that at the ex¬ 
tremum it can be rewritten, by using Eq. (6.19), to give 

With these preliminaries completed, we are ready to 
introduce the general-coordinate invariant effective ac¬ 
tion functional r^g* 0 '], defined by identifying the 
reference metric gap with the expectation value g a p i Q 
the formulas given above, 


rinvtg^ng^g^]. 


(6.23) 


To get an explicit formula for r inv , let us multiply Eq. 
(6.15) by exp(i f d 4 x g^J^) and change to defined 
in Eq. (6.1), as the new functional integration variable. 
Making use of the identity 


^gf\.gap*gjxv~^~^nv\ ^gf\.gap*^ixv\ 

= T«^r v VxG /1 V tr G v , 

G v = — jg^^Vy h^x. (6.24) 

(which follows from the fact that Vxg]a V =0), we get the 
following functional integral representation for r inv , 

e ' r in V [f^l 

= J* ^[^juvl^t^aptgjuv+^juv] 


(6.25) 


36 I will assume here, and later on, that the extremum prob¬ 
lems which are encountered always have a unique solution. 
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The source current J ktr in Eq. (6.25) is implicitly deter¬ 
mined as a functional of g ka by the requirement 


0=<*Xa>/ = - 


sr 

8J ka 


(6.26) 


which is equivalent to 

0 J* ^[^juv]A[g a 0,g JUV H - /l JUV ]/lx o - 

(6.27) 

To see that r; nv is a general-coordinate invariant func¬ 
tional of its argument, we note that we are free to take 
hjxv* which is a dummy integration variable, to transform 
as a tensor with respect to general-coordinate transfor¬ 
mations of g^ v . By construction, Sgj is then a scalar 
with respect to such transformations, and therefore from 
Eq. (6.7), the compensating determinant A[g a p»gju V +^ju V ] 
is also a scalar. Equation (6.27) then determines J ^ to 
transform as a tensor, and so the right-hand side of Eq. 
(6.25) is manifestly invariant under general-coordinate 
transformations of g^. 

Let us next show that the source-free partition func¬ 
tion Z can be obtained by extremizing the gauge- 
invariant effective action functional. According to Eqs. 
(6.22) and (6.16), in the absence of an external source we 
have 


Z=ex W e' ri *^), 

o 


Z= 0 . 


SgZp 


(6.28a) 

(6.28b) 


The extremum in Eq. (6.28a) determines to take a 
value [g£p] at which 


6T 


6g 


ka 


[**",*&]=o, 


(6.29) 


and when expressed in terms of g ka , the reference-metric 
invariance of Eq. (6.28b) takes the form 


Og a p 


(6.30) 


Since g^lgapi is a continuous map from the manifold of 
reference metrics into itself, there is 37 a fixed point 
8af}=g*aP for which g^tgap] =£***' A * the fi Xed P oint > 
Eqs. (6.29) and (6.30) become 


8 r[g* Xtr ,g*p]=-Vr[ g *^ 4 ]=o , 

°gafi 


8g^ 

which together imply that 
—-—r. rp* x<r l=0 

ftgka 1 ‘" vL ^ i u * 

and so 36 we have 

Z =ext ^(e' r,nv[ * X " 1 ) . 
8 


(6.31) 


(6.32) 


(6.33) 


37 I am assuming that gap and g** lie in a closed convex set, 
so that the conditions of the Schauder fixed point theorem are 
satisfied. I wish to thank J. and L. Chayes for a conversation 
about the conditions for the existence of a fixed point. 


An alternative way of deriving Eq. (6.32) is to note that 
when variations of g*p are included, Eq. (6.20) is modi¬ 
fied to read 



v J 


+ 


fijko 


W k ° + bg ka J k ° 


, (6.34) 


which implies that 


6g* ( 


pinv|-^Xaj_ 


w 


•^[g^gapl-gerp 


Sgfc 
+^ X<r [g yS ,g„ P ] • 

At the solution g a p=gap of the equation 

^ Xtr [g rS ,g„p]=o, 


(6.35) 

(6.36) 


we learn from Eq. (6.16) that both terms on the right- 
hand side of Eq. (6.35) are zero, thus reproducing Eq. 
(6.32). 

Having now established the procedure for identifying 
the background metric and calculating its dynamics, let 
us restore the matter fields to the analysis. Following 
the notation of Eqs, (2.1) and (2.38), this is done by mak¬ 
ing the substitutions 


dlg^-^dlg^d^} , 

A'grav [gjuv 1 > ^mailer [ {^ ] fgp v ] “b &grav [gjuv 1 * 

•S'mailer [ [$ ]»gjuvl “ ^ d X V' — -g -^maiiert {^]»gjuvl 


(6.37) 

in Eq, (6,10), giving 38 
Z = / ^[gjuvl^WAtg^gjuv] 

Let us next divide the matter fields (0) into “light” and 
“heavy” components 39 as in Sec, II.B, and find the effec¬ 
tive action equations governing the dynamics of the light 


38 As in the earlier sections, I do not explicitly indicate the 
gauge-fixing procedure for the matter gauge fields. 

39 The heavy “matter” fields can include any fields which are 
not directly observable, including ones which are basically 
geometric or pregeometric in nature, and auxiliary fields. The 
only essential requirement for the discussion of Secs. VI.A and 
VLB is that the partition function be representable in the form 
of Eq. (6.38) for some choice of heavy fields \4 > H ). The dis¬ 
cussion, as given, applies only to the case when the observed 
matter fields \tp L ) appear as elementary fields in the funda¬ 
mental action. If, as has been much discussed recently, some 
of the light fields are effective fields for composites formed 
from the truly elementary fields, an extended effective action 
formalism is needed, along the lines discussed by Cornwall, 
Jackiw, and Tomboulis (1974). For a discussion of the effec¬ 
tive action for composites in a nonrelativistic solid-state phys¬ 
ics context, see Kleinert (1978). 
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fields. The most straightforward way of doing this is to 
introduce external sources [J L ] and expectation values 
[<j> L ] for the light matter fields {<t> L }, as well as an exter¬ 
nal source J** and expectation value g^ 0 " for the metric, 
and to construct the Legendre-transformed effective ac¬ 
tion functional r[(^ L ), g*" 9 g£p] in analogy with Eqs. 
(6.15)—(6.21) above. Following Eq. (6.28), the partition 
function Z of Eq. (6.38) can be expressed in the form 


7 , ''HI ?*■).**".*& 


6 



Z=0 , 


(6.39) 


and the fixed point argument of Eqs. (6.28)—(6.33) can 
then be used to show that Eqs. (6.39) are equivalent to 


Z=ext 






) 


(6.40) 


with r lnv the general-coordinate invariant effective action 
r inv =r[(^},g A ‘ r ,g 0 p]. (6.4D 


Equation (6.40) gives an exact description of the dynam¬ 
ics of the light matter-metric system in terms of a classi¬ 
cal variational principle 


(6.42) 


that is, for an isolated system, the background metric 
and the light-field expectation values must evolve ac¬ 
cording to a principle of stationary effective action. 

To put Eq. (6.42) in a more familiar form, let us as¬ 
sume the background metric to be slowly varying on the 
length scale of the heavy fields, so that the curvature 
dependence of r jnv can be approximated by writing 


r i„v[ () ,g° p ] = SefF.rn.nert ( 4> L ) ff 0 ] + S eff,grav [gap] 

+ small corrections , 


5 eff,matter [ yg°^] = minimal generally covariant 

extension of 




•^eff.grav =r inv [ (0} + o[ (a^ v ) 4 ] 

= fd 4 x 1 (X- 2A lnd ) , 

167rG; n d 

(6.43) 

with ^=/*[g a p] the curvature scalar constructed from 
g a p. As defined in Eq. (6.43), 5 ef f maller contains terms in 
T inv which are <f> L dependent and are of zeroth or first 
order in space-time derivatives of g a p f while Seff.grav 
contains terms independent of the matter fields <f> L f 
which are of zeroth through second order in space-time 
derivatives of g a p. Substituting Eq. (6.43) into Eq. (6.42) 


gives the classical Einstein equations 40 and the effective 
classical equations for the matter fields, 

j£= lS ~ + A - rV) 


— T’MV 

x mailer 


= ~ 7 i ^Seff,maner[(^),r P ] , 

V -g 

Tr|z7S effimaller [(^},r p ]=0. (6.44) 

5 l 9 I 

Of course, making the approximations of Eq. (6.43) is 
only a matter of convenience in dealing with slowly 
varying background metrics, and the exact dynamics of 
g a p and 1^ L }, including the effect of higher derivative 
terms in r inv [ (0 L },g"^], is always governed by Eq. 
(6.42). 

An alternative way of describing the dynamics of the 
light fields is to keep them as quantum variables and to 
introduce, inside the [<f> L ] functional integration, an ef¬ 
fective action which incorporates the quantum effects of 
the heavy fields [cf. Eq. (2.15) above]. To do this, we 
rewrite Eq. (6.38) in the form 

z= f d[</> L }e IWML] ' g ^ 1 , (6.45a) 

J dlg^di^Mg^g^] 


(6.45b) 


The dependence of W on g*p results from the fact that 
the general covariance of Eq. (6.45b) is broken by the 
fixed (nonscalar) light fields (^ L ), which act in the same 
manner as does the source term in Eq. (6.15), and 
prevent the application of the argument of Eqs. 
(6.11)—(6.14). Let us now introduce an additional exter¬ 
nal source for the metric and use it to construct a 
Legendre-transformed effective action functional for the 
metric, ni^-J.g^.g*,], as in Eqs. (6.15)—(6.22). [The 
prime on T' is to distinguish it from the functional 
r [{</> L }fg X<T fgafA introduced following Eq. (6.38), which 
was constructed by Legendre transforming with respect 
to both the metric and the light fields.] This allows us 
to rewrite Eq. (6.45) in the form 


z= f d [<j> L }ext gU (e 




(6.46) 


^Derivations of the Einstein equations similar to that of Eqs. 
(6.38) —(6.44) have been given by Fradkin and Vilkovisky 
(1977a, 1977b), by DeWitt (1979), and by Horowitz (1981). 
Fradkin and Vilkovisky (1977a, 1977b) and DeWitt (1979, 
1981) have emphasized that Eq. (6.42) contains corrections to 
the Einstein equations which are needed for rapidly varying 
metrics. For discussions of the “out-in” form of the semiclas- 
sical gravitational equations, see Kay (1981) and Horowitz 
(1981). 
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which gives an exact formulation of the quantum 
dynamics of the light fields and the background metric, 
expressed in terms of a general-coordinate noninvariant 
effective action r'. Because the integrand in Eq. (6.46) 
still depends on g*p, the fixed point argument of Eqs. 
(6.28) —(6.32) cannot be used to introduce a gauge- 
invariant effective action inside the light-field functional 
integration. An alternative way of seeing this is to note 


that the extremum in Eq. (6.46) makes g XtT a functional 
of the integration variables [<t> L } f and so the fixed refer¬ 
ence metric g^p cannot be equated to inside the 
functional integration. To proceed further, let us consid¬ 
er the mean-field approximation to Eq. (6.46), obtained 
by pulling the extremum over g** to the outside of the 
functional integration (which should be a physically 
reasonable approximation for the slowly varying com¬ 
ponents of g^), 


Z.,/.e*V,p 


(6.47a) 


Since Z is independent of g£p, Z m f is independent of g*p 
to within the accuracy of the mean-field approximation, 
and so we have 


e 

—rZ mf ~0 . (6.47b) 

°g a p 

Equations (6.47a,b) have the same structure as Eqs. 
(6.28a,b) above, and thus within the mean-field approxi¬ 
mation we can apply the fixed point argument of Eqs. 
(6.28)—(6.32), giving 

Z m/ «ext r<J f d\<t> L )e irL[ '* L] ' ga,,] , (6.48) 

with r[ nv the general-coordinate invariant effective action 
r[„v=r'[(^ i ),g Aa ,g 0 p] • (6.49) 

Assuming a slowly varying background metric and mak¬ 
ing an expansion of the primed effective action analo¬ 
gous to that of Eq. (6.43), we can approximate Eq. (6.48) 
by 

Znfxzext^pf 

g ^eff, matter 11 ^I + ^eff, 3 rav J 

(6.50) 

This gives the field equations for the background metric 
in the form 


8t tG\ 


(G' ,v +A; nd r v ) 


ind 


/ d[</> L je fS ^' nlltlCT[|0Z ' |,? ° g] r^ t e r 



rptfiV A-V 
*■ matter / > 


'rw pv _ 

1 matter — 


2 6 


‘S'eff, matter > 


(6.51a) 


with 10 + ) and | 0“ ) the “out” and “in” vacuum states 
for the observable matter fields. Thus, the background 
metric formalism, with the mean-field approximation of 
Eq. (6.47a) and an expansion for slowly varying metrics, 
gives the “out-in” form 40 of the semiclassical gravitation¬ 
al equations. The quantum field dynamics for the 
matter fields then follows in the usual fashion from the 
approximation to the partition function given in Eq. 
(6 50). Because the induced constants G[ nd and do 
not include the quantum effects of the light matter 
fields, they are not identical to the constants G ind and 
A ind defined in Eq. (6.43), which do include such effects. 
However, since one expects 

+^[(fpianck/fproton) 2 ] » (6.51b) 

with /p ro ton the proton Compton wavelength, the differ¬ 
ence between the primed and unprimed constants is nu¬ 
merically very small. 

B. Formulas for G|„ d and Am with a quantized metric 

To complete the analysis begun in Sec. VI.A, we must 
derive expressions for the induced gravitational and 
cosmological constants in terms of functional integrals 
over hpy and the matter fields, 41 and discuss the condi¬ 
tions under which these expressions yield finite answers. 
Since the gravitational effective action relevant to astron¬ 
omy and astrophysics is insensitive to the state of motion 
of the long-wavelength components of the matter fields, 
it is most convenient to start the derivation of this sec¬ 
tion from the formula 

e 'r i nv[f„pl =ext (e 'r tav [|? i ).* a ,l ) ; ( 6 . 52 ) 

10 I 

rather than from the functional r inv [(0},^ a ^] of Eq. 
(6.43). It is also convenient at this point to represent 33 
the gravitational compensatin g de terminant A[ g a p>gpv\ 
by an added action density ^—g JZ gh ost* and to adopt 
the convention that a functional argument h MV implicitly 
indicates a dependence on the ghost fields and that the 
integration measure dlh^] implicitly includes the ghost 
integration measure. By substituting the expansion of 
r inv «S effgrav from Eq. (6.43) into the left-hand side of 
Eq. (6.52), and noting that the right-hand side of Eq. 
(6.52) has a functional integral representation obtained 
by making the substitutions 

d[/i MV ]—►d[> JUV ]d(<£} , 

(6.53) 

S grav ^ ^"matter 4“ grav 


41 In an older terminology, we must compute expressions for 
the renormalized gravitational and cosmological constants, in¬ 
cluding radiative corrections arising from virtual metric and 
matter fluctuations, in terms of the bare parameters appearing 
in the fundamental Lagrangian. 
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in Eq. (6.25), we can rewrite Eq. (6.52) in the form 42 
= / d[h„ v ]d{<l>}e , S d * x * m ' gal! ' h ^ ( 

-^[ {»£a0»^juv] 

— ^~~~8 ( ^mattert { 4 *} »£juv] T“ -^gravt^juv] 

T“ “^ghostt^apt^juv]} 

+ V r ~i ^gflgaP'h^-hteJ^lgaf,] , 

£juv— ^juv + ^juv • (6.54) 

Since we now wish to study the effective action at gen¬ 
eral values of g a p f where it is not stationary, it is essen¬ 
tial to retain the source term J ktr [g a p] i Q The prob¬ 
lem of extracting expressions for G^ d and A ind from Eq. 
(6.54) has the same formal structure as that set out in 
Eqs. (5.1) and (5.2) and solved in Sec. V.A. Hence the 
desired formulas are obtained by making the following 
substitutions in Eqs. (5.8), (5.14), (5.18), and (5.19), 

/ fd[ ]= / dlh^dW , 


8 juv *8pv > 

•S[ / d*x , 

IF—.?[{*},gi*,V] . (6.55) 

In order to indicate explicitly the appearance of the 
source current in the following formulas, it is useful to 
introduce the notation 


■^’[W,g a p,h ltv ] = S’[{4>},g a p,h ltv ]~h k(T J^ J , 

^ J ’gaP’hfjv] z g [-^matter "t~ Xgrav ] 

+V^j? gf , 


J ka [g a p,y] 


Zap—VaP 



J^lga^y ] 


jr^\^y tX) , 


lajj ap 


SgjuvW 
in terms of which 

s=f . 


(6.56) 


(6.57) 


42 In Eq. (6.54) we have not required the total space-time 
volume to have a fixed value. Modifications required by a 
volume constraint and by the presence of boundaries are dis¬ 
cussed by Hawking (1979). A volume constr aint c an be includ¬ 
ed by adding a Lagrange multiplier term k 0 V^g to -2% which 
plays the role of a bare cosmological term and is discussed in 
more detail in Sec. VI. C below. Space-time boundaries require 
the addition to the Einstein-Hilbert action of a surface integral 
over the boundaries. Hasslacher and Mottola (1981) show that 
when the quantum fluctuations h MV in Eq. (6.54) are con¬ 
strained to have zero normal derivative on a boundary, so that 
the boundary does not fluctuate, a surface term of the expected 
form automatically appears in the induced gravitational effec¬ 
tive action. 


The tensors f k(T and ( f ha \t JV are implicitly defined by 
the relations 


fd[ 0) , 


0= (hfri 0)> 

Sg^U) 


Bap “^ap 


]e iS V^{x)h^{ 0) , 


(6.58a) 


(6.58b) 


V» v (x)=2 


Sg MV W 


f d 4 xj?[ {<!>},g a/} ,h MV ] 


Bap—^aP 


= V^(x) + V^ v (x) f 


V? v (x)=2 


d _ d d 
dg MV dx* aOAg^) 


_a_a a 

+ dx k dx a ao^a^v) 


X-^tW ,g a p,k MV ;x] 


Ba0 9 


Ff v (x)=-2 fd*zh^(z)^^ v {z,x) . 

After simplifications using Eq. (6.58), the formulas for 
A ind /G ind and G^ d take the form 


1 ^ind 
2-^ ^ind 


= (F I (0)> 0 , 


1 

167rG ind 


= <G(O)> 0 


i 

96 


fd 4 xx 2 


<^'(F 1 (x)F I (0))> o 


- {S'(V 2 (x)V 2 (0))) o 


(A( 0)> o = 


fd[ ]e lS A( 0) 
fd[]e* ’ 

fd[ ]e^A(x)B( 0) 


<jrC4(x).B(0))>o= . 

fd[ ]e ts 

f ? i(x)=F,(x)-(F,(x)> 0 , 


V 1 W= Vltv V l } v (x) , 

V 2 W= Vllv V$ v W , (F 2 U)> o =0, 

t/(x)=Eq. (5.14) with g /lv ->g /|v , jF-.-J’. (6.59) 

A second useful formula for A ind /G ind can be obtained 

by using Eq. (6.35) to calculate the conformal variation 
of r, giving 


1 ^ me] 

G ind 



~ k ~ w[f*,v al3 ] +s** 

®8ko 


(6.60) 
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Since 

~^-W[/^,V a p]=0 when/^= 0 , (6.61) 

&gka 

we learn from Eq. (6.60) that the condition for the 
cosmological constant A ind to vanish is the vanishing of 

This is of course expected, since when 
A ind vanishes, the induced gravitational action T inv [g a p\ 
is stationary at a Minkowski background metric T] a p. 
When rj a p is the stable ground state, the second-order 
fluctuation operator around r] a p has no negative eigen¬ 
values, and the functional integral formula of Eq. (6.59) 
is then guaranteed to give a real value for G^. 

Unlike the situation found in Sec. V.A, where 
(Cf(0)) o vanished, the term (Cf(0)> o in Eq. (6.59) con¬ 
tains nonvanishing contributions quadratic in the fluc¬ 
tuation metric, such as ((0 A juv /8x a ’) 2 ) o . Hence this 
term in the formula for G ^ 1 is qualitatively similar to 
the relation 

Gj-<*[*„„]>«> (6.62) 

proposed by Mansouri (1979, 1981), in papers suggesting 
that Einstein gravitation is generated by dynamical 
scale-invariance breaking in conformally invariant, 
order -/? 2 gravitational models. However, Eq. (6.62) 
[which omits the nonlocal V 2 terms of Eq. (6.59)] is not 
a quantitatively correct expression for Gj^J. 

C. Conditions for finiteness of G ^ 1 and A| nd 
and for the vanishing of A| nd 

Let us turn now to the issue of whether the formulas 
for G J and A ind given in Eq. (6.59) are finite. By con¬ 
struction, the fundamental Lagrangian density 
-^matter+-^grav contains a complete basis of dimension- 
four operators formed from the fields which are present, 
together with a number (say, N) of dimensionless un¬ 
renormalized couplings. The dimensional algorithm of 
Sec. II.C then guarantees that G,^ d and A ind will be cal¬ 
culable in terms of the corresponding N renormalized 
couplings . 43 If scale invariance remains unbroken, we 
get =0 = A ind . If dynamical breaking of scale in¬ 

variance occurs, we expect one of the N dimensionless 
couplings to be replaced by a scale mass as discussed 
in Sec. IV.B, and the theory will then yield nonvanishing 
predictions for G ; ~ d and A ind in terms of and the 
remaining V —1 dimensionless couplings. The ideal 
case, of course, would be that in which the fundamental 
action contains only one dimensionless coupling, so that 
after dynamical symmetry breaking and dimensional 


43 Note, however, that there is a dimension-two internal- 
symmetry scalar operator ^ 2 =R[h tlv \ which transforms as a 
Lorentz scalar with respect to the limiting value of the 
background metric appearing in Eq. (6.59). As a conse¬ 
quence, the U and V 2 terms in Eq. (6.59) in general will each 
be divergent, with the infinities cancelling only in their sum. 


transmutation, no free dimensionless coupling constants 
remain. 

Let us consider next the conditions under which the 
induced cosmological constant A ind vanishes, assuming 
initially that -^matter +-^grav has a unifying symmetry 
which leaves only a single dimensionless coupling con¬ 
stant, and which requires the vanishing of the bare 
cosmological constant. Then after dimensional transmu¬ 
tation, A ind will be calculable in terms of the scale mass 
(which is expected 44 to be in the range 10 14 — 10 19 
GeV), but in general A ind /^ 2 will be a number of order 
unity, in violent contradiction to Eq. (2.23). The only 
way to save the situation is for the underlying theory to 
have a “hidden” symmetry which guarantees the vanish¬ 
ing of A ind , as discussed recently by Pagels (1982). The 
difficulty with implementing this mechanism is that in 
order for the hidden symmetry to restrict A ind it must be 
an unbroken symmetry, and no natural candidate for 
such a symmetry is known . 45 

An interesting alternative possibility is suggested by 
recent work in which Ovrut and Wess (1982) use a 
cosmological constant as a mechanism for breaking su¬ 
persymmetry. Suppose that the unifying symmetry al¬ 
lows only a single dimensionless coupling constant but 
does not restrict the value of the bare cosmolo gical con¬ 
stant, so that we can freely add a term f d A xV — g k 0 to 
the fundamental action. Because k 0 has dimension four, 
any polynomial formed from k 0 and the fields will have 
dimension greater than or equal to four, and so the ad¬ 
ded term does not require the introduction of any dimen¬ 
sional renormalization constants with dimension smaller 
than four. After dynamical symmetry breaking, the 
theory now has two dimensional parameters, #c 0 and 
or equivalently, A ind and We can then impose as a 
renormalization condition the requirement that in the ab¬ 
sence of real (as opposed to virtual) matter, the Min¬ 
kowski metric 17 ^ be the stable background metric, 
which will require 46 

A i„ d =0. (6.63) 

This leaves only one dimensional parameter in terms 
of which all particle masses and Newton’s constant are 
calculable. 

In order to implement this alternative mechanism, we 
must have justifications both for assuming that the bare 
cosmological constant k 0 is nonzero, and for imposing 


44 This range extends from the so-called “grand unification 
mass” of particle physics [see Weinberg (1980b) for a review] 
to the Planck mass. 

45 An unbroken “hidden” symmetry is also required if the uni¬ 
fying symmetry specifies a definite nonzero value for the bare 
cosmological constant. For a recent survey of quantum gravi¬ 
ty with a cosmological constant, see Christensen and Duff 
(1980). 

^The fact that stability of the Minkowski metric requires the 
vanishing of Aj n d is noted and used as a renormalization condi¬ 
tion in Brout et al. (1980). 
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the renormalization condition that the induced (or renor¬ 
malized) cosmological constant A ind vanish. A possible 
rationale for assuming that /c 0 is nonzero has been given 
by Hawking (1979), who points out that in order to con¬ 
struct a partition function Z for a fixed total space-time 
volume one must include a Lagrange multiplier for this 
volume, and this is formally equivalent to including a 
bare cosmological term in the fundamental action. 47 A 
possible rationale for the renormalization condition 
A in d=0 could be provided by the observation that in a 
two-parameter theory, the ratio A ind /~^ 2 is not constant 
in nonequilibrium situations. If one could show that 
nonequilibrium processes in the early universe, such as 
back-reaction effects from particle production, resulted 
in the decay of A ind towards an equilibrium value of 
zero, 48 then use of the renormalization condition A ind =0 
in the equilibrium analysis of Sec. VI.B would be justi¬ 
fied. 

D. Structure and properties 

of the fundamental gravitational action 

In this final section I will comment very briefly on the 
structure and on some of the properties of the funda¬ 
mental gravitational action. I have assumed in the 
preceding discussion a general order-/? 2 gravitational ac¬ 
tion density of the form 

+B 0 <%*+C 0 jr , (6.64) 


with defined in Eq. (2.35). The study of gravi¬ 

tational actions of this type was initiated by Utiyama 
and DeWitt (1962), and a proof that they lead to a renor- 
malizable perturbation theory has been given by Stelle 
(1977). 49 When dimensional regularization is employed, 
all three terms in Eq. (6.64) are in general needed, as 
shown in detail for the case of a scalar field by Brown 
(1977) and by Brown and Collins (1980). Even though 
the action formed from & is a topological invariant in 
four dimensions, it is not a topological invariant in leo 
dimensions, and so makes a nontrivial contribution when 
multiplied by the power series in (<y—2)” 1 contained in 
the coefficient A 0 . The only circumstance under which a 
term = or 3ST) can be omitted from Eq. (6.64) 

is when the theory with S' deleted has special sym¬ 
metries, which guarantee that no divei^ences with the 
structure of S' are encountered. Thus, for example, a re- 
normalizable theory of matter and gravitation could be 
formulated without including any order-/? 2 terms in the 
fundamental action, only if .if matter itself had enough 
symmetry so that no divergences with the structure of 
or ST were encountered. Whether such matter 
actions can be constructed is not presently known. A 
more realistic possibility for omitting terms from Eq. 
(6.64) is afforded by the case of classically conformally 
invariant theories, in which there are hints 50 that the in¬ 
duced ST term may always have a finite coefficient, per¬ 
mitting one to take C o =0 in Eq. (6.64). It is possible to 


47 The value of kq would then presumably be a parameter characterizing the initial quantum fluctuation which led to the birth of 
the universe. 

48 For a review of gravitational particle production, see Parker (1977), while for an effective action formalism for particle produc¬ 
tion in the early universe, see Hartle (1977). In an earlier article, Parker (1969), p. 1066, postulated that “the reaction of the parti¬ 
cle creation (or annihilation) back on the gravitational field will modify the expansion in such a way as to reduce the creation 
rate.” Since Aj nd > 0 corresponds to positive vacuum energy, a naive extension of this postulate suggests that a state of the early 
universe with Amd>0 will decay by gravitational particle production to an equilibrium with A ind =0, at which point particle pro¬ 
duction ceases. Variants of this idea have appeared in models for the creation of the universe through a quantum tunneling event 
given by Brout et al. (1978), Brout et al. (1979), Guth (1981), Akatz and Pagels (1982), and Gott (1982). The models of Brout 
et al. and Gott postulate a transition from a particle producing de Sitter phase with A lnd =0, Ti^ ttcr — —/cg MV , /pii,ck to a stand¬ 
ard equation of state with P — yp (P—pressure, p—density) as a result of back-reaction effects of particle production. When the 

term — is transposed to the G MV + A indg Mv side of the Einstein equations, k is equivalent to an initially nonvanishing A ind /Gj nd . 

Attempts to find an instability associated with A ind ^=0, within the framework of the semiclassical approximation for the coupled 
matter-metric system [cf. Eq. (6.51) of the text] have not been successful. Abbott and Deser (1982) have shown that the de Sitter 
solutions obtained when the Einstein equations are solved with A jnd ^0,7’ /4V =0 are classically stable against small perturbations. 
Particle production calculations in de Sitter spaces using the semiclassical formalism have not yielded an unambiguous answer; see 
Parker (1977), p. 136, and Gibbons (1979), p. 666, for a discussion and references. Hence a dynamical argument to explain why 
A ind «0 would have to involve nonequilibrium phenomena and/or higher-loop quantum effects which are ignored in the semiclassi¬ 
cal approximation. 

49 For further references, see the review of Weinberg (1979). 

50 There are two pieces of evidence that a bare 3T term may not be needed in conformally invariant theories, both coming from 
the study of conformally invariant matter theories on an unquantized background manifold. The first is that apart from a total 
divergence, the conformal trace anomaly has only % and terms, which implies that the one-loop Lagrangian counterterm con¬ 
tains no divergences proportional to JT. [For a succinct discussion and references, see Tsao (1977).] The second is a general for¬ 
mula which Zee (1982b) has recently derived for the coefficient C ind of the induced JT term, 

Cind = ~"i382^/ E d4 *<* 2)2 '*'<* 2) - 

in the notation of Eq. (5.44). Since in an asymptotically free gauge theory one has v J'~(x 2 )“ 4 (logx 2 ) “ 2 for large x 2 [cf. Eq. (5.50)], 
the integral for C; nd is just barely convergent. Zee’s formula also shows that C^d is negative definite, and so the theory is free of 
tachyons; in this connection see also Horowitz (1981) and Yamagishi (1982). Since the gravitational theory of Eq. (6.66) is asymp¬ 
totically free, it seems a reasonable conjecture that Zee’s results will generalize to the case in which the metric is also quantized. 
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construct renormalizable order-/? 2 gravitational theories 
of greater complexity than Eq. (6.64) by adding new field 
degrees of freedom in a number of ways (for example, by 
including torsion 51 or superfields 52 ). A prime considera¬ 
tion in searching for the correct gravitational action will 
almost certainly be that it should unify in a natural way 
with the fundamental matter action -S? maltC r» when that is 
finally known; this may involve the introduction of “pre¬ 
geometric” fundamental variables 39,53 which are not 
directly classifiable as “matter” or “metric.” 

The momentum space graviton propagator calculated 
from the fundamental action density of Eq. (6.64) con¬ 
tains a term proportional to 


(. k z ) 


2\2 


= lim 


m‘ 


.o nt 


1 


k 2 +m 2 


(6.65) 


Since the second pole-term in Eq. (6.65) has an unphysi¬ 
cal, negative residue, order-/? 2 theories do not satisfy 
unitarity (with positive probabilities) at the tree level. 
However, unitarity is a statement about the asymptotic 
scattering states of a field theory and their 5-matrix, and 
hence unlike renormalizability, is a dynamical, rather 
than a kinematic statement. Thus if radiative correc¬ 
tions play an important role in the dynamics (and they 
certainly do in theories with dynamical scale-invariance 
breaking), violations of tree-level unitarity do not neces¬ 
sarily imply violations of unitarity in the full theory. 
This point was first made a decade ago by Lee and Wick 
(1969, 1970), who showed that if fields which have 
negative-residue “ghost” propagators at the tree level be¬ 
come unstable as a result of radiative corrections, then 
the 5 matrix for the asymptotic scattering states can 
obey unitarity with positive probabilities. The relevance 
of the Lee-Wick mechanism for quantum gravity was 
first pointed out by Tomboulis (1977) and has since been 
discussed by a number of authors. 54 As a concrete ex¬ 
ample [see Hasslacher and Mottola (1981)], let us consid¬ 
er a conformally invariant order-/? 2 theory with the fun¬ 
damental action 


-^grav— A 0 9+BoJr, B o=— » 

& = Gauss-Bonnet density [Eq. (2.35)] , 

£f=C llvko O lv> * , ( 6 . 66 ) 

with the sign of B 0 chosen to guarantee that the Euclide¬ 
an continuation of the partition function is represented 


5l Models with torsion have been discussed by Neville (1980) 
and by Sezgin and van Nieuwenhuizen (1980), who give fur¬ 
ther references. 

52 For a discussion of conformal supergravity see Kaku, 
Townsend, and van Nieuwenhuizen (1978). 

53 For attempts at pregeometric theories of gravitation, see 
Amati and Veneziano (1981), Terazawa and Akama (1980a, 
1980b) and Terazawa (I98la, 1981b). 

^ee Adler (1980b), Hasslacher and Mottola (1981), Tom¬ 
boulis (1980), and also Salam and Strathdee (1978). 


by a convergent functional integral. (The $ term in the 
action plays no role in the following discussion and in 
general does not affect the field equations.) Taking into 
account the fact that radiative corrections induce an ef¬ 
fective Newton's constant, and assuming that G ind has 
the correct positive sign, a simple calculation shows that 
the spin-2 part of the full graviton propagator has the 
form 


p( 2) 

r yya§_ 


k 2 [£Uc 2 r 2 k 2 +m 2 (k 2 )] 



_ 1 _ 

k 2 


_ 1 _ 

k 2 +£{k 2 ) 2 m 2 {k 2 ) 


(6.67) 


Here is a spin-2 projection matrix, m 2 (k 2 ) is the 

amplitude [analogous to {d/dk 2 )X{k 2 ) of Eq. (5.30)] 
which gives G£ d in the zero-momentum limit, 


m 2 (0) = 


1 

167TG ind ’ 


( 6 . 68 ) 


and £(k 2 ) is the (one-loop) running coupling constant for 
the action of Eq. (6.66), 


£(/c 2 ) 2 = 


£(M 2 ) 2 

1 + 7 b gifi 2 ) 2 log( k 2 /fi 2 ) 


(6.69) 


In the timelike region, where k 2 < 0, both |'(/c 2 ) 2 and 
m{k 2 ) have imaginary parts, and consequently the pro¬ 
pagator of Eq. (6.67) has two complex conjugate unstable 
ghost poles rather than a single stable ghost pole. Thus 
it appears that the Lee-Wick mechanism is applicable to 
order-/? 2 gravitational theories; more detailed checks on 
this are now needed. 

A further property of order-/? 2 gravitational theories, 
which is illustrated by Eqs. (6.67) and (6.69), is that they 
are asymptotically free. This follows from work of Julve 
and Tonin (1978), as corrected and extended by Fradkin 
and Tseytlin (1981) [see also Tomboulis (1980) and 
Christensen (1982)], showing that b >0 in Eq. (6.69) and 
in the analogous equation for the running coupling con¬ 
stant associated with the JT term in Eq. (6.64). The 
scale mass which characterizes the strong coupling 
region for the fundamental theory is presumably the 
Planck mass mp^. At energies much higher than the 
Planck mass, the theory becomes weakly coupled, and so 
no singularities are expected. 55 At energies much lower 
than the Planck mass, the induced gravitational term 
dominates, 

$Uc 2 r 2 k 2 + m 2 Uc 2 ) -* m 2 (0)= 1 ■ ■ , (6.70) 

k 2 -*0 1 07TCj j nd 


reflecting the presence of an extra power of k 2 multiply¬ 
ing §(k 2 )~ 2 in Eqs. (6.67) and (6.70), and giving gravita¬ 
tion the form seen in observational astronomy. 


55 For discussions of singularity avoidance in order-i? 2 
theories, see Hu (1979), Tomboulis (1980), and Hasslacher and 
Mottola (1981). 
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APPENDIX A: DETAILS FOR THE BASIC 
THEOREMS 

1. Arguments excluding dimension-two 
Lorentz-scalar operators 

a. Pure non-Abeifan gauge theories In axial 
and covariant gauges 

The necessity for gauge fixing and ghosts requires, in 
the case of nonAbelian gauge theories, that we give a 
somewhat more careful argument for the absence of 
dimension-two Lorentz-scalar and internal symmetry- 
invariant operators <? 2 than would be needed in the 
Abelian case. Let me give first the argument working in 
axial gauge 

A[ = 0 . (Al) 

Since axial gauge is a canonical gauge (Hanson et al ., 
1976), no ghost fields are present. Hence invariance 
under the subgroup of the Lorentz group which leaves 
the z axis invariant and invariance under global internal 
symmetry transformations restrict a candidate for ^ 2 to 


have the form 

=A‘A lx +A‘A iy +A’A " . (A2) 

Consider now the local gauge transformation 

SA ; =d li <t> i -g 0 f i J k Al<t> k , (A3) 

with <t> k =<t> k (x,y,t) independent of z, so that 

8A‘=d z <t>‘-g 0 f'J k At<t> k =0 . (A4) 

Under the transformation of Eq. (A3) we have 

Sf^Aid^'+Afa&'+Ald,® 1 ^ , (A5) 


and so 0' 2 is not invariant under the subclass of local 
gauge transformations which preserves the A [=0 gauge 
condition. Thus Eq. (A2) is not a physically observable 
dimension-two Lorentz-scalar operator. 

I give next a covariant gauge argument, following the 
notation of Kugo and Ojima (1979), which uses an inner 


product (•) and an outer product (x) to denote contrac¬ 
tion of internal symmetry indices with and f ,jk y 
respectively. In covariant gauge, we have 

-^=-T^v*^ V +"^GF+^FP > (A6) 

with the gauge-fixing (GF) and Fadde’ev-Popov (FP) La- 
grangian terms given by 


a 0 

^ GF =-a »BA„ + —BB 




B + —d tl A 
«o 


-d/B-A*) , 


Jfpp^—id^c-D^c , 

— 0^ — goA ju X, c*=c, c*=c. (A7) 

In Eqs. (A6) and (A7), S is an auxiliary scalar field, a 0 
is a gauge parameter, and c is the Fadde’ev-Popov ghost 
field. The Lagrangian density of Eq. (A6) is invariant 
under the Becchi-Rouet-Stora (BRS, 1976) transforma¬ 
tion 


8A m =kD fi c , 
8c=kg 0 (cxc)/2 , 
bc=ikB , 


85=0, (A8) 

with k an x-independent parameter which anticommutes 
with c and c, and all physically observable operators 
must be similarly invariant. In covariant gauge, Lorentz 
invariance and invariance under global internal symme¬ 
try transformations restrict a candidate for 0 2 to have 
the form (for any constant (3 0 ) 

tf^Ap-A^+Poc-c . (A9) 

Under the transformation of Eq. (A8), the change in ^ 2 
is 


b&^TA^'kdpC + p 0 [ikBc\kg 0 c-(cxc)]=£® , 

(A 10) 

and so ^ 2 is not a BRS invariant. Hence Eq. (A9) does 
not give a physically observable dimension-two Lorentz- 
scalar operator. 

We have thus concluded, by working in either axial or 
covariant gauge, that in a pure non-Abelian gauge theory 
there are no Lorentz and internal symmetry-invariant 
operators # 2 , and hence no action density terms ^ 2 R in 
curved space-time. 


b. Massless supersymmetric theories with spin-0 fields 

An extension of the above argument excludes Lorentz- 
and internal symmetry-invariant dimension-two opera¬ 
tors $2 i n massless supersymmetric theories with spin-0 
fields, Let q> be a massless spin-0 field which has a Ma- 
jorana spinor supersymmetry partner ip. Under super- 
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symmetry transformations, q> transforms 5 as 

8<p=i{xpa — atp) , (All) 

with a an x-independent parameter which anticommutes 
with tp. Lorentz invariance allows a candidate for 0 2 of 
the form 

<?2 =<P 2 , (A 12) 

but under supersymmetry transformations the change in 
is 

5(^2 = 2<p8<p=£0 . (AI3) 

Hence is not an internal symmetry invariant, and an 
action density term is excluded. (A dimension-two 
supersymmetry invariant is readily constructed by adding 
to $' 2 a fermionic piece proportional to xpxp/m 0> but this 
requires the introduction of a mass parameter mo-) 

2. Extension to massive regulator schemes 

When massive regulators are employed, we learn from 
the enumeration of Sec. II.C that there are Lagrangian 
density terms of the form 

T'=M\ U'=M 2 R , (A14) 


tion schemes. When there are no bare masses and no 
scalar fields apart from members of massless supermul- 
tiplets, no other dimension-four operator is present in 
curved space-time, and 8Jf then vanishes. We conclude 
that 

8(A; nd /G ind )=8(GiJ )=0 ; (A16) 

that is, under the necessary conditions discussed in Sec. 
II,D, the renormalized induced gravitational action cal¬ 
culated using massive regulators is unique, and agrees 
with that calculated by using the method of dimensional 
regularization. 

APPENDIX B: DETAILS FOR THE CALCULATION 
OF GJ IN SU (n) GAUGE THEORY 

1. Transformation to one-loop exact 
renormalization group 

In an SU(jz) gauge theory, the behavior of physical 
parameters under changes in the renormalization sub¬ 
traction point fi is governed [through Eq, (4.18)] by the 
function /3(g), which has the power-series expansion 

I3(g)=- (7&og 3 +M 5 + & 2 g 7 + ■ " > ■ <B1> 


with Af 4 and Af 2 schematically indicating polynomials 
which are, respectively, quartic and quadratic in the re¬ 
gulator masses. The term T ' contributes to the induced 
cosmological constant A lnd /Gi nd through the operator 
7X0) of Eq. (5.19a), while the term U' contributes to the 
induced gravitational constant Gfoj through the operator 
t/(0) of Eq. (5.19b). The coefficients of T' and U' 
(which in general depend logarithmically on the regula¬ 
tor masses) are determined by the requirement that 
A ln d/G lnd and Gj^J remain finite as the regulator masses 
tend to infinity. Consider now the differences 


S( A ind /G i nd ) = ( A ind /G ind ) massive 

regulator 


( Aind/Gjnd 


dimensional 

regularization 


y 


8 (Gj)=(Gj 


) massive (G; n d ) dimensional 
regu lator regu larizat ion 


(A15) 


between the finite induced constants calculated using 
massive regulators, and the finite values calculated using 
dimensional regularization. According to the dimension¬ 
al algorithm, differences such as these between the finite 
values of connected, one-particle irreducible matrix ele¬ 
ments evaluated in two different regularization schemes 
must be representable as the corresponding matrix ele¬ 
ments of a Lagrangian density polynomial 8Jf formed 
from the bare masses, the bare fields, and d/dx 1 *. The 
polynomial 8-2f cannot contain the terms T' and U' of 
Eq. (Al4), since any nonzero multiple of these bases is 
necessarily at least quadratically divergent as the regula¬ 
tor masses tend to infinity. The polynomial 8Jf also 
cannot contain any field-dependent dimension-four La¬ 
grangian terms which survive in the flat space-time lim¬ 
it, since these would give rise to differences in the flat 
space-time S matrices calculated in the two regulariza- 


Only the first two coefficients b 0 ,i are gauge invariant, 
and only these coefficients are invariant under coupling 
constant transformations g—>g' of the form 

g=g(g')=g' + 2 ^n(g') 2rt + l » (B 2 ) 

n - I 

which are analytic in a neighborhood of g =0. ’t Hooft 
(1979) pointed out that the noninvariance of b 2 >... 
under the transformation of Eq. (B2) could be exploited 
to define a transformation which, in a formal perturba¬ 
tive sense, makes the transformed coefficients b 2 >. .. 
vanish. Global conditions for the existence of a non¬ 
singular ’t Hooft transform were studied by Khuri and 
McBryan (1979); if singular transformations are not ex¬ 
cluded [see Frishman, Horsely, and Wolff (1981) for ar¬ 
guments suggesting the physical relevance of singular 
coupling constant transformations], then a transforma¬ 
tion to a two-loop exact renormalization group can al¬ 
ways be made, giving 

/3(g)=-(T8og 3 +*ig 5 ) • (B3) 

Following Adler (1981), let us now make a further, non- 
analytic transformation to a new “reduced’* running cou¬ 
pling constant g R for which a one-loop renormalization 
group structure is exact. (In the applications of the one- 
loop exact running coupling constant in Sec. V.D of the 
text, the subscript R is omitted.) Writing a R ~g Rt 
a=g 2 y the transformation is simply 

_1_J_, r °° da* 

a R ~~ 2 °Ja 0 (5 ') ’ 

P(a)=gP=—(^b 0 a 2 +b l a 3 ), (B4) 
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which is easily seen to give a nonsingular mapping from 
the half-line 0<a< oo to the half-line 0<a R < oo. The 
renormalization group structure in the new variable a R 
is determined by P R (a R ), given by 

P R (a R )=m) — -=m)(-a R )-——=--b 0 a\ , 

da oa £ 

(B5) 


and so has exactly a one-loop form. 

Explicitly integrating Eq. (B4) gives for the transfor¬ 
mation 



a* 



log 


aa 


+ log(l+aa) 


a = 


2 b x 
~bo 


* 


(B6) 


which for small a a can be developed into a series expan¬ 
sion, 

1 1 
—— = ——a 

a R a 

Equation (B7) can be inverted to give an expansion for a 
in terms of a R and log (aa R ) y 


log 


aa 


, ^ (-fla)" 

+a 2, - 


(B7) 


flal 


n 


a=a R (\+a R f) , 

/= 2 S Kfk > 

k=0 

fo=a log (aa R ) , 

/i=/o+«/o—» • • • • (B8) 

Because / 0 contains a logarithm, the transformation is 
nonanalytic at a*=0, which is why the coefficient b 1 
can be transformed to zero. Substituting Eq. (B8) into a 
perturbation series which has been brought to ’t HoofVs 
form yields a modified perturbation series in terms of the 
new running coupling constant g Ry for which the one- 
loop renormalization group is exact. The modified ex¬ 
pansion has the form of a joint power series in a R and 
log(aa^) in which, for a physical quantity with leading- 
order contribution at order a Rt the general term has the 
form a R [log(aa* )] p , with n >L and with 
p <rc — max(l,£). 


2. Leading short-distance contribution to V(f) 

Asymptotic freedom implies that the leading short- 
distance contribution to 'P(f) is obtained by doing a 
lowest-order perturbation theory calculation, with the 
coupling constant g 2 replaced by the running coupling 
constant g 2 (t). Thus from Eqs. (5.43) and (5.49) we get 


with F 2 a shorthand for F^ a F rika ' y and with the subscript 
0 E indicating the Euclidean vacuum expectation. In 
lowest-order perturbation theory, the square bracket in 
Eq. (B9) is given by 

{^{F\x)FhO)) )oe - (F 2 )Ie 

= 2[<.r-(FL(x)F£ v (0))>o£] 2 

=2[<^'(a [ ^' ff j(x)a [/( ^i,(o))> 0£ ] 2 , (BIO) 

with [,] indicating antisymmetrization of indices. Sub¬ 
stituting the Euclidean Feynman propagator 

6 IJ 8 

{.TU^WA^y)))^ ° v , (Bll) 

(Z7r) (x — yr 

and carrying out the differentiations and contractions, an 
elementary calculation gives 


f (^’(dix r A , a ^(x)3f fl Aij(0 )))o£]' 

=^(« 2 -i>4 

( 2 ;rf t 4 

=>4'(D=-^^-(« 2 -1) 


1 


(B12) 


{Irrf . t\- log^ 2 D 2 ’ 

yielding the value of C* given in Eq. (5.52) of the text. 


3. Dimensional continuation evaluation 
of comparison integrals 


I give here two evaluations of the integral of Eq. (5.55) 
by dimensional continuation. In the first calculation, 
only the power of u in the integrand is dimensionally 
continued, while the logarithms are kept in dimension 
four. In the second calculation [restricted for simplicity 
to the leading term in 0(w)] both the power of u and the 
logarithms are dimensionally continued, corresponding to 
use of the 2co -dimensional vacuum expectation in Eq. 
(5.22). The two calculations give the same answer, as ex¬ 
pected where a finite radiative correction is evaluated by 
different regularization methods. In the context of the 
second calculation, we can compare the analyticity prop¬ 
erties in co of the dimensional continuation of a finite 
sum of Feynman diagrams, with the analyticity proper¬ 
ties of the infinite sum of Feynman diagrams contained 
in the running coupling constant factor g\t). 

In 2m dimensions, the factor d 2a x in Eq. (5.22) is pro¬ 
portional to dtt 0> ~ l y and since T(x) has canonical di¬ 
mension 2co y the leading power behavior of the vacuum 
expectation (^(Z’(x)Z’(0)))o is t~ 2o> . Hence when the 
logarithmic sum Q(u)Alogu ) 2 is kept in four dimensions 
(and when a normalization factor of 77 ^/ir 2 is omitted), 
the continuation of the integral of Eq. (5.55) is 



V . 



u 


-to O(u) 

(log u) 2 


(B13) 


— (F 2 (x))oe] , 


(B9) and is convergent at u = 0 when R eco < 1. In order to 
put Eq. (B13) in a form where it can be analytically con- 
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tinued to a = 2 , let us first make the change of variable 
u =e“ u , giving 





(B14) 


with the contour of integration running along the posi¬ 
tive real axis. When Rea<l and lma>0, the integra¬ 
tion contour can be deformed to the contour C of Fig. 5, 
while when Rea < 1 and Ima < 0 , the contour can be de¬ 
formed to a contour C*, obtained by reflecting C in the 
real axis. Once the contour has been deformed to C or 
C*, the integral of Eq. (B14) converges for any value of 
Rea, and we can continue Rea to 2. Since Hermiticity 
of a quantum field theory requires that the regulariza¬ 
tion prescription be manifestly real (contour prescriptions 
can enter only through Feynman propagators), the limit 
as a —>2 must be defined as the average of dimensional 
continuations to co=2 + ie and to a=2— i€. That is, we 
must average the evaluations of Eq. (B14) with a=2 on 
the contours C and C*, or equivalently, take the real part 
of the evaluation on the contour C alone, yielding the 
formula given in Eq. (5.56) of the text. The in¬ 
equivalence of the evaluations on C and C* implies that 
the analytic continuation of Eq. (B14) to Rea> 1 has a 
branch cut running along the positive real axis from 
a —1 (space-time dimension two) to oo. 

To study the effect of dimensionally continuing the 
logarithmic terms in Eq. (5.55), we note that a momen¬ 
tum space factor logfc 2 continues into 


(k 2 r ~ 2 . ( , (/c 2 r- 2 -i 

-— + counterterm=--— 

a—2 a—2 


(B15) 


and corresponding to this, a coordinate space factor 
log(—x 2 )=logf continues into 


(x 2 ) 2 , , , (x 2 ) 2 1 

— -h counterterm =--- 

2—a 2—a 


(B16) 


Hence let us consider the integral 

1 


r u o 

I{co y y) — J Q du u 


— co. 


1 


2 9 


(B17) 


which when y=2—co describes the continuation of the 
leading logarithmic term in ©(«), and when y= 0 
reduces to the integral, studied above, in which the loga¬ 
rithmic factor is not continued, 

/(w, 0 )= f U °du u ^ . (B18) 

J ° dog u) 2 

To study the a-plane analyticity of Eq. (B17), we expand 
the factor (1 — u Y )~ 2 into a power series in u Y (since 
u 0 < 1 , this is permitted for y > 0 ), and then do the u in¬ 
tegrations assuming Rea< 1, giving 


/ (co,y)—y 2 f 0 °du u a '£(n+\)u nY 


00 


n =0 

uZ y+ 


=r 2 (w +1) - , , 

„-o ny+l-o) 


When y is regarded as a parameter independent of a, 
Eq. (B19) shows that I{co,y) is a meromorphic function 
of a, with poles at a=l + ny, n= 0,1,.... In the limit 
as y—> 0 for fixed a, these poles coalesce into a branch 
cut running from a=l to a= + oo, which is just the 
analyticity structure of l(a, 0 ) which was inferred from 
the discussion following Eq. (B14) above. When the 
value y= 2 —a, corresponding to continuation of the log¬ 
arithm, is substituted into Eq. (B19), we get 

/(a, 2 —a) 

. 2 / 1+1 -«(n + l) 

Up _ 

2n 1 — coin 1 ) 


=(2—a) 2 ^ ( n +D 


=( 2 -©)«£-" 2 F, 


2 , 


1— a 1—a 

2 — a’2—a 


+ 1 ;u 


2 —w 
0 


* 


(B20) 


with the hypergeometric function 2 F l (a= 2 , b; 

c + l;z) defined by 


2 F i(2 yb\b +1 \2 .)— 



n «0 


(n + l)z n 
b +n 


(B21) 


The singularities of Eq. (B21) are poles at — n> cor¬ 
responding to poles in a at co n given by 

2n +, - 1 <2 , ( B 22 ) 

n +1 


and a cut along the real z axis from z — 1 to z — oo, cor¬ 
responding to a cut along the real a axis from a = 2 to 
a= oo. Hence there is an infinite accumulation of poles 
on the real axis to the left of a= 2 , and a branch cut on 
the real axis to the right of a — 2. As a result, the limit 
co—>2 cannot be taken along the real axis, and instead 
must be defined as the average of limits from above and 
below the real axis, giving the real part prescription of 
Eq. (5.56). The fact that a = 2 is a branch point is a 
direct result of the fact that / (a, 2 — a) is the sum of an 
infinite number of Feynman diagrams. If the sum in 
Eq. (B20) is truncated at n =N , corresponding to retain¬ 
ing only contributions to the running coupling constant 
through A-loop order, one gets a meromorphic function 
of a which is regular at a =2. This is the result expect¬ 
ed from the discussion of the dimensional continuation 
of individual Feynman diagrams given in Sec. III. 

We must still show that, as a —>2 in a real part or 
principal value sense, /(a, 2 —a) approaches the leading 
term of Eq. (5.56) of the text. To do this, let us again 
make the change of variable u —e~ v t giving 


/(a,2—a)= f 


00 


log«o 1 


dv 


,(w--l)u 


V 


1 

1 —e<®“ 2)0 

(2 — co)v 


(B23) 


For Rea<l, lma>0, and v in the first quadrant, we 
have 
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Re[(a — 1 )y]—Re(a— l)Reu — Im(a — 1 )lmv <0 , 

(B24) 

Re[(a—2)y]=Re[(a— l)u]— Ren <0 , 
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and so the contour of integration can be deformed to C 
without encountering poles coming from vanishing of 
the denominator in Eq. (B23). We can then set 
io=2+i£, giving 


Re[/(2+/e, — ie)] = Re 




00 


dv 


lpgUQ * 1 V 


e v F(v,e) 


* 


(B25) 


with F(y,e) given by 


,iev 


F{V,€) = 


1 — e 


lev 


l€V 


T > 


F(V,€) 

F{V 9 €Y 



| v I «€ A , 

< 1, v imaginary . 


(B26) 


Since the integral of Eq. (B25) is absolutely and uniform¬ 
ly convergent for all e>0, we can take the limit as €—>0 
inside the integral, giving 


7(2,0)= lim Re[J(2+«V,-«V)] 
€—0 


= Re 



(B27) 


The result of this rather tedious analysis thus reproduces 
the leading, 0=1, term of Eq. (5.56). 


REFERENCES 


Abbott, L. F., 1981, Nucl. Phys. B 185, 189. 

Abbott, L. F., and S. Deser, 1982, Nucl. Phys. B 195, 76. 

Abers, E. S., and B. W. Lee, 1973, Phys. Rep. 9, 1. 

Adler, S. L., 1976, Phys. Rev. D 14, 379. 

Adler, S. L., 1980a, Phys. Rev. Lett. 44, 1567. 

Adler, S. L., 1980b, Phys. Lett. B 95, 241. 

Adler, S. L., 1980c, “Induced gravitation,” to appear in The 
High Energy Limit (the 1980 Erice lectures), edited by A. Zi- 
chichi (Plenum, New York). 

Adler, S. L., 1981, Phys. Rev. D 23, 2905, Appendix B. 

Adler, S. L., J. C. Collins, and A. Duncan, 1977, Phys. Rev. D 
15, 1712. 

Adler, S. L., J. Lieberman, Y. J. Ng, and H. S. Tsao, 1976, 
Phys. Rev. D 14, 359. 

Akama, K., Y. Chikashige, T. Matsuki, and H. Terazawa, 
1978, Prog. Theor. Phys. 60, 868. 

Akatz, D., and H. Pagels, 1982, Phys. Rev. D 25, 2065. 

Amati, D., and G. Veneziano, 1981, Phys. Lett. B 105, 358. 

Ambjom, J., and P. Olesen, 1980, Nucl. Phys. B 170, 60. 

Banks, T., R. Horsley, H. R. Rubinstein, and U. Wolff, 1981, 
Nucl. Phys. B 190, 692. 

Bardeen, J., L. N. Cooper, and J. R. Schrieffer, 1957, Phys. 
Rev. 106, 162. 

Batalin, I. A., and E. S. Fradkin, 1979, Phys. Lett. B 86, 263. 

Batalin, I. A., S. G. Matinyan, and G. K. Savvidi, 1977, Yad. 


Fiz. 26, 407 [Sov. J. Nucl. Phys. 26, 214]. 

Becchi, C., A. Rouet, and R. Stora, 1976, Ann. Phys. (N.Y.) 
98, 287. 

Berg, B., 1981, Phys. Lett. B 97, 401. 

Bernard, C., 1979, Phys. Rev. D 19, 3013. 

Bernstein, J., 1974, Rev. Mod. Phys. 46, 7. 

Bhanot, G., and C. Rebbi, 1981, Nucl. Phys. B 180, 469. 

Binder, K., 1976, in Phase Transitions and Critical Phenomena , 
edited by C. Domb and M. S. Green (Academic, New York), 
Vol. 56, pp. 1— 101. 

Bjorken, J. D., and S. D. Drell, 1965, Relativistic Quantum 
Fields (McGraw-Hill, New York). 

Boulware, D. G., 1981, Phys. Rev. D 23, 389. 

Boulware, D. G., 1982, work in progress. 

Brout, R., F. Englert, J-M. Frere, E. Gunzig, P. Nardone, and 
C. Truffin, 1980, Nucl. Phys. B 170, 228. 

Brout, R., F. Englert, and E. Gunzig, 1978, Ann. Phys. (Paris) 
115, 78. 

Brout, R., F. Englert, and P. Spindel, 1979, Phys. Rev. Lett. 
43, 417. 

Brown, L. S., 1977, Phys. Rev. D 15, 1469. 

Brown, L. S., 1980, Ann. Phys. (N.Y.) 126, 135. 

Brown, L. S., and J. C. Collins, 1980, Ann. Phys. (N.Y.) 130, 
215. 

Brown, L. S., and A. Zee, 1982, work in progress. 

Callan, C. G., 1970, Phys. Rev. D 2, 1541. 

Chanowitz, M. S., and J. Ellis, 1972, Phys. Lett. B 40, 397. 
Chanowitz, M. S., and J. Ellis, 1973, Phys. Rev. D 7, 2490. 
Christensen, S. M., 1982, to appear in Quantum Structure of 
Space and Time , edited by M. J. Duff and C. J. Isham (Cam¬ 
bridge University, Cambridge). 

Christensen, S., and M. Duff, 1980, Nucl. Phys. B 170, 480. 
Chudnovsky, E. M., 1978, Teor. Mat. Fiz. 35, 398 [Theor. 
Math. Phys. (USSR) 35, 538]. 

Coleman, S., 1979, in The Whys of Suhnuclear Physics (the 
1977 Erice lectures), edited by A. Zichichi (Plenum, New 
York) pp. 805—941. 

Coleman, S., and R. Jackiw, 1971, Ann. Phys. (N.Y.) 67, 552. 
Coleman, S., and E. Weinberg, 1973, Phys. Rev. D 7, 1888. 
Collins, J. C. (unpublished). 

Collins, J. C., 1975, Dimensional Regularization and Renormal¬ 
ization of Quantum Field Theory, Ph.D. thesis (King’s Col¬ 
lege, Cambridge). 

Collins, J. C., A. Duncan, and S. D. Joglekar, 1977, Phys. Rev. 
D 16, 438. 

Cornwall, J. M., R. Jackiw, and E. Tomboulis, 1974, Phys. 
Rev. D 10, 2428. 

Cornwall, J. M., and R. E. Norton, 1973, Phys. Rev. D 8, 
3338. 

Creutz, M., 1978, Rev. Mod. Phys. 50, 561. 

Creutz, M., 1980, Phys. Rev. D 21, 2308. 

Creutz, M., L. Jacobs, and C. Rebbi, 1979, Phys. Rev. D 20, 
1915. 

Crewther, R. J., 1972, Phys. Rev. Lett. 28, 1421. 

DeWitt, B. S., 1965, Dynamical Theory of Groups and Fields , 
(Gordon and Breach, New York). 

DeWitt, B. S., 1979, in General Relativity, edited by S. W. 
Hawking and W. Israel (Cambridge University, Cambridge), 
p. 680. 

DeWitt, B. S., 1981, to appear in Quantum Gravity II, edited 
by C. J. Isham, R. Penrose and D. W. Sciama (Oxford 
University, Oxford). 

Di Giacomo, A., and G. Paffuti, 1982, Phys. Lett. B 108, 327. 
Duff, M., 1981, to appear in Quantum Gravity II, edited by C. 


Rev. Mod. Phys., Vol. 54, No. 3, July 1982 


R44 


575 


Adler: Einstein gravity as a symmetry-breaking effect 765 


J. Isham, R. Penrose and D. W. Sciama (Oxford University, 
Oxford). 

Englert, F., and R. Brout, 1964, Phys. Rev. Lett. 13, 321. 
Englert, F., C. Truffin, and R. Gastmans, 1976, Nucl. Phys. B 
117, 407. 

Erdelyi, A., ed., 1954, Tables of Integral Transforms 
(McGraw-Hill, New York). 

Fadde’ev, L. D., and V. N. Popov, 1967, Phys. Lett. B 25, 29. 
Fadde’ev, L. D., and A. A. Slavnov, 1980, Gauge Fields 
(Benjamin/Cummings, Reading, Mass.). 

Fayet, P., and S. Ferrara, 1977, Phys. Rep. 32, 249. 

Fetter, A. L„ and J. D. Walecka, 1971, Quantum Theory of 
Many Particle Systems (McGraw-Hill, New York). 

Fradkin, E. S., and A. A. Tseytlin, 1981, Phys. Lett. B 104, 
377. 

Fradkin, E. S., and G. A. Vilkovisky, 1975, Phys. Lett. B 55, 
224. 

Fradkin, E. S., and G. A. Vilkovisky, 1976, “On Renormaliza¬ 
tion of Quantum Field Theory in Curved Spacetime,” Berne 
preprint. 

Fradkin, E. S., and G. A. Vilkovisky, 1977a, in Proceedings of 
the 18th International Conference on High Energy Physics, 
Tbilisi , U.S.S.R., 7976 (Dubna), Vol. 2, Sec. C, p. 28. 

Fradkin, E. S., and G. A. Vilkovisky, 1977b, Lett. Nuovo 
Cimento 19, 47. 

Frishman, Y., R. Horsley, and U. Wolff, 1981, Nucl. Phys. B 
183, 509. 

Fritzsch, H., and P. Minkowski, 1981, Phys. Rep. 73, 87. 

Fujii, Y., 1974, Phys. Rev. D 9, 874. 

Fujikawa, K., 1981, Phys. Rev. D 23, 2262. 

Fukuda, R., and Y. Kazama, 1980, Phys. Rev. Lett. 45, 1142. 
Gibbons, G., 1979, in General Relativity , edited by S. W. 
Hawking and W. Israel (Cambridge University, Cambridge), 
p. 639. 

Gott, J. R., Ill, 1982, Nature 295, 304. 

Gottlieb, S., and J. T. Donohue, 1979, Phys. Rev. D 20, 3378.. 
Gross, D. J., and A. Neveu, 1974, Phys. Rev. D 10, 3235. 
Gross, D. J., and F. Wilczek, 1973, Phys. Rev. Lett. 30, 1343. 
Gross, D. J., and F. Wilczek, 1974, Phys. Rev. D 9, 980. 

Guth, A. H., 1981, Phys. Rev. D 23, 347. 

Hanson, A., T. Regge, and C. Teitelboim, 1976, Constrained 
Hamiltonian Systems (Acad. Naz. dei Lincei, Rome), Vol. 
373. 

Hartle, J. B., 1977, Phys. Rev. Lett. 39, 1373. 

Hasslacher, B., and E. Mottola, 1980, Phys. Lett. B 95, 237. 
Hasslacher, B., and E. Mottola, 1981, Phys. Lett. B 99, 221. 
Hawking, S. W., 1979, in General Relativity , edited by S. W. 
Hawking and W. Israel (Cambridge University, Cambridge), 
p. 746. 

Hawking, S. W., and W. Israel, 1979, General Relativity (Cam¬ 
bridge University, Cambridge). 

Heisenberg, W., and H. Euler, 1936, Z. Phys. 98, 714. 

’t Hooft, G. (unpublished). 

’t Hooft, G., 1975, in Functional and Probabilistic Methods in 
Quantum Field Theory , edited by Bernard Jancewicz, Acta 
Universitatis Wratislavensis No. 368, Xllth Winter School of 
Theoretical Physics in Karpacz, February 17-March 2, 1975, 
Vol. I, p. 345. 

*t Hooft, G., 1979, in The Whys of Subnuclear Physics (the 
1977 Erice lectures), edited by A. Zichichi, (Plenum, New 
York), pp. 943 — 971. 

’t Hooft, G., and M. Veltman, 1972, private communication. 
Horowitz, G. T., 1981, in Quantum Gravity II, edited by C. J. 
Isham, R. Penrose, and D. W. Sciama (Oxford University, 


Oxford). 

Hu, B. L., 1979, “Quantum field theories and relativistic 
cosmology,” to be published in Recent Developments in Gen¬ 
eral Relativity , edited by R. Ruffini, Proceedings of the 
Second Marcel Grossmann Meeting, Trieste, Italy. 

Ichinose, S., and M. Omote, 1982, Nucl. Phys. B (in press). 
Jackiw, R., and K. Johnson, 1973, Phys. Rev. D 8, 2386. 
Johnson, K., M. Baker, and R. Willey, 1964, Phys. Rev. 136, 
1111B. 

Julve, J„ and M. Tonin, 1978, Nuovo Cimento B 46, 137. 

Kaku, M., P. K. Townsend, and P. van Nieuwenhuizen, 1978, 
Phys. Rev. D 17, 3179. 

Kallosh, R. E., 1978, Nucl. Phys. B 141, 141. 

Kataev, A. L., N. V. Krasnikov, and A. A. Pivovarov, 1982, 
Nucl. Phys. B (in press). 

Kay, B. S., 1981, Phys. Lett. B 101, 241. 

Kazama, Y., 1980, in Proceedings of the XXth International 
Conference on High Energy Physics, edited by L. Durand 
and L. G. Pondrum, A.I.P. Conference Proceedings No. 68, 
Particles and Fields Subseries No. 22, p. 1013. 

Khuri, N. N., and O. A. McBryan, 1979, Phys. Rev. D 20, 

881 . 

Kim, J. E., P. Langacker, M. Levine, and H. H. Williams, 
1981, Rev. Mod. Phys. 53, 211. 

Klein, O., 1974, Phys. Scr. 9, 69. 

Kleinert, H„ 1978, Fortschr. Phys. 26, 565. 

Kogut, J., and L. Susskind, 1975, Phys. Rev. Dll, 395. 
Kripfganz, J., 1981, Phys. Lett. B 101, 169. 

Kugo, T., and I. Ojima, 1979, Prog. Theor. Phys. Suppl. 66. 
Lane, K., 1974a, Phys. Rev. D 10, 1353. 

Lane, K., 1974b, Phys. Rev. D 10, 2605. 

Lee, T. D., and G. C. Wick, 1969, Nucl. Phys. B 9, 209; B 10, 

1 . 

Lee, T. D., and G. C. Wick, 1970, Phys. Rev. D 2, 1033. 
Leibbrandt, G., 1975, Rev. Mod. Phys. 47, 849. 

Linde, A. D., 1979, Zh. Eksp. Teor. Fiz. Pis’ma Red 30, 479 
[JETP Lett. 30, 447]. 

Linde, A. D., 1980, Phys. Lett. B 93, 394. 

Mansouri, F., 1979, Phys. Rev. Lett. 42, 1021. 

Mansouri, F., 1981, “Conformal Gravity, Conformal Algebra 
in Minkowski Space, and Dynamical Breakdown of Local 
Scale Invariance,” Yale preprint YTP81-12. 

Marciano, W., and H. Pagels, 1978, Phys. Rep. 36, 137. 
Matsuki, T., 1978, Prog. Theor. Phys. 59, 235. 

Milton, K. A., 1981, Phys. Lett. B 104, 49. 

Minkowski, P-, 1977, Phys. Lett. B 7l, 419. 

Misner, C. W., K. S. Thome, and J. A. Wheeler, 1970, Gravi¬ 
tation (Freeman, San Francisco). 

Nambu, Y., and G. Jona-Lasinio, 1961, Phys. Rev. 122, 345. 
Neville, D. E., 1980, Phys. Rev. D 21, 867. 

Nieh, H.T., 1982, Phys. Lett. A 88, 388. 

Nielsen, N. K., 1977, Nucl. Phys. B 120, 212. 

Nielsen, N. K., 1978, Nucl. Phys. B 140, 499. 

Ovrut, B. A., 1981, “Axial vector ward identities and dimen¬ 
sional regularization” (submitted to Nucl. Phys.). 

Ovrut, B. A., and H. J. Schnitzer, 1980, Phys. Rev. D 21, 
3369. 

Ovrut, B. A., and H. J. Schnitzer, 1981, Nucl. Phys. B 189, 
509. 

Ovrut, B. A., and J. Wess, 1982, Phys. Lett. B (in press). 

Pagels, H., 1982, “Vacuum energy and supergravity,” to ap¬ 
pear in Orbis Scientiae 1981 , edited by A. Perlmutter and L. 
F. Scott, (Plenum, New York). 

Pagels, H., and E. Tomboulis, 1978, Nucl. Phys. B 143, 485. 


Rev. Mod. Phys., Vol. 54, No. 3, July 1082 



576 


Adventures in Theoretical Physics 


766 Adler: Einstein gravity as a symmetry-breaking effect 


Parker, L., 1969, Phys. Rev. 183, 1057. 

Parker, L., 1977, in Asymptotic Structure of Space-Time , edited 
by F. P. Esposito and L. Witten, (Plenum, New York), p. 107. 
Politzer, H. D., 1973, Phys. Rev. Lett. 30, 1346. 

Sakharov, A. D., 1967, Dok. Akad. Nauk. SSSR 177, 70 [Sov. 
Phys. Dokl. 12, 1040 (1968)]. 

Sakharov, A., 1975, Teor. Mat. Fiz. 23, 178 [Theor. Math. 
Phys. (USSR) 23, 435]. 

Salam, A., 1968, in Elementary Particle Theory , edited by N. 
Svartholm (Almqvist and Wiksell, Stockholm), p. 367. 

Salam, A., and J. Strathdee, 1978, Phys. Rev. D 18, 4480. 
Savvidy, G. K., 1977, Phys. Lett. B 71, 133. 

Schrieffer, J. R., 1964, Theory of Superconductivity (Benjamin, 
New York). 

Schwinger, J., 1951, Phys. Rev. 82, 664. 

Sezgin, E., and P. van Nieuwenhuizen, 1980, Phys. Rev. D 21, 
3269. 

Simon, B., 1981, Large Orders and Summability of Eigenvalue 
Perturbation Theory: A Mathematical Overview , to appear in 
Int. J. Quantum Chem. (Proceedings of the 1981 Sanibel 
Workshop). 

Smolin, L., 1979, Nucl. Phys. B 160, 253. 

Stelle, K. S., 1977, Phys. Rev. D 16, 953. 

Stevenson, P. M., 1981, Ann. Phys. (N.Y.) 132, 383. 

Susskind, L., 1979, Phys. Rev. D 20, 2619. 

Symanzik, K., 1970, Commun. Math. Phys. 18, 227. 

Terazawa, H., 1981a, Phys. Lett. B 101, 43. 

Terazawa, H., 1981b, “Pregeometry,” to appear in the 

Proceedings of the Second Seminar **Quantum Gravity ,” 
Academy of Sciences of the USSR, Moscow , October 13 — 15, 
1981. 

Terazawa, H., and K. Akama, 1980a, Phys. Lett. B 96, 276. 
Terazawa, H., and K. Akama, 1980b, Phys. Lett. B 97, 81. 
Terazawa, H., Y. Chikashige, K. Akama, and T. Matsuki, 
1977a, Phys. Rev. D 15, 1181. 

Terazawa, H., Y. Chikashige, K. Akama, and T. Matsuki, 


1977b, J. Phys. Soc. (Japan) 43, 5. 

Tomboulis, E., 1977, Phys. Lett. B 70, 36l. 

Tomboulis, E., 1980, Phys. Lett. B 97, 77. 

Tsao, H.S., 1977, Phys. Lett. B 68, 79. 

Utiyama, R., and B. S. DeWitt, 1962, J. Math. Phys. 3, 608. 

Vainstein, A. I., V. I. Zakharov, and M. A. Shifman, 1978, 
Zh. Eksp. Teor. Fiz. PiVma Red 27, 60 [JETP Lett. 27, 55]. 

Van Nieuwenhuizen, P., 1981, Phys. Rep. 68, 189. 

Weinberg, S., 1957, Phys. Rev. 106, 1301. 

Weinberg, S., 1967, Phys. Rev. Lett. 19, 1264. 

Weinberg, S., 1972, Phys. Rev. Lett. 29, 388. 

Weinberg, S., 1976, Phys. Rev. D. 13, 974. 

Weinberg, S., 1979, in General Relativity, edited by S. W. 
Hawking, and W. Israel (Cambridge University, Cambridge), 
p. 790. 

Weinberg, S., 1980a, Phys. Lett. B 91, 51. 

Weinberg, S., 1980b, Rev. Mod. Phys. 52, 515. 

Wilson, K., 1968, Phys. Rev. 179, 1499. 

Wilson, K. G., 1973, Phys. Rev. D 7, 2911. 

Wilson, K., 1974, Phys. Rev. D 10, 2445. 

Witten, E., and S. Weinberg, 1980, Phys. Lett. B 96, 59. 

Yamagishi, H., 1982, Phys. Lett. B (in press). 

Zee, A., 1979, Phys. Rev. Lett. 42, 417. 

Zee, A., 1981, Phys. Rev. D 23, 858. 

Zee, A., 1982a, Phys. Rev. Lett. 48, 295. 

Zee, A., 1982b, Phys. Lett. B 109, 183, 

Zee, A., 1982c, Gravity as a Dynamical Consequence of the 
Strong, Weak and Electromagnetic Interactions (the 1981 
Erice Lectures), edited by A. Zichichi (to be published by 
Plenum, New York). 

ZePdovich, Ya. B., 1967, Zh, Eksp. Teor. Fiz. Pis'ma Red 6, 
883 [JETP Lett 6, 316]. 

Zimmermann, W., 1970, in Lectures on Elementary Particles 
and Quantum Field Theory (the 1970 Brandds lectures), edit¬ 
ed by S. Deser, M. Grisaru, and H. Pendleton (MIT, Cam¬ 
bridge, Mass.), pp. 395 — 589, 


Erratum: Einstein gravity as a symmetry-breaking effect 

in quantum field theory 

[Rev. Mod. Phys. 54, 729 (1982)] 

Stephen L. Adler 

The Institute for Advanced Study, Princeton, New Jersey 08540 


The following clarifications should help in reading Sec. 
VI: 

(1) Equation (6.11) is obtained by substituting Eq. (6.9), 
with 6’— 0, into Eq. (6.7). The step from Eq. (6.13) to 
(6.14) then makes use of Eq. (6.11), with 6 replaced by 
6 “ 1 and with g’ a p replaced by gjjj. 

(2) Equation (6.30) is obtained by combining Eq. 
(6.28b), which can be rewritten as 

with Eq. (6.29), which implies the vanishing of the first 
term on the left-hand side of the above equation. 

(3) In Eq. (6.58), (A^.(0)> is a shorthand for (A^r(0)>y, 
with J^ v [g a p] the external source current. 

(4) In deriving Eq. (6.59), use has been made of the 
identity 


0 = / d[]e i *Vi iv (x)V? f3 (y) , 

which follows from Eq. (6.58b) and the fact that V 2 P is 
linear in h^. From this identity we then get 

0 = (&'([V 1 (x)+V 2 (x)]V 2 (0))) o 
^(^i[Vi{x) + V 2 {x)][Vi(0) + V 2 (0)]))^ 

= (y'{v l (x)v x (0))) o -(^'(v 2 {x)v 2 {0))) o . 

I wish to thank A. Zee for comments on Sec. VI. 

In the references, the paper of Brout, Englert, and Gun- 
zig (1978) appeared in Ann. Phys. (N.Y.), not in Ann. 
Phys. (Paris). The citation of Utiyama and DeWitt (1962) 
in Sec. VI.D should also refer to DeWitt (1950) [DeWitt, 
B.S., 1950, Ph.D. thesis (Harvard University), unpublish¬ 
ed]. 
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Excerpt from S. L. Adler, Theory of Static Quark Forces, Phys. Rev. D. 18, 411^434 (1978). 


APPENDIX A: SCALAR PROPAGATOR CONSTRUCTION 


I construct in this appendix the scalar propagator 
A ab (x, y) defined in Eq. (81) of the text, which sat¬ 
isfies 

A ab (x, y) = (D^Dl + DiDi) A a& (*, y) 


= -6 a& 6 3 (x-y), 


D° r wM = S 0 Wx?w, 

D J x w(x) = + Sl(x) xj w(x), (Al) 


Copyright© 1978 by the American Physical Society. Reprinted with permission. 
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*S(*)= 72 (1 - r cothr) , 







sinh rj 9 


A st (x,y) = -6 3t 6 3 (x-y), (A7) 

we get the desired propagator A a& by transforming 
both SU(2) indices with the matrix M, 

A ab (x, y) = y). (A 8) 


where I have set k * 1. To change to general k one 
simply uses the scaling law 

* ab (x,y) tm „ ilK = ^ b (x, r ,K) 


= K& ab (KX f Kyjl). (A2) 


The reversal in sign of as compared with Eq. 
(49) [which I have made because the sign in Eq, 

(Al) corresponds to the convention I used in my 
calculations] has no effect on A a& , since D*D* is 
even in \ 0 . As in the vector propagator calculation 
in the text, I make extensive use of the results of 
Brown et al . 20 for propagators in pseudopaerticle 
fields. The first step of the calculation, following 
Manton, 31 is to make a complex gauge transform¬ 
ation which changes the potentials from X£, of 
Eq. (Al) to with 


= 72 ( 1 ~ r cothr), 

g aii Y i 

= — -p — (1 - r cothr) + i& la . 


(A3) 


Introducing a matrix M° a (x) given by 


M° a (x) - coshr 


( 


6“ 5 - 


r 2 ) 


-i sinhre 03 ' — + —-t— , 
r r 2 


(A4) 


Af 3 (x)M H W = 6 ab , 


it is straightforward to verify that 


From this point on I will work exclusively with 
the gauge-transformed potentials of Eq, (A3). For 
notational convenience I will drop all bars, but it 
should be kept in mind that I am now constructing 
the propagator £ in the new gauge, not the final 
propagator A given by Eq. (A8). The advantage of 
the potentials of Eq. (A3) is that they take the form 
used by Brown et al. as the starting point for their 
analysis, 

A att (x) = (XJ, b a 0 l ) = -rj 1 - )itva B v lnTr(x), 


7 r(x) = 


sinhr 



(A9) 


ql- )\i.va - (- bua )kla - )ftOa = 

Note that although 7 t(x) depends on x°, the potential 
A ait (x) depends only on the spatial components x J 
of x. Hence if I define a Euclidean time-dependent 
propagator A afr (x, y, x°, y°) by 


D*D* A a& (5, y, *°, y°) = -6 ab 6 4 (x - y), 
D° w(x) = + \(x) x] w(x), 


(A 10) 


then it will actually depend only on the time differ¬ 
ence \ = - y°, and the desired propagator A afr (x,$) 

is obtained by integrating over the time difference, 


A a 6 (x,y)=r d\ A“ 6 (x,y, \). 

* *00 


(All) 


The final observation needed, in order to make 
contact with the work of Brown et al., is that 7 t(x) 
can be written as a contour integral, 

e ix o sinhr 
*(*)*- r - 


2t r 


/ 


00 - i/f 


ds e 


Is 


1 




r 2 + (x° - s) 


2 9 


A>r. 


(A12) 


— M° 3 (x) = -€ Bto b^M ea + , (A5) 

which implies that 

D J X M° ff (x) = M aa (*)I5£. (A 6) 

So once we have obtained the scalar propagator 
satisfying 


I now will list a number of results from the an¬ 
alysis of Brown et al ., with occasional small 
changes in notation. Brown et al. construct the 
general scalar, isovector propagator A ab (x, y,x?,y°) 
satisfying Eq. (A10) for potentials A m (x) 

= -7) { ' )itva d v Imt(x) representing a general N-pseudo- 
particle (instanton) configuration 

»(*) = (!) +£^4, *.«*-*.. (A 13) 

5 X S 
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Their result takes the form of a sum of two pieces 
(with x, y Euclidean four-vectors) 


A ab (x, y) (2, = 


C ab (x t y) 

^TT 2 TT(x)TT(y) ’ 


Aab (x, y) - * ab (x, y) {1) + A a& (*, y) {2) . (A14) 

The first piece is constructed in terms of spin-£ 
propagators by the recipe 


Aab (x, y) U) = 


V ab (x , y) 

lir 2 (x - y) 2 TT(x)ir(y) ’ 


U ab (x, y) = ftr[ t“F <*’(*, y)r b F u \y, x)], 


F ( * , (x,y) = (l) + J2Ps 

3 


Llii 


^ ■ y, 
y s 2 ’ 


(A 15) 


CJx,y )= £ *‘i(x)C rii „*«;i(y) > 


r, s, m, t; 


C = _ 1 


(A 16) 


(* r -* s ) 2 (z r -zj 


(B.zP.l L h P-tP. 1 L h _ PsPu u 

loo r * sv o o * u o o n 

\“s“v 


rv 


^*(i,T J ), T tu = (-i,T J ), 

7^ = 311(2) Pauli matrices, x s = x- 2 s , y s = y-2 s - 
The second piece has the form 

_i 


+ 



1 

(*.-*j* ’ 


and with the numbers 32 h sv determined by the 
matrix inversion problem 



PtPv 


- 6 


$v > 


(A17a) 




(A17b) 


To make use of these rather formidable looking equations, I note that Eq. (A13) becomes identical to Eq. 
(A12) under the substitutions 


(U-o , 

*.-(5,0), * s 2 ~(x°-5) 2 +x 2 , (A 18) 

52 - f ds > P.~-( l/2n)e ,s , 

C 


so that the Sommerfield-Prasad solution is in effect a continuum of complex instantons. Corresponding to 
the substitution (1) —0, the terms (1) in Eqs, (A15) and (A17) must also be deleted. The transition from 
sums to integrals can be made with no ambiguity in A a& (x,y) (1) , giving (recall that \=x°-y°) 


00 


m 00 


d\ 


d\ A afr (x,y, \) (1) 

1 


A a& (x,y) (1, s J 

r 

~ f moo ~ ' 4 ;r 2 [( 1 c-" 5 f ) 2 + \ 2 ] sinh lie I sinhl^. 

g.-)? ■ 


iyi 


e~ ,x ° m iy °%tr[T a F ( * \x f y)r b F ( * *(y, jc)] , 


(A19) 


Making a shift s-*s+x° in the integration over s in F l+, (x, y) and a shift t — t + x° in the corresponding in¬ 
tegration over t in gives 


A a *(x,y) (1, = 


1 Ixl 
(2;r) 4 sinh I x 


lyi 

sinh |y | 




d\ e ix 
(x-y) 2: + \ 2 


t *x+it a T*x-is t °y + i(s + \) b r °y- £(f + X) 1 
5 2 + 1 2 7 x 2 +s 2 y 2 + (s + X) 2 7 y 2 + (t +\) 2 J 


(A20) 

Now make, in the order indicated, the following changes of variables: 

(i) \ — z - £(s + 1) = z - w , 

(ii) w-^(s + t), v-% (s-#), dsdt - 2dwdv . 


(A21) 
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Turning next to the second piece, I note that 
time-translation invariance implies that h sv 
-h(s - v). Anticipating the fact that only H 

-v) Is needed, I proceed first to extract 
this quantity from the matrix inversion problem of 
Brown et al. stated in Eq. (A17). Because the ex¬ 
pressions of Eqs. (A16) and (A17) contain singular 
factors (z u -z v )~ 2 , etc., it is necessary to sepa¬ 
rate the various integration contours r, s,w,v by 
small imaginary displacements. In order to do 
this in a way which preserves the validity of 
various algebraic operations used by Brown et al. 
in getting their solution, 33 it is necessary to sym¬ 
metrize over all possible "stacking orders” of the 
contours on the complex plane, a procedure which 
will eventually lead to the appearance of principal- 
value integrals in the answer. Summing over v in 
Eq. (A17a) gives 

[? M '-”] [?£“■]■? • 

(A23) 


Dropping the (1) in the expression for g 6t in Eq. 
(A17b) gives 



X [3P + (x® 

Again it is necessary to make, in the order indicated, 
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L-looking formula: 


°o - IK 


°o - IK 


e lw e l * 


(£-^) 2 + ( z - w ) ; 


v) a t • x- i(w + y) t •y + i(z + v) b 
7 x z +(w + v ) 2 y 2 + (z + v) 2 7 


t • y- i(z - t>)1 

?;<*-*)* J ■ 


(A22) 


(A25) 

As a consistency check, note that if we multiply 
Eq. (A23) by p 6 and sum, we get 


s,t Pt r* s \*r * q) 


-o=E 


Ps 2 , 


(A26) 


but in the continuum limit 

H J ds e u =0 

s Air j moo _ lK 


(A27) 


so that Eq. (A26) is in fact satisfied. Passing to 
the limit in Eq. (A25), and remembering that we 
must average over the cases where the r contour 
goes over and under s, we get 

[ r°° - IK 1-1 1 

(r-s) 2 J 7T 


r-i 


(A28) 


The final step in the calculation is to make the 
transition froms sums to integrals in Eq. (A16), 
bearing in mind the necessity of symmetrizing 
over the ordering of integration contours. Noting 
that 


T] <m)ltva x rti x Sv = - x a (r - s) , 


(A29) 


we get from Eq. (A16) (again with X=x°-y°) 


x 2 + (x° - r) 2 ] [ x 2 + (x° — s) 2 ] [y z + (y° — r) a ] [y 2 + (y°- s) 2 ] 


lu C ds dvhjs - v) 
J (r- s)(w - v) 


r) 2 ][W+~(x°- s) 8 ][y 8 + ( y°-u ) 2 ][y 2 + (y°- v) 2 ] 


(A30) 


the following changes of variables; 
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First term in { }: 

(i) r~r+x°, s~s+x°, 

(ii) \-~z - j(r+s) = z-w , 

(iii) w = Kr + s), v = |(r-s), drds = 2dwdv . 

Second term in { }: 

(i) r~r + x°, s~s+x Q , u~u+x°, v~v+x°, 

(ii) \~z x - \(u+v) , 

(iii) z 2 = i(r+s), w = i(u + v), v 2 = i(r-s), v x =&u-v), 

drds - 2dz ^lv 2J dudv-2dwdv 1 . 

After these transformations, the only place where w appears is in 



— v 2 + v x - w) 



giving as the final answer 


A a *(x, y) (2) » 


4 Ix| ly I 
(2;r) 4 sinh |X| sinh |y | 


x*y 


b 


18 


(A31a) 


(A31b) 


(A32) 


U GO <• GO* 

. * L 


°°*dw e lu) 

[x 2 + (!0-v) 2 ][2 2 + (^ + t;) 2 ] 


/ 


°0 - IK 


dz e 


le 


- °0 - IK 


[y 2 +(z-v) 2 ][ y 2 + (z +v) 2 ] 


GO 


GO 


e iv * 

v n 


/ 


CO. IK 


dz 2 e ** 2 


•CO. IK 


[x 2 + (z 2 - v 2 f] [ x 2 + (z 2 + v 2 ) 2 ' 


xp 


/ « e lvi rce-lK dz x e l *l 

.. dVl ~ J-.o.iK [y a + (*i-Vi) a ][y a + (*i+v7)*] 


(A33) 

Although it took a more involved argument to extract Eq. (A33) from the work of Brown et al, than was 
needed to get Eq. (A22), the evaluation of the contour integrals appearing in Eq. (A33) is relatively easy. 
Writing x = |x|, y = |y |, the answer is 


A a& (x, y) 


c2 1 = _L _jL 

4ir sinhx sinhy 


xi— coshx coshy - ~ j— 
)xy L * 2 \ 


sinhx . sinhy . 

— coshy +-- coshx 

y 


)] 


+ 


1/1 l\ /sinh(x-y) sinh(x+y)\ 1 / 
4 \x* + y*) \ x^y x + y ) ~ xy\ 


, sinhx\ / , sinhy 

coshx - —-—1 fcoshy-— 


y 


l 


y ( 


4ir sinhx sinhy 


{ 1 / sinhx . sinhy , \ sinhx sinhy 
2xy\ x y I x 2 y 2 


+ 


l/l l\ ainh(x-y) sinh(x + y) " ) 
4\x 2 + y 2 /[_ x-y ” x + y J [ 


(A3 4b) 


The fact that the final term in Eq. (A34a), which 
comes from the product of principal-value inte¬ 
grals in Eq. (A33), cancels away the leading large- 
y asymptotic behavior of the first three terms is 


1 -- 

a check that the limiting argument leading to Eq. 
(A28) has been carried out correctly. The evalua¬ 
tion of A a& (x,‘y) (1) , in which x and ^dependences 
are highly correlated, involves straightforward 
but very lengthy computations, on which I am now 
working. 
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Excerpt from S. L. Adler, Classical Quark Statics, 
Phys. Rev. D. 19, 1168-1187 (1979). 


APPENDIX A: PROPAGATOR FORMULAS 

In giving formulas for the scalar propagator A a& 
in the Prasad-Sommerfield background field, it is 
convenient to set k = 1; to change to general k one 
uses the scaling law 


A ab {x, y) gener a, 

= K& ab (KX, Ky ; 1). 

Writing 

A<*(x,y-1)= -L -JL - 2— S* 6 , 

4tt stnhx sinhy 

I find the following expression for Z ab : 
af( X ,y)\ i ( X ,y), 


(Al) 


(A2) 


i* 1 


a a& = 6 a& + 




x 2 y : 


x-y - 


y a y b 


X 2 


y 


2 3 


of = x“y b , 


of = x b y a -6 ab $‘y , 


(A3) 




*)■ 


<T--w- 

5 r 




X* 




X x = t/ 2 CO +/ 2 (*~) +/ 2 (*+.) +/ 2 (Ol 


2A 

A 


f da( 1 - a)e _ai coshaxcoshay , 


Xo - 


1 /coshx coshy - sinhx sinhy 

Py 5 " \ A ” x 


y 


)■ 


\.s 


2xyA 


f-/ 2 (0 -/afO+Zate.-)^^*)] 


= -| f da(l - a)e-°* 


sinhax s inhay 

x y 


X„ = 


2xA 


M/i(0+/i(*_ J] 


-e‘*[/i(0 +/i(z J]} 

2 r dae ^ c oshay ^(l-a ,)£ 

A A * 


\.= 


2yA 


M/i(*.-)+/i(0] 


(A4) 


-e- y [/ 1 (0+/ 1 (0]} 

= X f* da e' aA coshax 3mh(1 -~ . a . ) Z 
A y 


a = |*-?| , 

z„=x + y-A, z._=x-y-A } 

*_. = -* + y-A, 2__=-x-y-A, 
,. . e*- 1 1-2 

/l( 2 ) = „ 3 / 2 ( 2 )“ - 2 


It is easy to verify that, despite the factors x -2 
and y -2 in the o’s, Eq. (A3) is in fact analytic near 
x = 0 and near y=0. Near $ = ?, Eq. (A2) has the 
expected short-distance singularity 


1 ft ab 

A ab (x,y; 1) ^ — — +0(1), 


A -> 0 


4 n A 


(A5) 


while the limiting behavior for large y, with x 
fixed, is 


A->(*,y;l) ^ i-^(cothx-l) 

y w ' 


y 

? 


+ o 


(?) ’ 


(A6) 


which has 6JJ°(x) as the x-dependent factor. The 
expression for the differential operator DJfDJ used 
in verifying the propagator differential equation is 




sinhx 


= (air) V + C^+C,*' ^ r + C'MW+CiV-ftrt'+C^-lj- 


Cl = I (il^J " COthx ) ’ ° 2 = 2 G " COthX ) ’ C 3 =1+C 1’ (A7) 

=-2 (x “ sinhx) ’ C 5 =-C «- 

The projected covariant derivative of the scalar Green’s function, defined by Eq. (46) of the text, is given 
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by the following formulas: 


= ^ £ a*(*,y)rA.30, 


. , . 1 / * -y 9X 4 \ __ 

1 sinhx \ 4 A 9A / sinh 2 x ’ 

rrt*, -l 2 XCOShx\ A X 9X 2 . X 2 -X-y 3X ft 

T 2 x >y y sinhx ” sinh 2 x / 2 + sinhx 9x + A sinhx 9A ’ 

/ 1 9X 4 x 

T * (x > y > = ~ A sinhx 9A - sinh 2 x X * ’ 


/ \ / 1 
l(X,y)= Uinhx “ si 


tcoshxN x 9X. x 2 -X*y 9X. 

I X + - 4 ,j_ J . — 4 

sinh 2 x / 4 sinhx 9x A sinhx 9A 9 


x 2 y 2 9X 


T $\ x >y A sinhx 9A sinh 2 x 59 


with 


= “ ^ + X) 2^A l e '[/!( 2 -.) + /!( 2 -)l- e "[/2( 2 .>A( 2 .jl} » 


= - z X 4 + 2M/atO +/2( 2 -)]+ 


9X 2 _ _J_ 
9A x*y 


H- 


coshx coshy - e ‘ A » e 


9X 2 2 

—3- -Xo + 


1 si 

1F7 ~ 


sinhx coshy 


( coshx sinhx \ sinhy "| 

~— 7~) ~r\ 


In the region y»x, y» 1 the following formula is useful; 


A'(x,;y;l) L * = ^- |x'y 


b T s 2 + X 
2 y 


(y > -^y)^]+ 0 (O, 


1 1 1 1 x 2 -^*^ coshx 

72 y 2 x 3 ~ Ay x sinh 2 x A 3 y x 2 sinhx ’ 


T f = 


y coshx 


5 A 3 sinhx sinh 2 x 


1/ 1 | 1 \ 
A \ y + A — x y +A +x/ 
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(A8) 


(A9) 
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This article gives a pedagogical introduction to relaxation methods for the numerical solution of elliptic 
partial differential equations, with particular emphasis on treating nonlinear problems with 8-function 
source terms and axial symmetry, which arise in the context of effective Lagrangian approximations to the 
dynamics of quantized gauge fields. The authors present a detailed theoretical analysis of three models 
which are used as numerical examples: the classical Abelian Higgs model (illustrating charge screening), 
the semiclassical leading logarithm model (illustrating flux confinement within a free boundary or “bag”), 
and the axially symmetric Bogomol’nyi-Prasad-Sommerfield monopoles (illustrating the occurrence of 
topological quantum numbers in non-Abelian gauge fields). They then proceed to a self-contained introduc¬ 
tion to the theory of relaxation methods and allied iterative numerical methods and to the practical aspects 
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I. INTRODUCTION 

Over the last few years, numerical methods have played 
an increasingly important role in theoretical physics. 
Their prominence is attributable both to improvements in 
computers and decreased computational costs, and to the 
increased attention of theorists to nonlinear, nonperturba- 

Reviews of Modem Physics, Vol. 56, No. 1, January 1984 


tive problems in quantum field theory, for which purely 
analytical methods are inadequate. In treating quantum 
field theories computationally, two strategies are possible. 
The first, which has been intensively pursued recently, is 
to set up a discrete lattice analog of the full quantum field 
theory, and then to numerically evaluate the Feynman 
path integral by Monte Carlo techniques. This method 
has the advantage of giving results which can in principle 
be made as accurate as desired. Nonetheless, the necessity 
of using a four-dimensional computational lattice and of 
generating a large ensemble of field configurations makes 
simulation very costly, and in practice this has been a 
severe limiting factor. A second strategy is to first make 
analytic approximations, which replace the field theoretic 
problem by a classical variational problem involving an 
effective Lagrangian functional, leading to a system of 
partial differential equations which are then solved nu¬ 
merically. This approach is necessarily approximate, 
since exact knowledge of the effective Lagrangian is not 
possible without an exact evaluation of the Feynman path 
integral. However, the second strategy has the advantages 
that symmetries of the physical problem can be exploited 
to reduce the dimensionality of the computational prob¬ 
lem, and that only a single equilibrium field configuration 
need be generated, permitting the study of very large com¬ 
putational lattices even on small computers. We believe 
that the two strategies are, in a sense, complementary; 
eventually, simulations may be used to infer the form of 
effective Lagrangians, which can then be used to analyze 
large classes of problems of physical interest. 

Our aim in this article is to give a pedagogical review 
of the numerical analysis methods required by the second 
strategy. Assuming that an approximate nonlinear classi¬ 
cal effective Lagrangian has been given, we show how re¬ 
laxation methods can be used to solve the partial differen¬ 
tial equations which govern the equilibrium field configu¬ 
rations. We focus on problems which arise in gauge field 
theories of current interest and in Sec. II introduce and 

Copyright ©1984 The American Physical Society 1 


Reprinted with permission. 



R47 


585 


2 


Adler and Piran: Relaxation methods for gauge field equilibrium equations 


analytically characterize three nonlinear models which 
will be studied as illustrative examples. In Sec. Ill we give 
a self-contained introduction to the theory of relaxation 
methods and to practical aspects of their implementation, 
with special emphasis on treating nonlinear problems with 
singular (5-function) source terms. Readers primarily in¬ 
terested in numerical analysis can proceed directly to Sec. 
Ill, after reading only the brief survey and theoretical dis¬ 
cussion of Secs. II.A and II.B. In Sec. IV we give specific 
details of the application of the methods of Sec. Ill to the 
models of Secs. II.C, II.D, and II.E, together with a small 
sampling of numerical results. Certain technical details 
of the analytical structure of the three models are 
described in the Appendixes. 


II. THEORETICAL ANALYSIS OF MODELS 
TO BE STUDIED 


A. Introduction 

In this section, we give a self-contained theoretical 
analysis of the models which later on will be studied nu¬ 
merically. All of the models discussed below describe 
time-independent three-dimensional problems with a rota¬ 
tional symmetry axis, and hence lead to two-dimensional 
computational problems in cylindrical coordinates. Our 
primary focus will be on the statics of classical charges in 
nonlinear Abelian and non-Abelian gauge field theories, 
as formulated by using classical action functional 
methods. In Sec. II.B we give the basic formalism for 
classical Lagrangian statics and illustrate it by briefly 
considering the case of classical electrostatics. In Sec. 
II.C we discuss the statics of classical sources in the 
Abelian Higgs model, in which external source charges 
are screened. In Sec. II.D we discuss the statics of classi¬ 
cal sources using the leading logarithm semiclassical ef¬ 
fective action functional for an SU(«) non-Abelian gauge 
theory, and show analytically that this model describes 
flux and charge confinement. As a secondary topic we 
consider non-Abelian gauge field configurations with 
topological quantum numbers, as exemplified by the 
axially-symmetric SU(2) topological monopole solutions, 
the theory of which is discussed in Sec. II.E. In the analy¬ 
ses of Secs. II.C—II.E, we place particular emphasis on 
identifying special features of the models under study 
which must be taken into account when solving them nu¬ 
merically. 


B. Classical Lagrangian statics 


Consider a classical dynamical system described by the 
action 


S= f dtL, 

L='Z'Piq i -H(p,q), q = ^~ , 
; at 


(2.1a) 

(2.1b) 


where q t and p t are the canonical coordinates and momen¬ 


ta, where H is the Hamiltonian, and where the prime in¬ 
dicates that those coordinates which have identically van¬ 
ishing canonical momenta are omitted from the sum. We 
will be specifically interested in systems for which the 
equations of motion implied by extremizing S have non¬ 
trivial time-independent solutions, and want to find a 
variational principle for calculating the energy 


^static — H ip>q) 


( 2 . 2 ) 


associated with such static solutions. For time- 
independent solutions, extremizing S is equivalent to ex¬ 
tremizing L iq’,q = 0), and so evaluating Eq. (2.1b) at <j = 0 
gives 

L exJ =ext g L{q;q = 0) 


= -H=-V, 


static 


(2.3) 


Equation (2.3) gives a variational formulation of the prob¬ 
lem of calculating V static and is the fundamental equation 
of classical Lagrangian statics. 

As an illustration of Eq. (2.3), let us consider the fa¬ 
miliar example of classical electrostatics. The Lagrangian 
for the Maxwell field coupled to an external static source 
density j° is 1 

L= f , (2.4) 

where the fields E and B are related to the scalar potential 
A 0 and the vector potential A by 

E= — V^°—A, B=Vx A . (2.5) 


Specializing to static solutions with A=0 , we have 
L[A°, A;A=0]= /d 3 xj|[(V^°) 2 -(VxA) 2 ]-/ 0 ^ 0 ) , 

( 2 . 6 ) 


which is stationary when the potentials satisfy 

V-E=-V 2 ^°=/°, (2.7a) 

VxB=Vx(VxA)=0. (2.7b) 


If the potentials are required to vanish at infinity, the 
general solution to Eq. (2.7) is 

j°W) 

4ot|x-x'| ’ (2.8) 

A(x) = V^(x) , 


^°(x)= f d 3 x 


with 'I' an arbitrary gauge function. Substituting Eqs. 
(2.7) and (2.8) back into Eq. (2.6), we get, after an integra¬ 
tion by parts , 


L ta = f d 3 x[ — ■jA 0 V 2 A°—j 0 A°)= — -j f d 3 x j°A° 
= -T / d\d\' £ (X £^'i =- V suVlC , (2.9) 


boldface will be used throughout to denote spatial vector in¬ 
dices. 
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in agreement with the general formula of Eq. (2.3). As a 
final remark, we note that this example shows that Eq. 
(2.3) is not a minimum principle; although Eq. (2.6) is sta¬ 
tionary at B=VxA=0 , this value of B maximizes, rath¬ 
er than minimizes, 2 the Lagrangian L. 


C. The Abelian Higgs model 

The first, and simplest nonlinear model which we shall 
discuss is the Abelian Higgs model, coupled to an external 
source charge density. The fields of this model are an 
Abelian gauge potential A* together with a complex sca¬ 
lar field cp of charge e . The Lagrangian is 3 


( 2 . 10 ) 


L= f dhjf , 

Jf = y(E 2 —B 2 )—/°y4° + i4~-eA° <p 

ot 

V 

- I (/V+eA)<p I 2 —\C{ I <p I 2 —k 2 ) 2 , 


with E and B constructed from the potentials A and A 0 
as in Eq. (2.5), and with \<p\ 2 =<p<p*, where cp* is the 
complex conjugate of cp. When specialized to time- 
independent fields by setting A and cp to zero, Eq. (2.10) 
simplifies to 

y=y[(V^°) 2 -(VxA) 2 ]-i 0 ^ 0 


+e z {A°) z | cp | 2 — | (t V +eA)<p 
-|C( | cp | 2 -k 2 ) 2 , 


( 2 . 11 ) 


which is invariant under the time-independent gauge 
transformation 


A+e-'Vt/-. 


( 2 . 12 ) 


By a suitable choice of gauge 3 we can make the scalar 
field cp real, so that Eq. (2.11) becomes 


2 For neutral charge distributions (with J* d 3 j° = 0) the varia¬ 
tional principle 8L[ J 4°,A;A=0]=0 is minimax: the fields of 
classical electrostatics minimize L with respect to variations in 
A 0 , while maximizing L with respect to variations in A. A 
functional which (for neutral charge distributions) is minimized 
by the fields of classical electrostatics is 

L[A °,A]= /d 3 x( |[(V.4°) 2 + (VxA) 2 ]-y°4°j . 

Functionals of this form can be useful for mathematical pur¬ 
poses [see, e.g., Adler (1981a, 1981b) and Footnote 13 below], 
but unlike L have no direct physical interpretation. 

3 For a pedagogical discussion of the Abelian Higgs model, see 

Bernstein (1974). The analysis described in Sec. II.C was carried 

out by Adler and Pearson (1978, and unpublished); see Appen¬ 

dix B of Adler (1978a). 


Jf = |[(V^°) 2 -(VxA) 2 ] -j°A° 

+e 2 (A°) 2 q> 2 —('7q>) 2 —e 2 A?q> 2 — jC(<p 2 —K 2 ) 2 . 

(2.13) 

As our final simplification, we note that since Eq. (2.13) 
has no source term coupled to A, it is stationary with 
respect to variations of A around A=0 . Hence to calcu¬ 
late K static it suffices to consider the A=0 specialization 
of Eq. (2.13), giving 

L[A°, <p]= f d 3 x{-j[VA 0 ) 2 * ~j 0 A 0 +e 2 (A°) 2 <p 2 

-<V<?>) 2 -tCV-k 2 ) 2 ) , 

n , <2.14) 

^static— 0 [A ] . 

Varying the action of Eq. (2.14), we get the Euler- 
Lagrange equations 


V 2 A°=2e z A {1 <p z —j {1 , 

V 2 <p= —e 2 {A°) 2 <p + C<p{<p 2 —K 2 ) . 


2 a 0„2 ;0 


(2.15) 


These equations will be solved numerically in Sec. IV.B 
for a source j° describing point charges located symmetri¬ 
cally on the i axis, 


y°=Q8(x)8(y)[8(z-a)-8(z+a)] , 


(2.16) 


for which A 0 is an even function and cp is an odd function 
of z. A straightforward analysis 3 shows that the leading 
behavior of A 0 and <p, at infinity and in the neighborhood 
of the source charges, is given by the following formulas. 

At oo : 


CO ) 

<p~/c + -*-—exp[ — r(2C7c 2 ) 1/2 ] , 

A°~A 0( ' co) — — — exp[ — r{2e 2 K 2 ) l/2 ] . 

r \ r 2 j 

Atrhj-O: 


(2.17a) 


cp~<p^r‘ ij, k—— 2 + 4 


A°~i±) —+^ 0(0) 


eg 

477 


12 1 1/2 


(2.17b) 


with <p { “A 0( “ <p i0 \ A 0(0) constants and with 

r = (* 2 +y 2 +z 2 )'' 2 , ai8) 

''[l j = [^ 2 +T 2 -bUTa) 2 ] 1/2 . 

At large distances, the Higgs field cp approaches the con¬ 
stant k (there is a second solution to the equations with 
cp —> — cp) and the scalar potential A 0 shows the charac¬ 
teristic exponential decay expected in a Higgs phase, 
which arises from the screening of the source j° by the 
charged Higgs field. Close to the source charges, the 
Higgs field becomes infinite as r\i i with - y < k < 0 for 
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weak source charges Q satisfying the inequality 4 
r 12 


eQ 
4 TT 


I 

< 4 > 


(2.19) 


and the scalar potential has Coulomb-type singularities. 
The removal of the infinite Coulomb self-energies from 
the formula for K slalic will be discussed in detail later on, 
in Sec. III.E of the text and in Appendix C. 


D. Non-Abellan statics in the leading logarithm model 


The second nonlinear model which we shall discuss is 
constructed from an SU{«) non-Abelian gauge theory cou¬ 
pled to an external source charge density. The fields of 
this model are an SU(n) non-Abelian gauge potential A bfl t 
with b = 1 ,..., n 2 — 1 the internal symmetry index. 
Making the conventional rescaling of the gauge potentials 
by the coupling constant g, the action and Lagrangian for 
this theory are 


S= f Ldt, L= f d 3 xJf , 

= - L-(E a -E a -B a -B a )-; a0 ^ a0 , 

2 g 2 

with the field-potential relations given now by 
E ttJ =-^>jA a0 -~A a i , 


B a ‘=z iki 


a* 


-A a ‘+if abc A bk A c ‘ 


( 2 . 20 ) 


( 2 . 21 ) 


In Eq. (2.21), z* kt denotes the usual three-index antisym¬ 
metric tensor defined by 

z jki^ z ijk =z kij } e y*/ = _ e *;/ e i23^ j f (2.22) 


f abc are the SU(n) group structure constants [for SU(2), 

f abc = z a^ and 

@] is the covariant derivative defined 
(for arbitrary w a ) by 

3jW a =-^w a +f abc A bJ w c . (2.23) 

dx J 


Static (and nonstatic) extrema of the action of Eq. 
(2.20) have been extensively discussed in the literature, 5 
and can be found numerically by the same algorithm 
which we use later on to solve the topological monopole 
model. Hence instead of basing a numerical example on 
Eq. (2.20), we consider instead the much more interesting 


4 The fact that X becomes complex for large Q is an indication 
that, for large Q , pair production is important and that a field- 
theoretic discussion is needed. In a full field-theory treatment 
of the Abelian Higgs model, Eq. (2.14) is replaced by 

^static = ,<p] j 

A 9 if 

with L e ff an effective action functional which includes the ef¬ 
fects of virtual quanta. When radiative corrections are ignored, 
L e ff reduces to the Lagrangian of Eq. (2.14). 

5 For an exhaustive survey, see Actor (1979). 


model in which L is replaced by an effective action L eff , 
which (for slowly varying fields) incorporates the effect of 
radiative corrections to leading logarithm order, while 
keeping j a0 a classical 6 source. Both explicit one-loop 
calculations for the special case of constant field- 
strengths, 7 and more general renormalization-group argu¬ 
ments, 8 show that the action L ef f is obtained by replacing 
the coupling constant g 2 in Eq. (2.20) by a field-strength- 
dependent “running” coupling constant g 2 (^"), 


6 Two types of approximation schemes have been discussed in 
the literature for reducing SU(n) quantum chromodynamics 
with quantized source charges to classical source charge models. 
For methods involving a direct replacement of the SU(3) color 
charges by quasi-Abelian effective charges which respect the tri- 
ality selection rules for color singlet states, see Mandula (1976) 
and Adler (1982). For methods involving a study of the alge¬ 
braic properties of the SU(n) color charges, see Khriplovich 
(1978); Adler (1978b); Giles and McLerran (1978); Cvitanovic, 
Gonsalves, and Neville (1978); Rittenberg and Wyler (1978); Lee 
(1979); Adler (1979); Lee (1980); Adler (1980); Bender, Gromes, 
and Rothe (1980); Adler (1981a); Milton, Palmer, and Pinsky 
(1982); and Milton, Wilcox, Palmer, and Pinsky (1982). 

7 The one-loop Yang-Mills effective action functional for con¬ 
stant field strengths has been calculated by a number of authors. 
See, for an early calculation, Batalin, Matinyan, and Savvidy 
(1977), and for recent discussions and references, Schanbacher 
(1982) and Anishetty (1982). Methods for constructing gauge- 
invariant effective action functionals beyond one-loop order 
have been given by 't Hooft (1975a), DeWitt (1981), Boulware 
(1981), and Abbott (1981). 

8 Matinyan and Savvidy (1978) and Pagels and Tomboulis 
(1978) have shown how the structure of the renormalization- 
group improved effective action can be inferred from the con¬ 
formal trace anomaly. Renormalization-group arguments give 
an expression for of the form 


J?(^) = ±b 0 3 r \og{3 r /eK 2 ) 


I loglogt^/ey 2 ) 

bl log (^/eK 2 ) 


+ 0(<n , 


with bo and b\ the usual /^-function coefficients defined in one- 
and two-loop orders. Adler (1981b) has argued that this expres¬ 
sion may give the leading two terms in the effective action for 
weak fields (&/e« 2 | «1 as well as for strong fields 
| & /eK 2 1 »1, because the magnitude of the running coupling 
constant of Eq. (2.24) is small in both regions. This argument 
suggests that, very generally, the effective dielectric constant 
changes sign between the strong and weak field regions, which 
is the essential feature responsible for confinement in the lead¬ 
ing logarithm model. 

The corrections of order F are not determined by 
renormalization-group arguments and in general are highly non¬ 
local (i.e., they depend on derivatives of the field strengths). 
Adler (1983) gives arguments indicating that the nonlocal terms 
in L e ff become important in the ultraviolet (short-distance) limit, 
but are unimportant relative to the local terms in the infrared 
(long-distance, or confining) limit. The order-J^ terms also can 
have imaginary parts; for example, if k 2 in Eq. (2.28) is replaced 
by — k 2 , an additional imaginary term appears in at the 

order-J* - level. Hence even the sign of & at the extremum of 
cannot be determined by a renormalization-group argument. 
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5 


g 2 (f) = 


11 


l + T&og^og^/^ 4 ) 
with & the field-strength invariant 
P=E a -E a -B a -B a t 


(2.24) 


(2.25) 


which appears in the classical action. The constant b 0 in 
Eq. (2.24) is the asymptotic freedom constant 9 

6 0 = - 1 TT-C , 2[ s U(n)] = -3-r» . (2-26) 

while the mass \i is the renormalization point and g 2 is 
the value of the running coupling constant at 
Combining Eqs, (2.20)—(2.25) and defining the one-loop 
renormalization-group-invariant parameter 

K 2 = i^ e -4/(^ 2 ) j (2.27) 

e 

we get the effective action for the leading logarithm 
model, 8 

Ldf= / d 3 xJf e f f = / d 3 x[Jf(3 r )-j a0 A a0 ] , (2.28a) 

.¥{.?)= \b 0 .r\og{.r/ex 2 ) . (2.28b) 

When specialized to time-independent fields, 10 Eq. 

(2.21) for E al becomes 

E ai = - & jA a0 , (2.29) 

and Eq. (2.3) for K stat j c yields the variational problem 

V^ c = -ext A "> Aaj {L' S t[A° Q ,A°i]] ■ (2.30) 

The Euler-Lagrange equations for Eq. (2.30) are 

^/e^)=y fl0 , (2.31a) 

e kJm @j(zB am )=-f abc A b0 EE ck , (2.31b) 

where we have introduced a field-strength-dependent ef¬ 
fective dielectric constant e defined by 

e =_ a -sf(jn _ = L bXLr/K i ) ( 2 . 32 ) 

a( 


Applying a covariant derivative to Eq. ( 2.31b) gives 
the equation 


~E kim [ ] ( eB' am ) = -f abc ( A 60 )eE' ck 

-f abc A b0 @ k {EE ck ) . (2.33) 


9 In SU(3) quantum chromodynamics (QCD) with massless 
fermion flavors, Eq. (2.26) for b 0 becomes 

10 Since the physically relevant extrema of the effective action 
are the mean potentials induced when a source j a0 is added to 
the standard functional integration quantization formalism [see, 
for example, Abers and Lee (1973)], they must be time indepen¬ 
dent when the source is time independent. 


Using the easily verified identity (which holds for arbi¬ 
trary w a ), 

@j]w a =E kji f abc B bi w c , (2.34) 

we find that the left-hand side of Eq. (2.33) is 

\E kjm E kj \f abc B bl EB cm =0 , (2.35a) 

while on substituting Eq. (2.29) the first term on the 
right-hand side of Eq. (2.33) becomes 

f abc EE bk E ck = 0 . (2.35b) 

Hence the second term on the right-hand side of Eq. 

(2.33) must also vanish. After substitution of Eq. (2.31a), 
this gives the constraint 

f*t>c A bOjcO = 0f (2.36) 

which states that to get a static solution, the scalar poten¬ 
tial and the source charge density must locally lie in com¬ 
muting directions in internal symmetry space. 

In particular, for a source density j c0 describing a par¬ 
ticle with effective classical charge Q c at and an an¬ 
tiparticle with effective classical charge Q c at x 2 , 


; co =e c 8 3 (x-x 1 )-i-e c 8 3 (x-x 2 ), 

(2.37) 

the constraint of Eq. (2.36) becomes 


f abc A bo (x,)Q c =0 , 

(2.38) 

f abc A b0 (x 2 )Q c =0 . 



By making a suitable time-independent gauge transforma¬ 
tion, we can align Q c and Q c to lie in antiparallel direc¬ 
tions in internal symmetry space, 

e c =re, c c =-re, (2.39) 

with q a fixed internal symmetry unit vector. The con¬ 
straints of Eq. (2.38) can then be satisfied by making the 
quasi-Abelian ansatz 


A a0 = q a A° , 

A aJ =fA J , (A\A 2 ,A 3 )= A, 


(2.40) 


which describes a conserved electric flux of magnitude Q 
running between the two point sources, as is appropriate 
to a model for the quark-antiquark confinement prob¬ 
lem. 11 For the potentials of Eq. (2.40), the field-potential 
relations of Eq. (2.21) become 


I'The quasi-Abelian ansatz of Eq. (2.40) excludes “charge¬ 
screening” solutions of the type discussed by Sikivie and Weiss 
(1978, 1979), Kiskis (1980a), Jackiw and Rossi (1980), and Hilf 
and Polley (1981). Such solutions may be relevant as models for 
the behavior of an SU(«) gauge field with adjoint representation 
sources. Fundamental representation sources, such as quarks 
and antiquarks, cannot be screened by the gauge gluon field. 
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E aJ =q*E J , (E\E 2 ,E 3 ) = E=-VA° , 
(B\B 2 ,B 3 ) =B-VXA. 


The internal symmetry structure of the problem can now 
be completely factored away. Equations (2.30) and (2.28) 
simplify to 

^static ^^0 > 

L e[[ = f d 3 x[j?(^)-j°A°] , (2 42) 

J r =(VA°) 2 —(VxA) 2 , 
j°=Q8(x)8(y)[8(z—a) — 8(z +a)] , 


where we have again located the source charges symmetri¬ 
cally on the z axis, and the Euler-Lagrange equations of 
Eq. (2.31) become 

V-(eE)=;°, (2.43a) 

Vx(eB)=0, (2.43b) 


with the dependence on of and e given by Eq. 
(2.28b) and Eq. (2.32), respectively. We have thus reduced 
our model to a problem in nonlinear Abelian electrostat¬ 
ics. 12,13 

As in the discussion of classical electrostatics in Sec. 
II.B, the extremum over the vector potential in Eq. (2.42) 
can be carried out by inspection. From Eq. (2.43b) we get 

0= f d 3 x A-Vx(eB) 

= f d 3 xiB 2 — f ^ dS-e(AxB), (2.44) 

* J surface at oo 

and so if we restrict ourselves to solutions with a vanish¬ 
ing surface integral at infinity, we must have 

eB 2 — 0 , (2.45a) 


giving three branches (la, lb, and II, respectively), 

B—0 , E 2 > k -2 , 

B—0 , E 2 < k - 2 , (2.45b) 

e—0 => B 2 —E 2 —k -2 . 


In the strong-field region near the source charges, the 
asymptotic freedom of non-Abelian gauge theories re- 


12 Dielectric models for confinement in QCD have been dis¬ 
cussed in a qualitative way by a number of authors; see, for ex¬ 
ample, Kogut and Susskind (1974), ’t Hooft (1975b), Pagels and 
Tomboulis (1978), Friedberg and Lee (1978), Callan, Dashen, 
and Gross (1979), and Nambu ( 1981). 

13 Pagels and Tomboulis (1978) and Mills (1979) showed that 
when a single isolated charge is present, the leading logarithm 
model gives a linearly divergent infrared energy. A proof that 
the model of Eq. (2.42) gives a linear static potential for large 
source separations was first given by Adler (1981a), using the 
related minimum principle in which & is replaced by 
(Vi4°) 2 + (Vx A) 2 . (See the comments in Footnote 2 above.) 


quires that the solution of Eqs. (2.43) approach a 
Coulomb-type solution with E large and B vanishing; to¬ 
gether with continuity, this implies that a finite domain 
containing the source charges lies on branch la. Special¬ 
izing the analysis, for the time being, to this branch, we 
set B—A—0 and rewrite Eqs. (2.41) and (2.43a) as 

V-D=;°, VxE=0 , 


D=e(£)E , 

e(E)=±b 0 log(E 2 /K 2 ) ,E= |E 


(2.46) 


A graph of the nonlinear dielectric constant e(£), show¬ 
ing its intersection (for B —0 ) with the three branches of 
Eq. (2.45b), is shown in Fig. 1. 

Equations (2.46) are the basic statement of the problem 
which will be studied numerically in Sec. IV.C. In order 
to get a tractable numerical method, it is necessary (for 
reasons discussed in Appendix A) to rewrite the equations 
in manifestly flux-conserving form. To exploit the axial 
symmetry of the problem, let us work henceforth in 
cylindrical coordinates p,z,(j> defined by 


p=(x 2 + y 2 y' 2 , <t> = tan-'(y/x) , 


2\\/2 


(2.47) 


in which the coordinates of the point sources of Eq. (2.42) 
are p—0, z — ±a. We then note that D can be 
parametrized in terms of a single scalar function <l>(p,z) 
by writing 14 


D=- 


AN 

—V0xV<D=--^ 


277 


27Tp 


XV4> = VX 


A 

A 


2 7Tp 


<D 


(2.48) 


The representation of Eq. (2.48) automatically satisfies 
V*D—0 at points off the axis, and at points on the axis 
where <b is sufficiently smooth. The physical interpreta¬ 
tion of <b follows from calculating the total flux through 
a surface of revolution S (with element of area dA) 
bounded by a circle C of radius p (with element of arc 
length d\=dl(f>), as shown in Fig. 2. We get 

A 

flux through S — f dA-D— f dA*Vx $ 

J s J s 2 7Tp 



(2.49) 


showing that is simply the flux through S. If we draw 
the surface S so that it always intersects the z axis on the 
segment z > a , as shown in Fig. 2, the flux function <I> as- 


14 The flux function formulation was introduced in Adler 
(1981b). The analysis of the characteristic form of the flux 
function equation, and its numerical solution, were given by 
Adler and Piran (1982a). 
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0 2 4 6 8 10 


E/K 



E / K 


FIG. 1. (a) Plot of e(E) of Eq. (2.46), showing its intersection 
(for B=0) with the three branches of Eq. (2.45b). (b) Corre¬ 
sponding plot of^(^=£ 2 )ofEq. (2.28b). 



FIG. 3. Surface S (bounded by an infinitesimal circle O used to 
evaluate 4> on the axis at I z | <a. 


ensures that no additional flux sources or sinks lie at spa¬ 
tial infinity. 

The dynamical equation for is obtained from 


V X E=V X 


D 

8 


=0 . 


(2.51) 


Defining D— | D |, we can algebraically invert the consti¬ 
tutive equation /> — e{E)E to get 

e(E(D)) = e[D] , (2.52) 


so that Eq. (2.51) becomes a differential equation for D, 



(2.53) 


This equation can be rewritten by using the vector identi¬ 
ty 


sumes the following boundary values on the axis of rota¬ 
tion and at infinity: 


<D=0, 0, 

p= 0 , 


z >a , 


z 


<a , 


<D 


0 as p 2 +z 2 


00 


(2.50) 


To verify these, we note that <b is an even function of z, 
and that on the segment p=0, z > a, the surface S degen¬ 
erates to a point and intercepts no flux. Similarly, on the 
segment p—0 , \z\ <a , the surface S intercepts all of the 
flux in a positive sense, as illustrated in Fig. 3. The re¬ 
quirement that <b should vanish on the sphere at infinity 



v-(v 1 xv 2 )=v 2 -(vxv 1 )-v 1 *(vxv 2 ), 


with 


A 



vxv,=o=vxv 2 , 

A 

which when simplified by using 0-V4>=O gives 
V-[cx(p, | VO | )V<D]=0, 


<r(p, | V<I> | ) = 


1 

p 2 z[D] 



V<b 

l7Tp 


(2.54) 


(2.55) 


(2.56) 


Equation (2.56) is the formulation of the leading loga¬ 
rithm model which will be studied numerically in Sec. 
IV.C. As a check, we note that in the case of classical 
electrostatics, where e[D]= 1, Eq. (2.56) and the boundary 
conditions of Eq. (2.50) are satisfied by 


& = coS'fr 2 —cos'fr j), 


(2.57) 


fti=tan 1 


z—a 


^ 2 —tan -1 


z +a 


y 0 < #l, 2 <77 , 


FIG. 2. Surface of revolution S with circular boundary C used which, when substituted into Eq. (2.48), gives the expect¬ 
in Eq. (2.49) to evaluate the flux function <b. e d reS ult 
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D= 


A 



Qr 2 


A A 

4t rr\ Arrr \ 


i) ,/r 


= (x—x 
— [x 2 +y 2 + (zTa) 2 ] 1/2 


(2.58) 


In the case where e{E) is given by Eq. (2.46), the coeffi 
cient function cr is given on branch la by 


cr(p, | V<I> | ) = 



| V<E 

rrboKp 


(2.59) 


with J[w) implicitly defined by the transcendental equa¬ 
tion 


w=/log/,/>l. (2.60) 

For small w and large w, the behavior of J[w) is given by 


f=l+w-}w 2 +0(w 3 ) , 


w 


«1 , 




w 


logtu 


1+ _10glogl£ +0 

logu> 


loglogu; 
log w 


(2.61a) 


, w» 1 , 


(2.61b) 


as shown in Fig. 4(a), giving cr the behavior graphed in 
Fig. 4(b). The fact that cr becomes infinite as / —E/k ap¬ 
proaches 1 from above (i.e., as w <xZ) approaches 0) means 
that a solution which is initially on branch la can ap¬ 
proach branch II only as a degenerate limit and cannot 
cross back and forth between branch la and branch lb. In 
the vicinity of the source charge at p— 0, z — a, Eqs. 
(2.50), (2.56), and (2.59)—(2.61) can be integrated to give 
the leading behavior 15 


Qr i 

D=—y+0(1), 
Arrr\ 

E=F,*/ 


IjrKbor' 


+0(1) , 


(2.62) 


r const 

A°=k f dr ,'/ 


iTTKboir^) 


• \2 


+0(ri) , 


^e-U-costfO + O^sin 2 ^) , 


where the structure of the subdominant terms 0( ) has 
been indicated up to powers of logThe corresponding 
behavior at p—0, z — —a is obtained by reflecting the for¬ 
mulas of Eq. (2.62) in the z —0 plane. 

Once <I> has been determined by solving the boundary 
value problem formulated above, the static potential can 


15 Since Qy(l — cos-0,) exactly satisfies the boundary condi¬ 
tions of Eq. (2.50) around z —a, the leading subdominant term 
in 4> must vanish at 0, = 0,0, = 7r. This boundary condition el¬ 
iminates a possible term in 4> behaving as 0[r\(a + hcosO,)], 
giving the structure shown in Eq. (2.62). 




FIG. 4. (a) Plot of the function J[w) defined in Eq. (2.60). (b) 
Plot of p 2 <r> defined in Eqs. (2.59)—(2.61), as a function of D/k. 


be calculated by substituting V*D— j° into Eq. (2.42) and 
integrating by parts. This gives 

V static “ / dh{ED-jnE(D) 2 ]+Jf[E(0) 2 =K 2 ]} , 


(2.63) 


where an infinite constant f d 2 xJf{K 2 ) has been added 
to Eq. (2.63) to render the integral convergent at spatial 
infinity. A little algebra shows that Eq. (2.63) is 
equivalent to the following computationally convenient 
formula: 


^static — J* d X 2 (J 


Vd> 

2tt 


2 

(1+D, 


£=(/ 2 —l)/(2/u>) , 


(2.64) 


with cr, /, and w as defined above. A second useful ex 
pression for F static is obtained by using the identity 


Jf[E(D) 2 ]-Jf[E( 0) 2 ] = 


r E{D) dE , dJf(E f2 ) 
Je( o) a dE' 



—ED — ^dD'EiD') , 

j o 


(2.65) 
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which when substituted into Eq. (2.63) gives the familiar 
formula 

V static = f d 3 xJT, (2.66a) 

with the field energy density 

f dD , E{D > ) . (2.66b) 

Since we have noted above (and will see in greater detail 
below) that the region of support of D is confined to 
branch la, where E {D) ;> k, Eq. (2.66) gives the inequality 

^static ZK f d 3 xD . (2.67) 

To turn Eq. (2.67) into a useful lower bound on the large 
distance behavior of K static , we must exclude the infinite 
Coulomb self-energies. This is most easily done by ex¬ 
cluding from the x integration small spheres of radius e 
around the source charges, motivating the definitions 

domain) | x—X] | >e , | x—x 2 | ;>e) , 

( 2 . 68 ) 

Value = f a d 3 xm.x)ZK / n d 3 xD . 

When we write d l x=dldA , with / the length along and 
dA the element of area perpendicular to the flux lines of 
A Eq. (2.67) then yields the lower bound 16 

Static (R)^k f n dAD f dl >KQl mm =kQ(R -2e) , 

, “ (2.69) 

R = | x x — x 2 1 =2 a , 

showing that K static increases at least linearly for large 
source separations. A more detailed discussion of the re¬ 
moval of the Coulomb self-energies from the formula for 
Static i s given in Sec. III.E below. 

A great deal of insight into the behavior of Eq. (2.56) is 
obtained by putting it into the standard form for a 
second-order, quasilinear differential equation, 

a*'(x,<D,V<D)a jt a,<D+c(x,<D,V1>)=0, (2.70) 

and analyzing the structure of its characteristics. Defin¬ 
ing the inward directed unit normal ft and the correspond¬ 
ing normal derivative 0„, 


A 

n = 


V<D 


V<D 


dn=n*V, 


(2.71) 


we can see through a straightforward calculation given in 
Appendix A that Eq. (2.56) is equivalent to 


[(a^a^-a^+aa^^-ap-^-o. 

The coefficient a is given by 


(2.72) 


w 


aiogq- _ djlogf) ^ wf{w) ^ 
aiog Vd> [ d{\ogw) f{w) w+f{w) 


w = 


0„4> 


ID 


(2.73) 


jrboKp b 0 K 


16 The flux estimate of Eq. (2.69) is due to *t Hooft (1975b). 



FIG. 5. Plot of the function a(u;) defined in Eq. (2.73). 


the function a{w) is graphed in Fig. 5 and [from Eq. 
(2.61)] has the following approximate forms for small and 
large w: 


a=w+0{w 2 ) , 


w 


«1 > 


a = l 



loglogu? 
(log w) 2 


u>»l 


(2.74) 


From Eqs. (2.73) and (2.74) and Fig. 5 we see that a lies 
between 0 and 1 for D >0, but vanishes when D —0. 
Hence Eq. (2.72) is of degenerating elliptic type, 17 and has 
a real characteristic at a surface of constant 4>, where 

a 

| V4> | =0. The second normal derivative 0„<I> is discon¬ 
tinuous across this characteristic, which acts as a free 
boundary, dividing space into two causally disconnected 
regions. From the boundary condition of Eq. (2.50), we 
learn that the exterior of the free boundary is completely 
surrounded by surfaces on which d>—0. Hence d>^0 out¬ 
side the characteristic, giving the solution the qualitative 
form graphed in Fig. 6. In the vicinity of a point B on 
the free boundary where p=p B and where the radius of 
curvature of the free boundary is R Bj a simple analysis 
given in Appendix A shows that d> has the leading 
behavior 



7rb 0 Kp B 



l 


2 R 


2 

+ 0(n 2 ,/ 4 ) 


(2.75) 


with n and / normal and tangential Cartesian coordinates 
at B (see Fig. 7). Since d> is increasing towards the interi¬ 
or, we must have R B > 0, and so the free boundary is 
everywhere convex. As indicated in Fig. 6, the free boun- 


17 The theory of equations of this type, and extensive refer¬ 
ences, are given in Oleinik and Radkevic (1973). This book 
treats only the linear case [see Eq. (A 18)], rather than the quasi¬ 
linear case encountered in the leading logarithm model. An im¬ 
portant difference found in the quasilinear case is that the loca¬ 
tion of the characteristic depends on the solution to the equa¬ 
tion, rather than being a priori known. This is why the real 
characteristic of Eq. (2.72) behaves as a free, as opposed to a 
fixed, boundary. 
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dary intersects the axis of rotation at a point p= 0 ,z=z / 4 ; 
in Appendix A it is shown that z A >a, with the possibility 
z A =a excluded. Apart from this statement, we have been 
unable to characterize analytically the structure of the 
free-boundary—rotation axis intersection. 18 Once d> and D 
have been determined in the interior region, A 0 can be cal¬ 
culated from the formula 


A °(p,z) — f dz 
j o 


z*D(p,z') 

e[D(p,z')] 


(2.76) 


An interesting alternative method, discussed in Appendix 
A, is to determine A 0 from the known solution for e by 
solving the linear differential equation 


V*(£Vi 4 °) = -y° 


(2.77) 


within the free boundary. 

Since <I> and D vanish identically outside the free boun¬ 
dary, continuity requires that e remain zero in the whole 
of the exterior region. Thus the exterior scalar and vector 
potentials are constrained only by the requirement that 
the electric and magnetic fields satisfy the branch II con¬ 
dition 



(2.78) 


but are otherwise undetermined. In other words, the 
functional L e ff has an infinite equivalence class of C 1 ex¬ 
trema A 0 , A, corresponding to all possible ways of satisfy¬ 
ing Eq. (2.78) outside the free boundary. All members of 
this equivalence class 19 give the same^°,<I> inside the free 
boundary, and make the same physical predictions. 

The solution to the leading logarithm model is clearly 
qualitatively similar to the confinement domain found in 
the MIT “bag” model, 20 but there are important differ¬ 
ences. At the boundary of an MIT “bag” the fields (the 
first derivatives of the potentials) are discontinuous, cor¬ 
responding to the presence of a step function in the varia¬ 
tional principle formulation. In the leading logarithm 
model, the boundary is a characteristic across which the 
fields are continuous, with only the first derivatives of the 
fields (the second derivatives of the potentials) having 
discontinuities. This behavior corresponds to the fact 
that the variational formulation of the leading logarithm 
model involves a smooth action functional L eff . 


l8 The numerical results given below suggest that the free boun¬ 
dary intersects the rotation axis at a right angle (rather than at a 
cusp), but we have no proof of this. 

19 A question which remains to be clarified in the field- 
theoretic context is whether the degenerate exterior solutions 
should be interpreted as vacuum structure. For articles advo¬ 
cating this view see, for example, Savvidy (1977), Pagels and 
Tomboulis (1978), and Nielsen (1981); for possible problems 
with this interpretation, see Kiskis (1980b) and Cabo and Sha- 
bad (1980). 

20 The MIT “bag” model was introduced by Chodos et al. 
(1974)7 for a review, see Hasenfratz and Kuti (1978), and for a 
reinterpretation within QCD, see Johnson (1978). 



FIG. 6. Qualitative appearance of the solution of Eqs. (2.50), 
(2.56), and (2.59)— (2.61). 


E. Axially symmetric Bogomornyi-Prasad- 
Sommerfield monopoles 


As our final nonlinear example, let us consider a non- 
Abelian generalization of the Abelian Higgs model of Eq. 
(2.10), in which an SU(2) non-Abelian gauge field is cou¬ 
pled to a real scalar field qf, a —1,2,3, in the adjoint rep¬ 
resentation. The Lagrangian is 


L= f d 3 xJf , 


(2.79) 


= y(E a, E a —B a, B a ) 

+ \(@W a ) 2 -±{3 J <p a ) 2 -\C[{<p a ) 2 -K 1 } 2 , 


with the field strengths and covariant derivatives given by 


E a} =-@]A a0 —^A tt} , 


(2.80a) 


B aJ =e jkl 


dx 


-A al +\z abc A bk A cl 


<$<fP° = f t <P a -z abc A b0 <p c , 


(2.80b) 


3) jW a = -^-rw a + E abc A bl w c for w a =qf or A a0 , 

' 0x' r > 

[3> k ,3> j ]w a =z k h abc B bl w c . (2.80c) 


In writing Eqs. (2.79) and (2.80), we have for simplicity 
taken the gauge field coupling g to be unity. We will 



,/>=« 


B 


Free boundary 


(radius of curvature Rq ) 


FIG. 7. Geometry near the free boundary used in Eqs. (2.75) 
and (A 10)-(A 12). 
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tt 


again be interested in static solutions for which all time 
derivatives vanish, but this time (since no external source 
charge-density coupling to A a0 has been included) we will 
take^4 fl0 —0. After making these simplifications we have 

- L=H= f d 3 xJT, (2.81) 

with the field energy density 

jr’={B a -B a + {(3’ J <p‘ , ) 2 +iC[(<p‘ , ) 2 -K 2 ] 2 . (2.82) 

We will be interested in what follows in the finite ener¬ 
gy extrema of Eq. (2.81), which clearly must satisfy 

lim [<p a ) 2 =K 2 . (2.83) 

r—* oo 

Defining a unit SU(2) internal symmetry vector n a {r) on 
the sphere at spatial infinity by 

n fl (f)— lim <p a {r?)/K , 

(2.84) 

P=x/r , 

we see that the complete specification of the boundary 
condition for qf requires specifying the number of times 
the two-sphere on which n a lies is covered, when the 
two-sphere on which flies is traversed once. This number 
(which must be an integer when ft a is a continuous func¬ 
tion of r ) is the winding number or topological quantum 
number 

n= lim — f d 2 S i e ijk e abc ff a —n b ^ T n c . 

r— oo 8 77 j sphere at co 0x' 0 X * 


(2.85) 


Hence the problem of finding finite energy extrema of H 
breaks up into discrete topological sectors. 

Extrema of H for 0 are called’t Hooft (1974)—Po¬ 
lyakov (1974) monopoles. A very interesting special case, 
introduced by Prasad and Sommerfield (1975) and 
Bogomol’nyi (1976) and extensively studied since then, 21 
is obtained by setting C—0 but retaining the boundary 
conditions of Eqs. (2.83)—(2.85), giving 

H= f d 3 x±(B a jB aJ '+&j<p a &j<p a ) , 

( 2 . 86 ) 

lim q> a =Kn a , 

r—*eo 

winding number of n a =n . 


The Euler-Lagrange equations obtained by varying the 
functional H of Eq. (2.86) are 


0 , 

E kjm &jB am = -z abc <p b @ k <p c , 


(2.87) 


while the field-potential relations of Eq.(2.80) imply that 


2l For recent reviews on monopoles, see Jaffe and Taubes 
(1980) and O’Raifeartaigh and Rouhani (1981). 


eft o 

u ’ (2.88) 
z kjm @j@ m <p a = - z abc <p b B ck . 

Although Eqs. (2.87) are complicated second-order dif¬ 
ferential equations, they are satisfied by any q> a ,A aj satis¬ 
fying the first-order differential equations 

~^j<p a =£B aJ t £=±1, (2.89) 

with the cases £=1 ( — 1) termed, respectively, self-dual 
(anti-self-dual). 22 To see that Eq. (2.89) suffices, we note 
that by Eq. (2.88) we have 

-£&jB aJ =0 , (2.90a) 

while by using Eq. (2.80c) we get 

z kJm ^jB am = -£z kJm &j& m <p a 

=£z abc <p b B ck = - fz abc <p b 0 k <p c . (2.90b) 

Remarkably, Eq. (2.89) is also a necessary condition for a 
minimum of H , as may be seen by the following 23 rear¬ 
rangement of Eq. (2.86), 

H =H x +H 2 , 

H x = f d i xi(B ai +i& j cp a ) 2 , (2.91) 

H 2 = -£ f d 3 xB al -&j(p a 
= -£/ d 3 x&j(B4qf) 

= -£ f d 1 S i B ai q> a . 

J sphere at oo 

Since H 2 reduces to a surface term, the functional H can 
be extremal only for fields for which H , vanishes, giving 
the condition of Eq. (2.89). The surface term H 2 can be 
evaluated by noting that in order for the integral of Eq. 

(2.86) to converge, &jn a must vanish at large /*, giving 
the following relation between A and n° on the sphere at 
infinity, 

A al = n a n b A bi — e abc n b — n c . (2.92) 

3x J 

Combining Eqs. (2.80a) and (2.92), one finds, after some 
algebra, 

B° j n a = ~^—(z ikl n°A a ')- ie Jk, e abc n‘ , -^- j -n b -^ 7 fi c . 
3x* 2 0x' 

(2.93) 


22 This terminology stems from the fact that j(p a can be 
formally reinterpreted as a static Euclidean electric field 
strength Efy) = jA (£>, A ~(p a > in terms of which 
= — y(B a *B a +E(£ ) *E(£)). The self-dual (£—1) solutions satisfy 
while the anti-self-dual (£ = — 1) solutions satisfy 
Efy) = —B a j. Clearly, a £ = 1 solution can be converted to a 
£= — 1 solution simply by changing the sign of <p a . 

23 This argument is due to Bogomol’nyi (1976) and to Coleman, 
Parke, Neveu, and Sommerfield (1977). 
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The surface integral of the first term on the right-hand 
side of Eq. (2.93) vanishes; and so, referring back to Eq. 
(2.85), we get 

H 2 =ArrK^n . (2.94) 

Since the positivity of H implies that H 2 is positive when 
H x vanishes, we conclude that 


terms of the potentials h Xf 2 i ■ * «, and discuss the residual 
Abelian gauge invariance of the six-function ansatz and 
the choice of a gauge-fixing term for numerical work. 
From the equations given in Appendix B, a straightfor¬ 
ward calculation shows that the leading behavior of the 
potentials at infinity 25 is given by the following formulas 
(in which terms of order e ~ Kr are omitted): 


H 2 =Attk | n 


£,—n/\n 


(2.95) 


Hence the minimum of H in each topological sector is 
determined solely by the topological quantum number. 

The minimum of H in the n — 1 topological sector is 
given by the simple expression 



—(1— Krcothtfr) 
r L 


> 


A aJ = 


e*'V 


1- 


Kr 


sinh Kr 


(2.96) 


which satisfies Eqs. (2.89) with £= 1. This solution is the 
simplest of a family of solutions of Eq. (2.89), in which 
the Higgs field q>° and the potentials A 03 are axially sym¬ 
metric and reflection symmetric, as described by the fol¬ 
lowing ansatz: 




A af =fi 


f\~ n ^ , f 2 ^ 

2 + Pn 
P P 


-{z 3 a x +p 3 a 2 )<t> a n , 


AJ x'l'tf 


p~(x 2 ~}-y 2 ) l/2 J <£ —tan } {y/x) , 

^=( 0 , 0 , 1 ) , 


(2.97) 


p n — (cosrt<£,sinn<£,0), p=p x , 

A § A A 

4>„ =( — sinn^,cosn^,0), <f>=<t> x . 

The potentials h Xt2t f Xt2 , and a x 2 are functions of p and z, 
with z«-» — z reflection symmetry and behavior on the 
z —0 and p —0 axes as follows: 

h 2 >f \> a i even in z , 


hufi,a 2 odd inz , 

(2.98) 

h \ =f 2 =a 2 =0 at z—0 , 

=/2 = ^2=0, /i —n atp— 0 , n > 1 • 

Although analytic forms for the solutions of Eq. (2.89) 
within the ansatz of Eqs. (2.97) and (2.98) are now 
known, 24 we will treat the problem of finding the axially 
symmetric, reflection-symmetric minima of H as a nu¬ 
merical example in Sec. IV.D. In Appendix B we give ex¬ 
plicit expressions for the functional H when expressed in 


24 For the analytic construction of axially symmetric mul- 
timonopole solutions and references, see Prasad and Rossi 


(1981). 


h x =h cosd, fcsind , 
Sind cosd 

a i — — y 02 — 

r r 


f\ =/cosd, f 2 =/sind , 

, n “ aj x, P t (cosQ) 

h=K--+ 2 — 


/=2 
/ even 


,/+l 


, R ^ ^‘"’Pntcosd) 

/=«cosd+p 2 - 


1—2 
i even 


r = (p 2 +z 2 ) 1/2 , d — tan l {p/z) . 


(2.99) 


The leading behavior at the origin 25 is also calculated in 
Appendix B, where it is shown that even in the higher 
topological sectors n > 1, the Higgs field <p a has only a 
first-order zero at r=r0. 


III. RELAXATION METHODS FOR THE NUMERICAL 
SOLUTION OF ELLIPTIC PARTIAL 
DIFFERENTIAL EQUATIONS 

A. Introduction 

In this section we give a detailed introduction to both 
the theoretical and practical aspects of the numerical 
solution of elliptic partial differential equations of the 
type encountered in Sec. II. We assume that the reader 
has read Sec. II.A and especially Sec. II.B, and has at 
least glanced over Secs. II.C—II.E. However, the discus¬ 
sion of numerical methods which follows is essentially 
self-contained, and makes only minimal references back 
to the formulas in Sec. II. Motivated by the fact that the 
numerical solution of the models of Secs. II.C—II.E can 
be reduced to a sequence of solutions to linear differential 


25 Equations (2.99) and (Bll) are the expansions calculated in 
the gauge d 2 a l + d f P 2 = Q- For «= 1, this gauge condition does 
not uniquely fix the Order r terms in a i 2 near r=0; the general 
solution for m= 1 is a,= — bp(y + a) + 0(/* 2 ),a 2 = bz(y — a) 
+ 0(r 2 ), with a a free parameter. The n— 1 case of Eq. (Bll) 
corresponds to a=-y, while the standard form of the single 

monopole solution given in Eq. (2.96) corresponds to a = 0. [To 
put Eq. (2.96) in the form of Eq. (2.97), one uses 

f^z^cos-O+pW-O, z a]l r l ~{p a ^ — p(p )cost> + ( qfz 3 —qjz*) 

X sin-0.] The expansions of Eqs. (2.99) and (Bll) were derived 
by Adler (unpublished), Rebbi and Rossi (1980), and Houston 
and O’Raifeartaigh (1981). 
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FIG. 10. Computational mesh extended by one unit cell to form 
a border. At the half nodes within the border, the h factors of 
Table I are assigned the value zero. The corresponding summa¬ 
tion limits for the four coverings of Fig. 9 are given in Table II. 

signed to <p at nodes on the outer edge of the border, since 
these nodes automatically appear multiplied by an h fac¬ 
tor of zero. When this procedure is used, the summation 
limits for the four coverings of Figs. 9(a)—9(d) are as 
given in Table II. 

C. Iterative methods of solution 

Let us now suppose that the discretization procedure of 
the preceding section has been carried out on a computa¬ 
tional lattice with n t +1 ,n ; - +1 nodes in the p,z directions, 
respectively. Equation (3.25) then gives us a set of 
N = (rt,- + l)(rt ; - + l) linear equations in the N unknown 
variables <p t j. Since N is typically a very large number 
(up to «4X 10 4 in the computations described in Sec. IV), 
the direct solution of this set of equations by matrix in¬ 
version is not feasible, and we must resort instead to an 
iterative method of solution. 

Before describing the specific algorithms used to solve 
Eq. (3.25), let us first discuss a simple and familiar itera¬ 
tive method which will also be needed in the applications 
of Sec. IV. This is the Newton iteration for finding the 
roots of the equation 

/(w)= 0, (3.33) 


which is constructed as follows. Given an estimate w {n) 
of a root w* of Eq. (3.33), we Taylor expand fiw) around 
w {n \ giving 

f{w)= f{w in) ) + {w — U> ( ' ,, )/'(u> ( ' ,) ) 

+ |(u)-u) (n, ) 2 / ,, (u) (n, )+• • • . (3.34) 

When substituted into Eq. (3.33), Eq. (3.34) gives an exact 
power-series equation for w*. When Eq. (3.34) is approxi¬ 
mated by the first two terms in the series expansion, it 
gives a new approximation to w*, 

w*~w ln+1) =w M -£- wi " ) ] r . (3.35) 

The error of the new approximation can be estimated by 
subtracting the two equations 

o =/(u> (n) )+(u> (n+ ' ) —u> (n) )/'(u> (n) ) , (3 . 36a) 

0=f(w ln) )+(w*-w M )f'(w in) )+i(w*-w ln) ) 2 f"(u> in) ) 

+ 0[(u)*— u) ( b) ) 3 ] , 
giving 

u) (n + ,) -u)*«T(u) (n) -u)n 2 ^ i ^ + 0[(u) (n) -u)*) 3 ] 

f'(w w ) 

fiw ) 

(3.36b) 

Hence the error after n + 1 iterations is of order the 
square of the error after n iterations, and convergence 
proceeds very rapidly to w* t provided that the initial 
guess u) (0) lies close enough to w*. Rewriting Eq. (3.36b) 
as 



L„± 

,..(0) 

f"(w*) 

U) i0) -w* 

~ 2 

W —W 

f'(w*) 


+ O[(w i0) -w*) 2 ] , (3.37a) 

we see that a sufficient condition on u> (0) to guarantee 
that the Newton iteration converges to the root w* is 


TABLE II. Summation limits for the coverings of Fig. 9, using the “bordered” mesh of Fig. 10. 


Unit cell 

Lower ij limits 

Upper i limit 

Upper j limit 

“p-derivative covering” Fig. 9(a) 

0 


n J 

“z-derivative covering” Fig. 9(b) 

0 

«/ 

ttj-\ 

“nonderivative covering” Fig. 9(c) 

0 

»i 

n J 

“p,z derivative covering” Fig. 9(d) 

0 


tij-\ 
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-J I It)* 0 ' —10* I 

f"(w*) 

«1 . 

(3.37b) 


2 —J r 

4 \ l 

f(w*) 



a rr 

s^R 




If Eq. (3.33) has several roots w*,w *,. .., there will be 
an interval around each root w* within which the Newton 
iteration converges to that root. 

Let us now proceed to the simplest case in which we 
encounter the iterative method which will be used to solve 
Eq. (3.25). We consider the discrete functional 

L=T 2, 2, A rs<Pr<Ps - 2, J r<Pr > (3.38) 


r=l j=1 


r = 1 


with A„ a real, symmetric matrix with positive eigen¬ 
values. Since the matrix is invertible, L attains a 
unique minimum when the nodal variables 
<p ry r = 1,.. ., N satisfy the N linear equations 


0 = 


3L 


N 


d< Pr 


2 ^rstys Jr • 


(3.39) 


An iterative method for finding this minimum can now 
be constructed as follows. Let us repeatedly sweep 
through the <p rt proceeding from to q> N and then start¬ 
ing over again with <p v 


(p 1 > • • • y ^pNy^P\> • • • y ^pNy • • • * 

first sweep, second sweep. 


(3.40) 


at each step replacing the variable <p R being considered by 
the value which minimizes L when all other variables 
ip rt r^R are held fixed. Specifically, let be the values 
of all the nodal variables when the sweep reaches the vari¬ 
able <p R , so that at this stage L has the value 


L M ={ 1 £ Jr<Pr 


(n) 


(3.41) 


r = 15=1 


r = 1 


Regarding L as a function of the single variable <p Rt with 
the other variables fixed, we have 


L — 2 <PrArr +<Pr 


2 

s^R 


+12 

r^R s=£R r^R 


(3.42) 


Choosing <p R +}) to minimize Eq. (3.42) with respect to 
<Pr, we get 


,(n)_ 1 


A 


l RR 


^,A Rs q} s n) —jR 


(3.43) 


giving as the corresponding change in the action 
L {n +x) -L {n) = \[(cp { S + x) ) 2 — (cp l S ) ) 2 ]A RR 


I ( srS n + 1 ) _(n)\ 

+\<Pr ~~<Pr ) 


— \ A <pr \ A <p R 1 + 2 <p R 1 )A RR 


2A Rs <p ( s n) -J R 

s^R 
(n)' 


+ A <p R \ — A RR A<p { R } —A RR <p R } ) 


(n) 


— ~~T-Arr{ A<pj? ) ) 2 <0 . 


(3.44) 


(In the final line we have used the fact that since the ma¬ 
trix A„ has positive eigenvalues, the diagonal matrix ele¬ 
ments A rr are all positive.) Hence, under the iteration of 
Eq. (3.40), the Lagrangian is monotone decreasing. In 
problems of physical interest with positive definite A rst we 
expect the Lagrangian with the source term included to be 
bounded from below. Equations (3.44) and (3.43) then 
guarantee that in the limit as n becomes infinite, the 
L (n), s converge to the minimum value of L, while the 
<pj. n), s converge to the solution of Eq. (3.39). This method 
of solving the system of Eq. (3.39) is known as the 
Gauss-Seidel iteration. 

As we have just seen, in the Gauss-Seidel iteration each 
nodal variable is successively relaxed to the value which, 
at that stage of the iteration, minimizes L. An important 
variant of the basic method, called the successively over¬ 
relaxed (SOR) Gauss-Seidel iteration, is defined by the 
recipe 

/r ,(n)_^ /7 ,(n +l),SOR_ (n) , * (n) 

<Pr —► <Pr —<Pr +o>i±<Pr 

=cd<p [ £ +1) +{\-cd)<p [ £\ o)> 1 , (3.45) 

with <pj? + 1) and given by Eq. (3.43). In other 

words, instead of relaxing <p R ] to the value which mini¬ 
mizes Ly one systematically overshoots beyond this value. 
Using the over-relaxed iteration, the change in the La¬ 
grangian is 


i (. + USOR_ i (.) = ± [( ( ? + l).SOR ) 2_ (? ,(.) ) 2 ]iljui+(? ,( ? + l).SOR_ ? ,(.) ) 


2 ^R5<P5 W) — /ff 


= \cdk<$ R {coL<$ R +2<p'^)A RR +6>&q} R \ — ^rr^r^) 

= — yo){ 2— co)A rr { A<pj?^) 2 . 


(3.46) 
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Thus provided that 

0<o><2 , (3.47) 

the Lagrangian remains monotone decreasing, and the 
SOR iteration still converges to the solution of Eq. (3.39). 

For a general, symmetric matrix A„, each individual 
iteration step in Eq. (3.43) involves evaluating a sum of N 
terms, which would be computationally costly for large 
N. However, in the specific problems to which we will 
apply iterative methods, r and s are composite indices 

(3.48) 

s=U',j') , 


sparse matrix in which A Rs is nonvanishing only for a 
small number of index values s for each R. Because we 
have followed the prescription of squaring before averag¬ 
ing to ensure that the discretized action contains only 
nearest-neighbor couplings, we find in fact that Ars is 
nonvanishing only for the five index values s correspond¬ 
ing to the node R and its four nearest neighbors. Hence 
each iteration step involves only a short computation 
which is independent of the size of the computational lat¬ 
tice. If the sweep is performed 28 in “typewriter ordering” 

M:(0,0),(1,0), . .. ,(n f ,0),(0,l),(l,l),. .., («,-,l) , 

.. .,(0,rt ; -),(!,«;), .. ., («,-,«;■) , (3.49) 


denoting nodes of the computational lattice, and A„ is a 


then the over-relaxed iteration for Eq. (3.25) is 


(n) (n+l),SOR_+ 1) , /t 




Az Ag. 
A p A z 


(^i +1/2,; +1/2 +fc|+l/2j-1/2 +&I-1/2J + 1/2 + 


-1 


X 


Az 

A p 


(hi +1 n j +1 n ++1 nj -1 n }<p\ +1 j + -1 nj +1 n +**/ -1 nj -1 n W-t j 


Az 


^R-(u ... ..i u .. . ^(») i Ae. 


n +1) 




+ —8 /i0 (8y in ) 


77 


» hi +1 nj +1 n =Pi +1 /2 e / +1 nj + \ n 


(3.50) 


Equation (3.50) applies to all ij in the range 
0 <i<n t - ,0 <j<rij y since, by virtue of the “bordering^ 
procedure discussed above in Sec. III.B, all nodal values 
with indices outside this range appear in Eq. (3.50) multi¬ 
plied by vanishing h factors. An initial guess <p\j must be 
supplied as an input to the iterative process. In practice, 
to achieve a poor-man’s version of the “hierarchical’’ 
iterative schemes (see below), we follow the procedure of 
first iterating to convergence on a very coarse mesh start¬ 
ing from a specified <p\j 7 which is chosen to be reasonably 
close (without sacrificing simplicity) 29 to the anticipated 
solution of the problem. We then successively double the 
mesh and iterate to convergence, taking as the new initial 
guess after each doubling a linear interpolation of the 
converged values on the preceding coarser mesh. 

According to the discussion of Eqs. (3.41)—(3.47), an 
iteration with o>—1 produces the largest possible single- 
step reduction in the Lagrangian L. Hence at the begin¬ 
ning of an iterative solution one always makes three to ten 
complete passes through the computational lattice with 
co — 1 (or with an co which is gradually increased starting 


from 1) to eliminate the largest deviations between the ini¬ 
tial guess and the fully converged solution q>\j\ 
After these initial iterations, the optimal strategy is to use 
an co value larger than 1. The reason is that a general 
analysis 30 of the iterative process shows that in the 
asymptotic large-n limit, the difference cp { i j ) 

behaves as 

Vij-d?-*-"*"'. (3.51) 

with the decay constant y{co) attaining its maximum at an 
co value co= co Qpl ,1 < co opt < 2. Hence one clearly achieves 
the maximum rate of convergence by letting co tend to 
o> opt from below after the initial iterations. Values of co 
larger than co opt should be avoided, since in general they 
produce slower convergence than values of co an 
equivalent distance below o> opt , and since they can lead to 
instabilities in nonlinear problems. The optimum value of 
co can be estimated from the formula [Garabedian (1956)] 


2 

<Bopt ~ l+Oi 


(3.52) 


28 This terminology has been borrowed from Hockney and Eastwood (1981), who discuss alternative sweeps as well. 

29 In practice, there is no great gain in convergence to be achieved by using elaborate functional forms in the initial guess. 
30 See the books cited at the end of Sec. III. A for details. 
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with 

A=(«,-« y )“ 1/2 (3.53) 

a measure of the fineness of the computational lattice, 
and with C a constant which depends on the lattice 
geometry and the boundary conditions. (For the two- 
dimensional Laplace equation in rectangular coordinates 
with Dirichlet boundary conditions, C~3.) An empirical 
method for estimating the value of o> opt is discussed below 
in Sec. III.G; from the general form of Eq. (3.52) we infer 
the useful fact that when the computational mesh is dou¬ 
bled, so that 


n, —►2n,-, rij^lrij , 
h^h/2, 


(3.54) 


the corresponding change in o> opl is 


^opt 


^opi 
2 3“ ^opt 


(3.55) 


An intuitive (and mathematically correct) way to visu¬ 
alize the over-relaxation algorithm is to think of n as a 
time step and of the algorithm as a time-dependent dissi¬ 
pative process, with a steady-state equilibrium at the con¬ 
verged solution q>\™\ The starting guess <p\ 0 j in general 
deviates from * by both large localized transients and 
by relatively smooth errors. The initial iterations with 
co— 1 are used to rapidly eliminate the localized tran¬ 
sients. The later iterations with o>—o> opt minimize the 
relatively long time constant [y(a>)] _1 with which the 
smooth errors damp away. The optimal use of the over¬ 
relaxation method requires attention to eliminating both 
localized transients and smooth (or long-range) errors. 
One method of doing this in a systematic way is the 
“Chebyshev acceleration” method described by Hockney 
and Eastwood (1981), in which the lattice is scanned using 
an “even-odd checkerboard ordering” (as opposed to 
“typewriter ordering”) and in which co is incremented 
from o>— 1 to o—o 0pt in a prescribed way at the begin¬ 
ning of each even-semilattice and each odd-semilattice 
sweep. A second way of accomplishing this is through 
various “hierarchical” schemes 31 in which the full compu¬ 
tational lattice is scanned in only a fraction of the sweeps, 
with the remaining sweeps used to scan sublattices of the 
basic lattice, constructed by using a larger unit cell con¬ 
taining two, four, etc., fundamental unit cells. The Che¬ 
byshev acceleration and hierarchical schemes can be 
proved to be optimal ones, according to well-defined cri¬ 
teria, for solving elliptical partial differential equation 
problems based on the Laplacian (V 2 ) and similar linear 
operators. Since for nonlinear problems it usually is not 
possible explicitly to construct an optimal algorithm, we 
use instead a simpler method which is effectively 


^Hierarchical over-relaxation methods have been discussed, 
for example, by Brandt (1977) and Press (1978). 


equivalent. As discussed above, what we do is to start the 
iteration on a very small (typically 7x7) computational 
lattice, iterate to convergence, and then use an interpola¬ 
tion of this solution as the initial guess for iteration on a 
computational mesh which has been doubled as in Eq. 
(3.54), and so forth. After each doubling, co is reset to 1 
for several iterations to eliminate transients arising from 
the interpolation (these are strongly evident in the unit 
cells along the axis of rotation) and then increased to the 
co opt appropriate to the new mesh spacing h. This pro¬ 
cedure gives very satisfactory convergence and automati¬ 
cally generates a sequence of fully converged L values on 
progressively finer meshes, permitting an examination of 
the convergence of the discrete solution as the mesh spac¬ 
ing h approaches zero. 

Let us consider next the imposition of boundary condi¬ 
tions in carrying out the iterative solution. From Eqs. 

(3.1) —(3.3), we see that the differential equation and 
boundary conditions of our dielectric model are invariant, 
and the source current j° changes sign, under the reflec¬ 
tion operation z—► — z. This implies that the solution <p 
also has odd reflection symmetry, and vanishes on the 
equatorial plane z—0. Although this symmetry emerges 
automatically if the boundary value problem of Eqs. 

(3.1) —(3.3) is solved over the full physical space 
0<p< oo, — oo <z< oo, we can clearly save computer 
time if we impose the symmetry at the outset, by solving 
instead a boundary value problem on the half space 
0<p< oo,0<z< oo. Similarly, if the source current j° 
were replaced by 

/ o =G8(x)80>)[8(z-a)+8(z+a)] , (3.56) 

which is invariant under the reflection z —►— z, then the 
corresponding solution cp would have even reflection sym¬ 
metry, and 0 z <p would vanish at z—0. Again, although 
this symmetry would emerge automatically from the 
full-space boundary-value problem, we can halve the com¬ 
putational effort by using the symmetry to reduce the 
computation to an equivalent half-space boundary-value 
problem. 

When we solve the numerical problem on a half space, 
we introduce an inner boundary z— 0,0<p< oo on which 
a boundary condition must be specified, together with the 
outer boundary condition of Eq. (3.3). On this inner 
boundary, the appropriate boundary conditions are, 
respectively, the Dirichlet or Neumann conditions, 


<p —0 atz— 0 , j° odd [Eq. (3.1)] , (3.57a) 

0 z <p—0 at z—0, 7 ° even [Eq. (3.56)] , (3.57b) 


and we must translate each of these into a corresponding 
updating algorithm for the lattice nodes on the line z—0. 
(The more general Robin or mixed boundary condition 
a<p+00 z <p—0 can also be implemented computationally, 
but will not be encountered in any of the models studied 
in this paper.) In addition, in either the full-space or half¬ 
space problems, there is an inner coordinate boundary 
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the iteration will depend strongly on what is being mea¬ 
sured at the end of the iterative process. 

IV. NUMERICAL SOLUTION OF THE MODELS 
OF SEC. II 

A. Introduction 

Let us proceed now to apply the numerical methods 
described in Sec. Ill to the nonlinear models formulated 
in Sec. II. In Table III we summarize which dependent 
variables in the three models are discretized on the node 
lattice, and which are discretized on the half-node lattice, 
together with the boundary conditions which are imposed 
during iteration. In the brief sections which follow we 
discuss aspects of the numerical analysis which are specif¬ 
ic to the three models and give sample numerical results. 

B. The Abelian Higgs model 

Following the analysis of Sec. III.E and Appendix C, 
we explicitly subtract off the Coulomb self-energy from L 
by making the substitution 

(4.1) 

with Aq the Coulomb potential of Eq. (Cl) and with B° a 
new dependent variable. Because A £ and <p are both 
singular at the charges [cf. Eq. (2.17)], the charge coordi¬ 
nate z=a is taken to lie midway between nodes of the 
computational lattice: 

a~ («q + t)Az, (4.2) 


volume solution, p max and z max must be chosen large com¬ 
pared with the characteristic exponential decay lengths 
appearing in Eq. (2.17), requiring (for /c—1) that 

min(pmax ) z max)»niax[(2C)“ 1/2 ,(2e 2 )“ 1/2 ] . (4.3) 

Sample results for the Abelian Higgs model, calculated 
with 


K =C=e=Q 2 /(4ir) = l , 

(4.4) 

Pmax ^max fl = 1.625 , 

are shown in Figs. 12(a)—12(d). These figures give values 
of <p and A 0 (plotted vertically) on a plane passing 
through the axis of rotation (represented by the horizontal 
plane in the figures). One can see clearly the peaks in q> 
and A 0 at the charges, as well as the exponential decay of 
q> towards 1 and of A 0 towards 0 at infinity. In Figs. 
12(c) and 12(d), in which the vertical scale has been mag¬ 
nified by a factor of 10, one can also see that the structure 
of the solution extends to the computational boundary, in 
marked contrast to the behavior found below in the solu¬ 
tion of the leading logarithm model. 


C. The leading logarithm model 

In discretizing the leading logarithm model, we put the 
charge coordinate z— a on a node of the computational 
lattice, 


with Hq an integer. We choose the unit of length so that 
/c — 1, giving <p—► 1 as the boundary condition on <p at in¬ 
finity. Since this boundary condition follows from requir¬ 
ing L to be extremal [L is infinite if q>~M at infinity), it 
can be enforced computationally by simply iterating the 
nodal values for <p which lie on the outer boundary of the 
computational mesh. An alternative procedure would be 
to set <p — 1 on the outer boundary; the two methods give 
the same result in the limit asp max — ►oo,z max —>-oo, but the 
iterative boundary condition is preferable for finite 
meshes. 36 To get a good approximation to the infinite 


36 In general, for solutions with r~ n asymptotic behavior at in¬ 
finity, using an iterated boundary condition on the computation¬ 
al outer boundary gives greater accuracy than using a Dirichlet 
boundary condition (York and Piran, 1982). For linear prob¬ 
lems, for example, Cantor (1983) has proved that a sequence of 
solutions with the iterated boundary condition and with increas¬ 
ing (pmax> 2 maJ will converge to the true solution with 
(pmax> 2 maJ= (oo, oo ), and this sequence can even be used to 
study the asymptotic behavior of the true solution at r- oo. 
Such strong statements cannot in general be made when a Dir¬ 
ichlet boundary condition is used on the outer boundary. In the 
Abelian Higgs model, where q? approaches its asymptotic value 
exponentially at infinity, the difference between the two types of 
boundary conditions is not expected to be as marked as in the 
case of power-law asymptotic behavior. 


a=n e Az, (4.5) 

with n Q an integer, and enforce the step function boun¬ 
dary condition of Eq. (2.50) by requiring 

0 <j<n Qt 

< t>oj = {Q, j~n Q , (4.6) 

< J > o,;=0. n Q <j<nj- 

[An alternative procedure would be to put the charge 
coordinate midway between lattice nodes by taking 
a — ( n Q + t giving the boundary condition 

®oj — Q , 1 0 < j < n Q and <F 0l ; — 0, n Q + 1 < j < n- } .] Because 
the solution for <I> is confined within a finite free boun¬ 
dary, the numerical solution is independent of p m ax> z max> 
provided that these are large enough for the fully con¬ 
verged free boundary to lie entirely within the computa¬ 
tional mesh. To facilitate picking values of p ma x> z max 
which are large enough to contain the free boundary but 
are not excessively so, we have included a control parame¬ 
ter option in the program which permits the adjustment 
of the limits of the computational mesh during iteration. 

In carrying out the iteration we do not let the dielectric 
function e assume the value 0, but rather impose a 
minimum value e min by computing e and o from the for¬ 
mulas 
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TABLE III. Node and half-node assignments and boundary conditions for the models of Sec. II. 


Model 

Defined on 
node lattice 

Defined on 
half-node lattice 

Boundary conditions (or remarks) 

Outer boundary 

p = 0 z = 0 P - P ^ or z = 

r r r max max 

Abelian Higgs 

B° 


(charge coordinate 



Sec. II.C 

(cf. Appendix C; 


z — a midway 

o 

II 

o 

B° iterated 


A° = Ac+B°) 


between nodes) 






B° iterated 




<P 


cp iterated 

cp iterated 

cp iterated 






(could also use cp = 1) 




(charge coordinate 



Leading 



z = a on node) 



logarithm 

<D 


<D ^ Q, z < a 

<t> iterated 

o 

n 

e 

Sec. II. D 



<I> = z = a 






<t> - 0, z > a 





a 

a determined at all half-node points within 




computational mesh by updating from <t> 


hi 


h { iterated 

o 

II 

/j ( - (1 -n/r) COS-& 

Axially 

h z 


/i 2 = 0 

h 2 iterated 

h 2 = (1 - n/r)s\n$ 

symmetric 

/< 


/(“» 

/ t iterated 

/ t = n cos 2 £ 

monopole 

h 


O 

II 

< 

o 

K 

< 

f 2 = n sin-fr cos-ft 

Sec. II.E 



a\ =0 

iterated 

- (l/r)sin-fr 


a r 


a 2 iterated 

a 2 —0 

u 2 = (1 /r) COS-& 



FIG. 12. Sample results for the Abelian Higgs model, calculated for the parameter values in Eq. (4.4). The graphs show (a) cp , 
(b) A (c) cp with the vertical scale magnified by a factor of 10, and (d) A 0 with the same vertical scale magnification, all plotted verti¬ 
cally over a horizontal plane through the rotation axis. The values of cp and A 0 at the base of the figures are cp— 1 and A =0, 
respectively. 
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e=max[|Z» 0 log(£ 2 //f 2 ),e min ] , 


a = min 



V<j> 

1Tb 0 Kp 


_I_ 

» 2 
P ^min 


(4.7) 


This procedure avoids floating point underflows and over¬ 
flows, and corresponds to keeping the differential equa¬ 
tion for d> just barely elliptic even outside the free boun¬ 
dary. Provided that e min is chosen to be very small (we 
have used values ranging from 10“ 15 to 10 -35 with equal¬ 
ly satisfactory results), the results for V static are essentially 
independent of e min . Although convergence of the itera¬ 
tion is improved by over-relaxation, we have found that 
the nonlinearity of the combined iteration of Eqs. (3.86a) 
and (3.86b) leads to instabilities in the free boundary 
shape, if one attempts to use co values as large as the op¬ 
timum co appropriate to the linear subiteration of Eq. 
(3.86a)- These instabilities are avoided by limiting co to at 
most co =1.1 when iterating on meshes larger than 
25x25. Full convergence within the free boundary re¬ 
quires about 1—2 min of CPU time on a VAX 11/780 
computer for a 25x25 mesh, and around 1 h for a 
lOOx 100 mesh. Sample results on a 25x25 mesh [Adler 
and Piran (1982a)], computed for the parameter values 


*=1, C = (t> 1/2 > b Q = 9/{$'n 2 ) , 

(4.8) 

Pmax ^max 'jR=(X 4, 

are given in Figs. 13(a)-13(d). These figures show, 
respectively, the flux function d>, the field energy density 
2P with a vertical scale magnification of 100, and the 
logarithm of the dielectric constant e, all plotted vertically 
over a horizontal plane through the rotation axis. In the 
plot of d> the p =0 boundary condition of Eq. (4.6) is 
clearly visible, and in both plots of 2F one can see the 
Coulomb energy peaks. The plot of d> and the magnified 
plot of SIT show that the flux and energy are confined 
within an oval curve, approximating the continuum limit 
free boundary, which is also clearly visible in the contour 
plots of d> and shown in Figs. 14(a) and 14(b). Both be¬ 
cause we have imposed a cutoff e>e m in = 10“ 15 , and be¬ 
cause of finite mesh-spacing effects, the computational 
problem has low-level residual structure extending outside 
the continuum free boundary (but lying within a second, 
computational, free boundary), as can be seen in the plot 
of loge in Fig. 13(d). This residual structure, together 
with the fact that the location of the free boundary is not 
stationary under small variations around the equilibrium 



FIG. 13. Sample results for the leading logarithm model, calculated for the parameter values in Eq. (4.8). The graphs show (a) the 
flux function <h, (b) the field energy density (c) with the vertical scale magnified by a factor of 100, and (d) the logarithm of 
the dielectric constant e, all plotted vertically over a horizontal plane through the rotation axis. The values of <t>, and e at the 
base of the figures are 0, 0, and £ m j n = 10 -15 , respectively, with e falling 14 decades from the top of (d) to the base. [When E m j n is re¬ 
duced to 10“ 35 , the residual structure along the axis at the base of (d) is eliminated.] 
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solution <b, makes an accurate determination of the free 
boundary location more difficult computationally than an 
accurate measurement of F statjc , which is stationary 
around equilibrium [cf. Eq. (2.42)]. An analytic investi¬ 
gation by Lehmann and Wu (1983), described briefly 
below, shows that in the limit as R — >-oo, the continuum 
free boundary approaches an ellipsoid of revolution. 
Rigorous proofs of the existence of a continuum free 
boundary have been given by Lieb (1983) and by Gidas 
and Caffarelli (1983). 

As described above in Sec. III.E, to measure F static (R) 
we make a sequence of measurements of 
^staticW- F static (i? /2), with mesh geometries at the 
separations R, R/2 chosen so that the Coulomb self¬ 
energies cancel. For R values between 128 and 1, the cal¬ 
culation can be done in the original p,z coordinates, with 
uniform mesh spacings Ap,Az in the p and z directions. 
For R values smaller than 1, a Jacobian transformation as 
described in Sec. III.D is necessary. A simple 
“stretching”-type transformation which gives good results 
down to the smallest R values is given by 

p = H{p'fiM) , 

z=H{z\a) , 

(4.9) 


H(z\a) 


z\ z'<3a 



1/3 > 


3a <z'<4a , 


to be used with z'max^^p'max^^a. This transforma¬ 
tion has continuous first derivatives, leaves the mesh uni¬ 
form near the source charges (so that Coulomb self¬ 
energies still cancel), and has an outer region in which 
mesh points are distributed so as to sample in a roughly 
uniform way the field energy of a dipole source, as indi¬ 
cated by the following estimate, 

^dvpole^^ cc n+3cOS $) r 2 dr 


4 

3 


d 



\ 

3 



dz’ 

=0.05-^r , d = 0 
a 4 

(4.10) 

= 0.03^- , &=ir/2 . 

~4 * 


In the calculations for R values much smaller than 1 the 
free boundary cannot be resolved even on a 100 x 100 
mesh, but at small separations the infrared contributions 
to the static potential are no longer dominant, and so 
there is no difficulty in making an accurate determination 
of F static by using the transformation of Eq. (4.9). An al¬ 
ternative method for doing the calculation at short dis¬ 
tances is to use the transformation to bispherical coordi¬ 
nates given in Eq. (3.63). [This transformation is in fact 
useful at large distances as well; in Fig. 14(c) we show a 
contour plot of the flux function d> in bispherical coordi- 




Roiolion oxis-a><z<-o Rolalion axis o<z«d 



FIG. 14. Contour pbts obtained from the numerical solution of 
the leading logarithm model, with the parameter values of Eq. 
(4.8). The plots (a) and (b) show contours of the flux function <1> 
and the energy density in uniform Cartesian coordinates, 
corresponding to the elevation plots of Figs. 13(a) and 13(c), 
respectively. Specifically, (a) contains 21 equally spaced con¬ 
tours ranging from 0.001 to 1, while (b) contains 21 equally 
spaced contours ranging from 0.0001 to 0.01 (with the max¬ 
imum of scaled to 1). The plot (c) shows contours of the flux 
function <1> obtained on a bispherical grid. The coordinates are 
p and cosy, and the charges are located at p= ±oo. The free 
boundary is just inside the contour line <P/Q = 0.05. 
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nates, calculated for the distance a =4 at which the free 
boundary is clearly resolved.] The results of the calcula¬ 
tions using “stretched” cylindrical coordinates and using 
bispherical coordinates agree to better than 1 % at all dis¬ 
tances, and are presented in the form of a parametrized 
analytic fit to F stat j c in Adler and Piran (1982b). 

In solving a complicated problem numerically, it is im¬ 
portant to check the computer program, wherever possi¬ 
ble, against analytic expressions which are available in 
limiting cases. In the case of the leading logarithm 
model, systematic analytic approximations can be 
developed in the small-i? limit (Adler, 1983) and in the 
large-i? limit (Lehmann and Wu, 1983). At small R a 
perturbation analysis in powers of an appropriately de¬ 
fined running coupling £{R) gives 

FW*) == ~ - g -- [ gW + 0 ( g 3)] . 

R-+0 47rRjb 0 


SIR) 


f(w R ) 

w R 


1 

logu>* 


+ 0 


loglogUJ/; 
(logu)* ) 2 


+ 0 


1 

(log w R ) 3 


(4.11) 


V st * tic (R) = kQR + Q 3/2 ~ 

R — ► oo 3 


2 

77-bo 


1/2 

/c 1/2 log(/c 1/2 tf) 


+ ’ ’ ’ 


(4.13) 


This formula shows that the large R bound on the linear 
potential derived by Adler (1981a) is saturated. The nu¬ 
merical results for F static in the range R ~10— 100 yield 
coefficients of the R and log(/c 1/2 10 terms which agree 
with the analytic results of Eq. (4.13) to better than 1%. 
According to Eq. (4.12), at large R the limiting behavior 
of the free boundary is an ellipsoid of revolution 


1 = 


a' 


1 

irb 0 

1/2 

P K 1/2 

2 

2 <2 


a xn 


(4.14) 


with the major axis along the axis of rotation growing as 
R y and with the minor axis growing as R 1/2 . A study of 
the structure of the free boundary using the numerical 
solutions for R~ 50— 10 3 shows that the outer contours 
of <I> have a shape agreeing well with this formula. Hence 
in both limiting cases in which analytic approximations 
are available, they are in excellent agreement with the re¬ 
sults of the numerical solution for <!>. 


Wr= 2 n2 ’ Ap = 2.52/f 1/2 . 

A pH. 

When the numerical results for F stalic in the range 
k 1/2 R ~ 10 -6 —10 -8 are fit to the functional form of Eq. 
(4.11) with A P adjustable, we find A P «2.49, in excellent 
agreement with the analytic result. At large R y a sys¬ 
tematic expansion of the differential equation for in 
powers of\/R gives 37 


<S>=&°Hp/R l/2 ,z/R)+±-& l Hp/R l/2 ,z/R) + ■■■ , 

R 


<D 


( 0 )_ 


Q 


1-| 

irb 0 

1/2 1 
ap 2 K l/2 

x 2 

2 Q 

a 2 -z 2 


(4.12) 
a = \R , 


permitting the determination 37 of the leading two terms in 
the large-distance behavior of the static potential, 


37 The solution <F 0) positions the charges on the free boundary, 
reflecting the fact that the distance between the charges and the 
free boundary ( z A — a in Fig. 6) vanishes relative to R as R —► oo. 
The derivation of the logarithmic coefficient in Eq. (4.13) as¬ 
sumes the stronger statement that ( z A —a)/R vanishes faster 
than R~ l log R as R —► oo. The agreement of Eq. (4.13) with 
the numerical results gives a posteriori evidence for the validity 
of this assumption; for an analytic investigation of this issue see 
Lehmann and Wu (1983). 


D. Axially symmetric monopoles 


In solving numerically for the axially symmetric mono¬ 
poles, we use the leading terms of the asymptotic formu¬ 
las of Eq. (2.99) as Dirichlet boundary conditions on the 
outer boundary, without including the 1 = 2 or higher 
terms in the expansion. Consequently, the potentials ob¬ 
tained computationally will contain errors of order 
1 /Pmax> l/ z max> which can be made small by choosing p max 
and z max large enough. An important check on conver¬ 
gence is to verify that the bound 


H —4t7vc n 


(4.15) 


is attained. Since the leading terms which are retained in 
Eq. (2.99) make a contribution to the energy density given 
by 




asymptotic' 


n 


(p 2 +z 2 ) 2 * 


(4.16) 


we must include an analytic correction for the energy ly¬ 
ing outside the boundary of the computational mesh when 
we test Eq. (4.15). Following the notation of Eq. (3.70), 
we do this by writing 


H H inside ”1“ H outside > 


(4.17a) 


with 

D Z 

/* r max . /* max , , . _ 

ffinsid e = 4ir / o pdp f o dzJT (4.17b) 

determined computationally, and with a simple integra¬ 
tion giving 
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(a) 




FIG. 15. Sample results for the axislly symmetric Bogomol’nyi-Prasad-Sommerfield monopole problem. The graphs show the field 
energy density (a) for n = 1 and (b) for n = 2, plotted vertically over a horizontal plane through the rotation axis. The two cases 
can be characterized, respectively, as a fuzzy “ball” and a fuzzy “doughnut” of energy. 


^outside /outside / 


outside ^asyuptotic 


= n 


2ir 277 . _ i 

--f-sin 1 


'max 


'max 


'max 


2 \l/2 


(Pmax'b^max ) 


(4.17c) 


As illustrations of the solution of the monopole equa¬ 
tions, we have solved the n = l,2 cases on a mesh with 
Pmax=Zmax = 10, in units in which k=\ (and with the 
gauge function ^ of Appendix B taken as O). 38 For the to¬ 
tal energy H calculated from Eq. (4.17), we get (on the 
relatively coarse mesh plotted in Fig. 15) 


n = 1, H= 12.88 , 

(4.18) 

n= 2, H = 25.22, 

in good agreement with the values of 12.57, 25.13 expect¬ 
ed from Eq. (4.15). In Fig. 15 we give plots of the energy 
density ^ for the solutions. The n = 1 solution shows, in 
the large, the expected spherical symmetry, while the 
n—2 solution has the form of a fuzzy “doughnut,” in 
agreement with a variational calculation of Rebbi and 
Rossi (1980) and with the recently found analytic solution 
for this case. 39 However, we also observe that the n = l 


numerical solution has a dip in ^ along the rotation axis, 
whereas in the exact n = 1 monopole solution given in Eq. 
(2.96) above, decreases monotonically away from the 
axis. This dip is a remnant of the second-order errors in 
the discretization procedure which, in the present in¬ 
stance, weight too heavily the lattice cells along the axis, 
and thus lead to a reduction in the value of the con¬ 
verged discrete solution at the axis. When the lattice 
spacing is made finer, the dip becomes smaller, but the 
correct monotinicity properties of at p =0 appear only 
in the limit of zero mesh spacing. 
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38 As a check on the computer code, we have verified that the 
energy H of the numerical solution is unchanged when a choice 
0 is used. 

39 For a discussion of the axially symmetric solutions realizable 
within the six-function ansatz, see, for example, Prasad and 
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APPENDIX A: MATHEMATICAL FEATURES 
OF THE LEADING LOGARITHM MODEL 

We discuss in this appendix a number of mathematical 
features of the leading logarithm model. We begin by 
analyzing the structure of the real characteristic (the free 
boundary) of the differential equation 
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V-[a(p, | V<1>| )V<D]=0, 


(Al) 


with a defined in Eqs. (2.59) and (2.60) of the text. Using 
the chain rule and dividing by o, we see that Eq. (A 1) be¬ 
comes 


V 2 <P + ... alo i^ , Vdog I V4> | )-V4> + ^^a„<l>=0 ■ 


dlog V4> 


a - lQ g CT a ff,-, 

d P p 


(A2) 


This can be further rewritten by substituting 

v 2 =a p +a 2 +p _ 1 a p , 

vdog]vi>| )-vi>=-—^-(a.-a-O) 

| V<I> | 

= n i njd i dj<t> 


(A3) 


= n i a / n / a y <D - V<D | 


=a;*. 


with n the unit normal defined by 
V<1> 


A 

n = 


A A i 

, rrn = 1 , 


V<j) 


(A4) 


thus giving 


(a 2 +a 2 -a 2 )+ 


i + 


aiogq- 


aiog | 


•n 


<D 




1 + 


aiogcr 

aiogp 


/9 - 1 a„<D=0 . (A5) 


Now from Eq. (2.59) we have 

logcr = log(27TK') — logp — log | V<I> | -flog/(u>) , 

VO I 


w = 


a„<D 


(A6) 


irb 0 Kp irb 0 Kp 


and so the derivatives of cr appearing in Eq. (A5) can be 
expressed in terms of/(u>), 

I aiogo- _ aiog/ _ wf'jw) 

aiog [ vo | aiog | vo | fiw) 

t aiogo _ aiog/ _ wf{w) 
aiogp aiogp fiw) 

yielding Eqs. (2.72) and (2.73) of the text, 


(A7) 


[ (a 2 +a z 2 - a 2 )+aa 2 ]<d- ap-‘a p <D=o, 


a= 


wfiw) 


w 


(A8) 


f{w) w+fiw) 

In the vicinity of a general, off-axis point B with coor¬ 


dinates x B on the free boundary, we have 
—ap _1 a p <l>« — wp~ l d p <P~ | V<I> | 2 . 


(A9) 


so to first order in | |, Eq. (A8) can be approximated 

by 


, a„<i> 9 

—a 2 

1Tb 0 Kp B 


<D=0, 


(A 10) 


with / and n, respectively, tangential and normal Carte¬ 
sian coordinates at the free boundary, as shown in Fig. 7. 
When the radius of curvature of the free boundary (and of 
the nearby level surfaces of <b) is R B , then to the needed 
accuracy the behavior of near the free boundary has the 
form 


<D = F 


«-7 


i 


R 


B 


, Fi 0) = 0 


(All) 


Substituting Eq. (All) into Eq. (A10) determines the 
function Fiz) to be 


Fiz) = i 


i irbpKPB 2 


R 


(A 12) 


B 


giving Eq. (2.75) of the text. 

Let us consider next the point p=0,z=z A , where the 
free boundary intersects the axis of rotation. From Eq. 
(2.62) we know that A °= -1- oo at the source charge Q> and 
A 0 is arbitrarily large within a sufficiently small neigh¬ 
borhood of the point p = 0,z = a. On the other hand, since 
A 0 can be determined along the free boundary by integrat¬ 
ing dfA°=K out from the plane z=0, where A 0 vanishes, 
we have 


A°{p=0,z=z a )=kL , 


(A 13) 


with L the (finite) length of the segment of the free boun¬ 
dary lying within the quadrant drawn in Fig. 6. Hence A 0 
is finite at p=0,z=z A , and so we must have z A >a, with 
the possibility z A =a excluded. 

Let us suppose now that we have solved Eq. (2.56) or 
(A8), and hence know e = (p 2 o-) _1 as a function of x. As 
mentioned in the text, one way (not the simplest way!) to 
determine A 0 is to solve the linear differential equation 


V‘(eV^°)= — j° 


(A 14) 


within the free boundary. Since Eq. (A14) is the Euler- 
Lagrange equation corresponding to minimization (for 
fixed e > 0) of the functional 


f d 3 x[}e(VA 0 ) 2 -j°A°] , 


(A15) 


solutions will exist. To see that the solution is unique, 
even without the imposition of a boundary condition on 
the free boundary, let us suppose that Eq. (A 14) has two 
C 1 solutions.4? and^ 2 > so that &A°=AA\ satisfies 

V‘(eV&4°)=0 . (A 16) 

Multiplying by S/f° and integrating over the interior of 
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the free boundary, we have 
0= / d 3 x8A°V(eV8A°) 

= f d 3 xV(8A°eV8A 0 ) — f d 3 xe(V8A 0 ) 2 
= f dS (8A°EV8A 0 )- f d 3 xe(V8A 0 ) 2 . 

J free boundary J 

(A 17) 

The first term in the final line of Eq. (A 17) vanishes, be¬ 
cause e vanishes on the free boundary, and so Eq. (A 17) 
implies that V&4 °=0 in the interior. This, together with 
the requirement that bA° be an odd function of z, implies 
the vanishing of bA° within the free boundary. 

The seemingly paradoxical fact that Eq. (A5) requires a 
Dirichlet condition 4>=0 on the free boundary, while Eq. 
(A 14) requires no boundary condition for A 0 on the free 
boundary, has an interpretation in terms of the general 
boundary-value problem for second-order equations with 
non-negative characteristic form, given by Fichera (1956). 
The generalized Dirichlet problem takes the form 17 


L {u)=a k Hx)d k djtt + b k {x)d k u -\-c{x)u 
u =g on 2 2 U2 3 , 


—/(*) in H , 
(A18) 


a iJ =eb iJ , b k =d k e, c= 0 

=*>b k -d ja kj =0, (A22) 

so that the free boundary is in 2 0 . Hence no boundary 
condition for A 0 on the free boundary is needed when e 
has been determined as a function of x by first solving the 
equation for 4>. 

Suppose, on the other hand, that we attempt to solve 
the full nonlinear problem for A 0 given by Eqs. (2.41) and 
(2.46) directly, with s not known a priori. These equa¬ 
tions, when recast in the standard quasilinear form of Eq. 
(2.70), yield 

Ejj3j3jA®= —y° , (A23a) 

with Eij the field-strength dependent dielectric tensor 

£ ij = 8 lj }b 0 log[(VA (, ) 2 /K 2 ]+jb 0 T l tj . (A23b) 

A 

The unit vector / f - is defined by 

^ 0 f i4° 

h = ■ (A24a) 

and since 


with / and g functions defined on Cl and on 2 2 U2 3 , 
respectively. The sets 2 2 and 2 3 are subsets of the boun¬ 
dary 2 of the domain Cl, specified as follows. Let n k be 
the inward directed normal to the boundary. The set X 3 
is defined to be the noncharacteristic part of the boun¬ 
dary, where a kJ n k rtj> 0. The characteristic part of the 
boundary, where a kj n k rtj = 0, is divided into sets 
2 0 ,2 u 2 2 , defined by 


/•V*l>ccD'V < l>oc((£xV < l>)'V < l>=0, (A24b) 

I is orthogonal to the unit vector ft of Eq. (A4). Compar¬ 
ing Eq. (A23) with Eq. (A18), we see that 

a ,J =E ijf b'=c=0 , (A25) 

and so Eq. (A23) is elliptic in the interior region and de¬ 
generates on the characteristic, with 


b =0 on 2 0 , 


b >0 on 2] , 
b < 0 on 2 2 , 
b=n k (b k -dja k l) . 


(A 19) 


According to Eq. (A18), a Dirichlet boundary condition is 
required on 2 2 and 2 3 , while no boundary condition is 
needed on the subsets 2 0 and of the boundary. 

Let us now analyze Eqs. (A8) and (A 14) using this for¬ 
malism. In discussing Eq. (A8) it suffices to use the ap¬ 
proximate form of Eq. (A10), with / and n fixed Cartesian 
axes as in Fig. 7, giving 

d <t> 

a tt = 1, a ,n = 0, a nn = —^ -, b i =b n =c= 0, (A20) 

irb 0 Kp B 


so that 


b 


drt 


a m =- 


— 1 —0 2 „<d=--^-<o. 

irbQKpB &b 


(A21) 


Thus for Eq. (A8) the free boundary is in 2 2 , and the im¬ 
position of a Dirichlet condition 4>=0 on the free boun¬ 
dary is required. [Note that this condition, together with 
the discontinuity of 4> at the source charges given by Eq. 
(2.50), then implies that 4> = g on the interior line seg¬ 
ment p=0, | z | <a.] In Eq. (A 14), we have 


b = -a„[i& 0 iog(£V/c 2 )]- ib^djdiij) 


= -^b 0 (E- l d n E + n i d,T 1 ) . 


(A26) 


At a point B on the free boundary where the radius of 
curvature is R B , we see from Fig. 7 that 

n,9|£ = ^- > (A27a) 


while from Eqs. (2.61), (2.75), and (A6) we get the leading 
behavior of E in the vicinity of the free boundary, 


E^k(\ + w)=k 


1 + 


0„<D 


irb 0 Kp B 


K 


i + 


n 

r b 


^■E~'d n E = ~ , (A27b) 

Eb 

giving 

*>o 

b = ~~~< 0. (A28) 

Eb 

Hence according to the Fichera criterion of Eq. (A19), a 
Dirichlet boundary condition for A 0 is required at all 
points x B on the free boundary. This boundary condition 
is implicitly available in the form 
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A°(x b )= f dld,A°= f dlK , (A29) 

* ^z—0 plane ' ^z=0 plane * 

with f dl a line integral along the characteristic, but 
since i4°(x 5 ) thus becomes a function of the geometry of 
the free boundary (which is not known a priori ), this con¬ 
dition is difficult to implement in a numerical calculation. 
An important advantage of the flux function reformula¬ 
tion is that it replaces Eq. (A29) by the explicit Dirichlet 
boundary condition <P{x B ) = 0. 

APPENDIX B: STRUCTURE AND PROPERTIES 
OF THE SIX-FUNCTION ANSATZ 

Substituting the six-function ansatz of Eq. (2.97) into 
the field-potential relations of Eq. (2.80), we get the fol¬ 


lowing expressions for the various field-strength com¬ 
ponents (with 0 z =0/0z,0 p = 0/0p): 

I0jqf\£ a £ / =d 1 h l +a l h 2 , B' l F&=p- l [d p f l +a 2 f 2 ) , 

(<2 j <p°)fi£ j =d z h 2 -a 1 h u B4fi£ i =p-Ka p f 2 -a 2 f l ) , 

iaJ<p a )£y = d p h l +a 2 h 2 , B aJ 2 a p i =-p-\dJ l +aJ 2 ) , 

(Bl) 

Ji 2 -a 2 h u B aj ftp i =-p- 1 (d z f 2 -aJ,) , 
U8jqftyhfi —p-'ihif i —h\f 2 ), B a; ?“^=a p a 1 -a z a 2 . 

Substituting these expressions into Eq. (2.86) gives a for¬ 
mula for the Hamiltonian H directly in terms of 

^ 1,2 y * * ' > 


H = 47r Jo” P d P f 0 dzjr ’ 

JF’=- 5 [{d z hi+a l h 2 ) 2 + {d z h 2 -a l h l ) 2 + (d p hi+a 2 h 2 ) 2 + (d p h 2 -a 2 h l ) 2 ] 

+ [Oz/l + a + ~ a l/l ) 2 + (d pf l+ a 2f 2 ) 2 + (9^2 1 ) 2 ] 

2p 

+ j(d z a 2 -d f p l ) 2 +~(h l f2-h 2 f l ) 2 . 


Equations (Bl) and (B2) and the boundary conditions at 
p = 0, z=0, and r = oo given in Eqs. (2.98) and (2.99) have 
a residual Abelian gauge invariance of the form 

h\— ►fticos0— h 2 sinb , 

h 2 —*h 2 cosb + hisinb , 

f\ ~+f\ cos8 —/ 2 sin8 , (B3) 

/2-*/2Cos8+/ 1 sin8 , 
a\—*-a i -f 0 Z 8 , 
a 2~* a 2 + 3p6 , 

with 8 a function of p,z satisfying 

8=0atz=0, p= 0, r —► co . (B4) 

Hence in order for H to have a unique minimum, it is 
necessary to add to it a gauge-fixing term 

n g t= 4n fj P d P / 0 " ■ (B5) 

In the numerical work of Sec. IV.D, we shall use the fol¬ 
lowing choice of gauge-fixing: 

^ gf =^(a z a 1 +a p a 2 -^) 2 . < B6 > 

with ^ an arbitrary function which vanishes at the boun¬ 
daries. Minimization of H + Hg then picks out the 
member of the gauge-equivalence class of minima of H 
which satisfies 


0 z ai +0pa 2 — ^=0 . (B7) 

Since the differential equation for 8, 

(a z 2 +a^)8=^-(a z a,+a p a 2 ), o<z<a>, o<p<co, 

(B8) 

with the boundary conditions of Eq. (B4), gives a well- 
posed Dirichlet problem, the gauge condition of Eq. (B7) 
is always attainable and completely breaks the gauge de¬ 
generacy. 

Adding Eq. (B6) to the kinetic term for a 1(2 in gives 

z (0 Z « 2 - V i ) 2 + T (0 Z « i + - ^) 2 

= T l ) 2 ■+■ (dpfl i ) 2 + (0 Z ^2 ) 2- b (dpfl 2 ) 2- b V' 2 ] 

— i/s{d z a i 4-0 p fl 2 )4-0 z a j0pfl 2 — 0 z a 2 0pfl i - (B9) 

Although the final term in Eq. (B9) is a total derivative, it 
does not vanish when integrated over a finite domain 
0< z < z max>0<p<p max , and so should not be dropped in 
the numerical work. 

As discussed in the text, the minima of H in the sector 
with topological quantum number n are self-dual or anti- 
self-dual gauge fields. In the self-dual case, where 
— &jqf =B a \ the use of Eq. (Bl) gives the differential 
equations 
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I propose a simple set of equations for mean-field non-Abelian statics with c -number sources, at general inverse 
temperature^, working from the Euclidean path-integral representation of the Hamiltonian partition function. The 
problem of finding the background-field configuration, and the mean-field potential, for point sources can be 
reduced to a classical differential equation problem involving a suitably defined thermal effective action functional. 

As an application I study the interaction of a pair of static classical sources coupled to a quantized SU(2) gauge field, 
using the simplified model defined by keeping only the leading-logarithm renormalization-group improvement to the 
local Euclidean action functional. I prove that the mean-field potential in this model grows at least linearly with the 
source separation, giving a simple model for bag formation. The use of these methods to construct a Leading 
approximation to the qq binding problem in SU(3) quantum chromodynamics is discussed in two appendices. 

Appendix A describes the use of color-charge-algebra methods to generate an equivalent classical source problem, 
while Appendix B develops the properties of the transformation to a running coupling constant for which the one- 
loop renormalization group is exact. As a consistency check, in Appendix C I calculate the total mean-field ground- 
state energy, with source kinetic terms included, and show that it has the expected form. 


I. EFFECTIVE-ACTION FORMALISM FOR 
NON-ABELIAN STATICS 

I analyze in this paper the question of calculating 
the mean-field potential of classical point sources 
coupled to a quantized SU(2) gauge field, at zero 
and at finite temperature. This problem is of in¬ 
terest both in itself as a mathematical model, and 
because arguments based on the use of color- 
charge algebras suggest 1 that c-number source 
models should give a leading approximation to the 
problem of calculating the heavy quark-antiquark 
static potential in quantum chromodynamics. 

My analysis proceeds from a field-theoretic gen¬ 
eralization of the Euclidean (imaginary-time) ver¬ 
sion of Feynman’s sum over histories. In poten¬ 
tial scattering in one dimension, with Minkowski 
Lagrangian 

(1) 

the Euclidean sum over histories reads 




e~* H \x i )=N f f [dx] 


-s 


(2) 


On the left-hand side of Eq. (2) \x { ) and \x f ) are 
position eigenstates and H is the Hamiltonian, 
while on the right-hand side N is a normalization 
constant and S is the Euclidean action 


s - f 4Kf) !+ H ■ 1,1 

and j[dx ] denotes a functional integration over all 
paths x(t) obeying the boundary conditions x(0) 

= x if x((i)=x f . Setting x f — x t and integrating over 
initial states gives a formula for the partition func 
tion, 


Tr(e' s ")= f dx^Xile-'^Xi) 

= iV (dx { f 1 [dx\e~ s , (4) 

J \ 

where the paths in Eq. (4) now run from x(0) = Xi 
back to x{p) = x t . The generalization of Eq. (4) to 
a boson field theory containing spin-0 scalar fields 
and spin-1 gauge fields, denoted collectively by <f >, 
can be written as 

Z — Tr(e~ su ) = N f d<t>i f*‘ \d^>]e~ Ss . (5) 

On the left-hand side of Eq. (5), H is the Hamil¬ 
tonian operator defined from the stress-energy 
tensor 



while on the right, S B is the Euclidean action 






s > 



obtained by continuing ^ 00 from -1 to 1 in the gen¬ 
erally covariant form of the Minkowski Lagrangian 
density 2 




gen cuv 


- 1 =* 


00 



The trace on the left is understood to be evalu¬ 
ated in any canonical gauge, where the Hilbert 
space contains only physical states, while the path 
integral on the right again extends over periodic 
paths, with <f>(0)=<f>(&) = <i> i . The following obser¬ 
vation makes the form of Eq. (5) intuitively plausi¬ 
ble; For a field theory of scalars and spin-1 gauge 
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fields, the generally covariant Lagrangian density 
is linear in g 00f 

£ r +£«*,„ o) 

with £<o,i) independent of g 00 , whence from Eq. (8) 
we have 


£« = £< o) ~ £<i) , 
&B = “(^(O) + ^(i)) • 


( 10 ) 


But forming the Minkowski energy density T 00 t 



&£>* 

Sgoo 


— (—l)(«^<o) ” £<t>) ” 2 £(d — , 



we see that it is identical to the Euclidean Lagran¬ 
gian density £ s . Hence, the Euclidean action of 
Eq. (7) is afunctional representation of the opera¬ 
tor pH, just as in the potential theory case. A de¬ 
tailed justification of Eq. (5) can be obtained by a 
transformation from the conventional canonical 
formalism given by Bernard. 3 

I now apply Eq. (5) to an SU(2) gauge theory 
(with gauge potential b K and electric and magnetic 
fields E' and B*) coupled to a system of massive 
sources, and replace the source current density by 
its expectation, represented by a time-independent 
c-number external source f K . The equilibrium 
gauge field can be studied by keeping only the 
terms in H and S B which explicitly depend on the 
gauge field variables, 4 while omitting the source 
dynamics (hence, H in the following formulas is a 
truncated Hamiltonian, and not the Hamiltonian 
for a closed system). With this simplification, we 
have 


Z[f„ ] = Tr (e~ 8W ) = N 
where on the left 




( 12 ) 


(13) 


is an operator, while on the right 

s b- fjdt f d ' x (^ |(E' • E' + B' • B') - b u • 

(14) 

is afunctional. The mean-field potential 5 * 6 as¬ 
sociated with the static external source distribu¬ 
tion f K =(jo*0, j*J = 0), including self-energies, is 
defined as 


6 Vmean field J d*X b 0 * 6 


= Tr(e~ M J d 3 xb 0 • tfaj/nrie'*"). 


(15) 


Since 


f d*xbQ' &io = - f d*x 6)0 




(16) 


we can reexpress the mean-field potential directly 
in terms of the partition function 6 


= 6jln^fJ, (17a) 

=> V mesIl field{lnZ[fJ- lnZ[0]} , (17b) 



where I have fixed the constant of integration so 
that V mean field vanishes for vanishing source density. 
The problem of calculating ^[j M ] can be further re¬ 
expressed in terms of a classical differential equa¬ 
tion problem involving a classical background field 
c u and a vacuum effective action functional r[c,J. 
To do this, we write 

(18) 


and we introduce the time-independent classical 
background field $ M (#) induced by the time-inde¬ 
pendent external source distribution f K (x), 



gyff.l 

6f„(x) 


= Z' l N j db Bl J[ b, “ 4bj(i f 


e *s E 


(19) 


In this notation, the mean-field potential is given 

by 

^c. n f 1 e.d=-« / [fJ+ «40], (20) 

Defining the Legendre-transformed functional 
T[cJ by 

Mj AI ] = E[cJ - Jd*xCn(x)<} M (x), (21) 

a standard calculation 7 shows that 

6r[gJ ~ 

6c li (x) (22) 

Equations (18)—(22) are the principal result of 
this section. They show that the mean-field po¬ 
tential, for any inverse temperature /3, can be cal¬ 
culated by solving the classical differential 
equation problem of minimizing the functional 
I* -/d 3 # f M , withT the thermal effective action 
functional. 3 * 9 In the limit /3 — *>, where r reduces 
to the Euclidean vacuum effective action functional, 
this minimum problem reproduces the variational 
principle of the “Euclidean statics" method which 
I have advocated elsewhere, 1 but with some signifi- 
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cant differences in physical interpretation. 10 

According to Eqs. (28)-(22), the problem of 
studying the mechanism for confinement in the 
model discussed here can be rephrased in terms of 
the following two related questions. 

(1) Is there a physically reasonable class of vacu¬ 
um action functionals for which Eqs. (18)-(22) give 
a confining potential for static point sources? 

This question is answered in the affirmative in the 
following section. 

(2) Does the exact vacuum action functional cal¬ 
culated from the functional integral of Eq. (20) be¬ 
long to the confining class? 

The methods appropriate to studying these ques¬ 
tions are quite different. For a given functional 
or class of functionals r, the first question is one 
of classical analytic or numerical methods for in¬ 
vestigating partial differential equations. In the 
following section, Eq. (22) is investigated analyti¬ 
cally for the leading-logarithm approximation to 
the renormalization-group improved local effective 
action functional, for which T takes the simple 
form r[c )4 ] = r(E i •E i + B J *B i ); numerical methods 
_I 


of solution applicable to this class of functionals 
are currently being developed. 1 * 11 The second 
question is probably best studied by numerical 
Monte Carlo methods for doing the functional in¬ 
tegral. Since confinement is an infrared effect, 
it should suffice to establish the properties of T 
for slowly varying source currents j K and back¬ 
ground fields c K . In this case, appropriate lattice 
transcriptions of the functional integral of Eq. (19) 
may give quantitatively accurate estimates of the 
behavior of the continuum effective action. 

II. A SIMPLE MODEL FOR BAG FORMATION 

As an illustration of the formalism developed in 
Sec. I, I analyze the following simple model, ob¬ 
tained by keeping only the leading-logarithm re¬ 
normalization-group improvement 9 to the local 
Euclidean action functional 

r[Jj = Jrf 3 4£ efr -£?ff n ), (23) 

with 


£ eff = ■B.ffO*' 2 ) 

1 (E'• E'+ B'• B') l\ ,i. /E'-E' + B'-B'Y1 

= i ? L 1 ^ ln \ ? /J 

= -f6 0 i : ' 2 ln[F 2 /(eK 2 )] , 

=£,„(**) = -$b 0 <c*, 

&o= ^ C2[su(2)]= ^, 


= ; F 2 S e'-E' + B'.B' . 

\8x 2 / 


(24) 


r 


As has been extensively discussed in the litera¬ 
ture, 12 the minimum of £ e ff occurs at the nonzero 
field strength F=k. The source density f 0 is 
taken to be a pair of classical sources of equal 
magnitude, 

fo=QiS 3 (*“*i) + Q 26 3 (x -x 2 ) , 

\x\ — Xi \ =R y I Qi | — I Q 2 1 = Q • 

In analyzing the model defined by Eqs. (18)-(25), 

I make the physically plausible technical assump¬ 
tion that it suffices to minimize over potentials c K 
for which E * • E* is axially symmetric around the 
line joining the sources. 

The variational equations following from Eqs. 
(22)-(24) are 


X>,( € E') = f 0 , (26a) 

a*i m to i (iiB m ) = 3 i x(<LE k ), (26b) 

€ = €(E 2 )=-^f^-=|6 0 ln(F 2 /K 2 ) . (26c) 

°(2 * ) 

Acting with on Eq. (26b) and using Eq. (26a) 
gives the constraint 

O^xfo; (27) 

hence, if we write 

c°(x) = c(x)c(x), c • c = 1, (28) 

we have 

c(x l )xQ l = c(x 2 )xQ 2 = 0, (29) 
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which implies that 


c 0 (x) • £(x) = c(x)j(x ), 

(30) 

with either 


j{x) =±(?[6 3 (x - xi) - 6 3 (x - xz)] 

(31a) 

or 


/(x) = ±<?[6 3 (x-xi) + 6 3 (x-x 2 )] . 

(31b) 


From here on it will be convenient to work in the 
gauge with c(x) = $, in which we have 

jr2 = ^i^-J + c 2(^ x 5)2 + B'.B < . (32) 


We are now left with the purely Abelian problem 
of minimizing w[c ], to which we apply simple flux 
conservation estimates introduced by ’t Hooft. 13 
Varying W, we get the flux conservation equation 

~TC, (36) 

with 

c(E i E i ) where E^E^k 2 , 

0 where E*E *« k 2 . 


I now will show that the minimization of W with 
respect to the vector potential c* can be carried 
out explicitly, with the result 

min = f - ««<*>} . 

i ”[(5 r ) ! ]* 0tOr (^) ! *' ! ' lM> 

lor (jf) > . 

To prove Eq. (33), we note that J&efrCF 2 ) — £efr(x 2 ) is 
a monotonic decreasing function of its argument 
for 0 ^F 2 < k 2 , and is a monotonic increasing func¬ 
tion of its argument for k 2 ^F 2 . Hence, W is min¬ 
imized by the following choice of vector potential, 

c^z&aipfZ), p = (x 2 + y 2 ) 1/2 , 


a{p,z)=( dp'A(p',z), 

SO 

A(p', z) = 0 , where ^ > 


(34) 


A(p'. *)=[** -(^)y , Where (gr) , 

which gives (with <j>* the azimuthal unit vector) 
c J xz = 0 , 

B'= -z<£'A(p, z) =» 


F = max 


m ■-]• 


(35) 


and from which Eq. (33) immediately follows. 

(Note that it is at this point in the argument where 
the axial symmetry assumption has been used.) 
What is happening is that wherever the color-elec¬ 
tric field is less than k in magnitude, a color-mag¬ 
netic field fills in to bring the total squared field 
strength up to the value k 2 at which £ #11 is mini¬ 
mized. 


Evidently, wherever the E field strength is less 
than k , the D field vanishes. This fact can be 
exploited to get a lower bound on V m «n field and an 
upper bound on which by Eqs. (33), (36), and 
(37) can be rewritten as 

^at equilibrium — f d 3 x[£ t ff(EW) - EW] , (38) 

J D> 0 


with the integral extending only over the region 
where B i is nonvanishing. Dividing the integrand 
of Eq. (38) by D=(D i D i ) i/2 , we get 

ZT 1 [E i D i - £*!■]=«[$■«< + i/Ml (39) 


with 14 (in the domain where D > 0) 
u = (E j E j ) U2 /k> 1, 


fW = 


tr -1 

_ ■ fj 

u lnw 


>1 . 


(40) 


To turn Eq. (38) into a meaningful inequality, it 
is necessary to exclude the divergent self-energies 
of the charges by defining to be the contribution 
to Eq. (38) coming from the exterior of small 
spheres of radius r centered on the charges. We 
then get 


W t ^-kI t , 

Vmean field ^ Kl r 



A = domain 


D > 0 
\x -x t 
\x-x 2 






(41) 


The final step of the argument is to write d?x 
= dldA with l the length along the flux lines of D* 
and dA an element of area perpendicular to the 
flux lines. Denoting the flux by we have 


dAD = d\$\ ^d$ , 

[d 3 xD> r a* <(*) ^ $ tot 

•'A •'A 


(42) 


with l min the length of the shortest flux line. For 
the charge orientations of Eq. (31b), where the 
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flux lines terminate at infinity, we have 


*«» — 2 Q , 

Jmin — 00 t 


(43) 


and I T is infinite . 15 For the charge orientations of 
Eq. (31a), the flux lines run from the positive to 
the negative charge, giving 


♦tot —Q > 


l min —R - 2r, 

W r ^-KQ(R-2r), 

>kQ{R- 2r), 

which proves that the mean-field potential in¬ 
creases at least linearly for large R . In the limit 
of small R t a simple calculation shows that 

Vn, “" f| e' d - s elf-energles = - ^ 1+b(j / ln (i/ Rp _) > 

(45) 

as expected from a leading-logarithm renormaliza¬ 
tion-group improved formalism. Hence, the sim¬ 
ple model of Eqs. (18)—(25) interpolates smoothly 
between asymptotically free behavior at small 
source separations, and ‘'baglike, ,,le confining be¬ 
havior at large separations. The above analysis 
readily generalizes to the full renormalization- 
group improved local effective action functional , 9 
provided that the effective action minimum remains 
at nonzero Euclidean field strength k . 17 More gen¬ 
erally, the results obtained above support the con¬ 
jecture that a bag will form for large source sepa¬ 
rations, irrespective of the functional form of 
r[cj, whenever the minimum of T occurs at po¬ 
tentials c K with nonvanishing mean-square field 
strength. 
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APPENDIX A: CONNECTION WITH SU(3) 
QUANTUM CHROMODYNAMICS 

I briefly describe in this appendix how the meth¬ 
ods of the text, together with the color-charge- 


algebra analysis of Ref. 1, can be applied to give 
a leading approximation to the qq binding problem 
in SU 3 quantum chromodynamics (QCD). In the 
static quark limit , 1 the gluon source current for 
the qq binding problem in QCD is 


} A, = 0 , 

; a0 = O 3 (x-*i)+<?j6 3 (x-x 2 ) 


(Al) 


with Q* and Qj the quark and antiquark color- 
charge matrices. As discussed in Ref. 1, the 
gluon source current is now a 9x9 matrix opera¬ 
tor acting on the nine-dimensional Hilbert space 
spanned by the q , q color states. The analysis of 
Sec. I can be extended to this case by including a 
factor j tr ff tr f in all formulas and symmetrizing 
all inner products, so that Eq. (12) becomes 


= T t^trjTr^e' 8 ") 

tr, tr,- (f db A , jf^' dlb A ]e~ s ^ , (A2) 
where on the left 




H = J \(E a ‘E ai +B ai B ai ) 

~\(b A j A +j A b A )^ (A3) 


is an operator in the product of the q,q and gluon 
Hilbert spaces, while on the right 

s^f'dtj d’xfp- 1(E ai E ai + B ai B ai ) 

-\(b A K + KK)] (A4) 

is a functional in its dependence on the gluon vari¬ 
ables, but is still a matrix operator in the finite 
dimensional q,q color Hilbert space . 19 Using 
cyclic invariance of the trace, the steps leading to 
Eqs. (18)—(22) go through just as before, giving 

(AS) 

V mean field = - W[j A ] + W[0 A ] , (A6) 

6W\j A U-} tr,tr ff (f d 3 xc A (x)6J A (x)j , (A7) 

Wlj u ] = r[c ^ 1 — T tr, tr j d 3 x c A (x)j A (x)J , (A 8 ) 

6 r[c»]= 7 tr,trj ^J d 3 x6c A (x)j A {x)j . (A9) 

Note that Eq. (A7) defines c* to be a potential 
which, like the source current j* f is matrix valued 
in the nine-dimensional qq Hilbert space. To con¬ 
struct a QCD analog of the analysis of Sec. n, we 
must calculate a leading approximation to the ef¬ 
fective action. In the classical limit, the effec¬ 
tive action density is given by 19 
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F 2 = { tr # tr- (E A ‘E Ai + B A >B A ‘), 
^ A1 = -~i-c A " + iP;(c J ,c 0) , 

BA ’ = * m (is cAl -2 P / (c *> c '))> 

P a (u,v)J-/ abc (u b v c + v c u b ). 


(A10) 


The renormalization-group improvement of Eq. 
(A10) is obtained by taking g* to be a running-coup¬ 
ling function of the argument x giving, in lead¬ 
ing-logarithm approximation (cf. remarks in 
Ref. 28), 

r[c?]= fd\[£. tt (F 2 ) - £. {1 U 2 )], (All) 

^. tt (P S )=\b a F 2 ln(F 2 /e K 2 ), K 2 = J ^- e - i/lb ^ ) , 

(A12) 

‘•-17f c ' [su<3), -57 • 


To carry out the remainder of the analysis of 
Sec. II, we must reexpress Eqs. (A8)~(A12) in 
terms of number valued, as opposed to matrix 
valued, source density and gluon variables. To do 
this, let us recall 1 that the qq color-charge alge¬ 
bra is spanned by a basis w £,. . . , which 
satisfies the SU(2)xU(l) outer product algebra 


P “ t n ^ — 1> 2, 3 

(A13) 

w> 4 ) = 0, 

is orthonormal in the color-trace inner product, 


i tr # trj(w^OTj) = -jf 6 rs , (A14) 

and over which the quark and antiquark color 
charges have the expansions 

+^J~w£, 

1 (A15) 

- w 2 -~ 2 ~ ^ * 

[As a check, we note that tr ff trj(Q*Q*) = (8/27) 
x (18/4) =4/3. J 

Expanding q£, c A , E Ai t and B Ai over the 

basis w£, with c-number coefficients, reduces the 
variational problem 

tr a tr ? ^ f d 3 x&c£{x)j£(x)j = 0 (A16) 

to a classical SU(2)xU(l) problem, analogous to 
that discussed in Sec. II. According to Eq. (A15), 
the U(l) effective q and q charges are opposite in 


sign, while the SU(2) effective charges have equal 
magnitudes. Hence, the quark and antiquark ef¬ 
fective charges can be made antiparallel by an 
SU(2) gauge transformation, 20 leading to a solution 
with the same form as that obtained from Eq. (31a) 
in Sec. II, apart from the substitutions 21 ’ 22 

Q-(|) 1/2 , 

11 (A17) 


APPENDIX B: TRANSFORMATION OF THE 
RUNNING COUPLING CONSTANT TO EXACT 
LEADING-LOGARITHM FORM 


As already noted, the argument given in the text 
for a confining mean-field potential generalizes to 
the full renormalization-group improved local-ef¬ 
fective-action functional, provided that the effec¬ 
tive-action minimum remains at nonzero Eucli¬ 
dean field strength. When expressed in terms of 
the 0 function, this condition translates 9 into the 
requirement that the integral 


r JsL 

j p(g') 


(Bl) 


should be convergent at its upper limit. Assuming 
convergence of the integral in Eq. (Bl), I show in 
this appendix that one can make a nonanalytic 
transformation to a new running coupling constant 
g R for which the one-loop renormalization-group 
structure is exact. The transformation is simply 


(with a R = g l R) a^g 2 ) 

1 , . C" da' 


a 


R 


= ~7 b o 


f da 

l ’ 


(B2) 


where 0=^/3 has the power-series expansion [with 
the coefficients given for SU(3) QCD with N f light 
quark flavors 23 ] 


^(a)=-[{6 0 Q! 2 + 6 1 Q! 3 + O(a 4 )J, 

(B3) 

6„ = ^T(11-K ) , 


Substituting the expansion of Eq. (B3) into Eq. 

(B2), we learn that for small running coupling con¬ 
stant, 

26 

a R =■ a - #3 2 (lna + const) + * * * , a — ~r L (B4) 

°o 


and so — 0 when a — 0. On the other hand, the 
convergence of Eq. (Bl) implies that 



(B5) 


and so a R - 00 as a - 00 • Hence the transformation of 
Eq. (B2) gives a nonsingular mapping from the half 
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(B2) gives a nonsingular mapping from the half 
line 0 ^ ot < to the half line 0 ^ ot R < The re¬ 

normalization-group structure in the new variable 
a R is determined by ?), given by 

j3 R (a R ) = p(a) j^-=p(a)(-a R 2 ) 

= -\b 0 <*r, (B6) 

and so has exactly one-loop form. 

A particularly interesting case of Eq. (B2) is ob¬ 
tained when jJ(a!) terminates at two-loop order, 

P(a) = -(£ b 0 <* 2 + b\ a 3 ), (B7) 

a situation which can always 24 be achieved [pro¬ 
vided Eq. (Bl) converges 26 ] by an analytic trans¬ 
formation of the running coupling constant [i.e., 
by a rearrangement of the perturbation series 
which does not introduce coupling-constant logari¬ 
thms], In this case, Eq. (B2) can be explicitly 
integrated to give the transformation 

~ a [ ln (^) + ln(1 +aa) ]» < B8) 

which for small act can be developed into a series 
expansion 



The series of Eq. (B9) can be inverted by sub¬ 
stituting 

a = 0^(1+ £»*/), (BIO) 

which after some algebra gives 


inverse transformation given by Eqs. (B10)-(B13), 
yielding a series expansion in powers of a R and 
ln(aaj*). 2e In this series, the terms of order oir 
contain only powers 0,1,. . ., « - 1 of ln(aa A ). 

An important property of the one-loop running 
coupling ct R is that it simultaneously maximizes 
the domains of analyticity of the renormalization- 
group improved local effective action £ #fl (.F 2 ) and 
of the j3 function /3(a). For a general running coup¬ 
ling a(f), the renormalization-group Improved lo¬ 
cal effective action density is given by 9 

£ llL 

•"' ' — 2 a(t) ’ (B14) 

t = { In (F 2 /e K 2 ). 

Substituting the one-loop running coupling a Rf 



gives 

| boF 2 ln(i^/W), (B16) 

as used in Eq. (23) of the text. As a function of 
complex F 2 , Eq. (B16) is analytic apart from a 
cut in the F 2 plane running along the negative real 
axis from F 2 = 0 to F 2 = -«>. Such a timelike cut 
is expected from unitarity, and so £ #ff R has the 
maximum allowed analyticity domain in F 2 . To 
study the analyticity properties of the general 
£ e „tF 2 ), let us calculate the derivative 

d(F) £ ."(- F2) = 2 + 8 ‘ (B17) 


n“i n“l 

(-ga.)"(l + a,/-)" 



(-a*/)" 

n 


(B11) 


Substituting 

/=£ “«/» (B12) 

*■0 

into Eq. (BIO), and equating the coefficients of like 
powers of ct Ry gives explicit expressions for the 
coefficients f k as polynomials in ln(aa A ), 

/ 0 = aln( a a fi ), 

ft =/o 2 + a/o - a\ . . . . (B13) 

Hence, starting from any convenient calculation- 
al scheme (for example, minimal subtraction in 
dimensional regularization), the QCD perturbation 
series can be reexpressed in terms of a R by a 
two-step transformation*. First, one transforms 
to a running coupling constant for which /3(a) is 
given by Eq. (B7), and then one substitutes the 


From Eqs. (B2) and (B15), we get 

d / 1 \ fi (a) 
dt\a(t)f cF 9 

and so 


d(F 2 ) “ 2 a(t) 


1 Platt)) 
4 a(t) 2 


(B18) 


(B19) 


We have seen above that at the spacelike F 2 
where t vanishes, both a R and a become infinite. 
Hence, d£ tU (F*)/d(F 2 ) is singular at spacelike F 2 
unless j3(a)/a 2 is bounded as a becomes infinite. 
This is possible with /3(a) an entire function [which 
corresponds to the maximum allowed analyticity 
domain for the function /5(a)] only if /3(a)/a 2 is a 
constant. Hence, the one-loop running coupling 
gives the maximal analytic extension of the renor¬ 
malization-group substructure of QCD. 27 This re¬ 
sult suggests that the one-loop model of Sec. II 
may give a universal, leading, semiclassical ap¬ 
proximation to the confinement problem. 28 
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APPENDIX C: TOTAL GROUND-STATE ENERGY 

As a consistency check on the formalism of Sec. 
I, I show here that when source kinetic terms are 
included, the ground-state expectation of the total 
Hamiltonian for a system of two well-localized 
sources, in mean-field approximation, is 

<01 H t |0> = V mein (x u x 2 ) + recoil terms + constant. 

(Cl) 

To most simply parallel the discussion of the text, 
I consider only the case of massive distinguishable 
fermion sources, with classical 29 SU 2 charges, for 
which H t has the form 

H t = J d 3 x T°° = 3C +3C kln , 

*=f**(ji& + ~ S ° ■ *•) ’ ( C 2 ) 

^0 = ^iQi^i + ^ 2 Q 2^2 J 

3Ciiin —J d 3 x(^i*Mi + ^2* 8 0^2) • 

Taking the ground-state expectation of Eq. (C2), 
we have 

<o|ff r |o>=<o|3e|o>+<o|3e kta |o> . (C3) 

To apply mean-field theory, one assumes a Har- 
tree factorization of the ground state (g= gluon, 

5 = source) 

| 0 ) = | 0 >,|o>, 

with 

,< 0 | 0 >, = s < 0 | 0 > s =l, 

• <0|i’o|0) # =Qi6 s (x-x 1 ) + Q 26 3 (x-x 2 ) = k ■ 

Hence, for <0| 3C| 0) we get 

<o|3e|o> = ,<o|tf|o>,. 

(C 6 ) 

H=f d 3 x(p-|(E^E^ + B^BO-bo-j;), 

which involves the truncated Hamiltonian intro¬ 
duced in Sec, I. Since in the limit /3 — only the 


6 Xi Vmehn field (x lr X 2 ) = J Sx C 0 (x) * “ X t ) 


ground state contributes to the partition function, 
from Eqs. (12), (18), and (20) of the text and Eq. 
(C6) we learn that 


< 01 3C 10 } = wlj 0 ] = - V me an field (x t , x 2 ) + cons tant 

■ (C7) 

To evaluate the second term in Eq. (C3), 
the source field equations of motion 

we use 

iB 0 jpi — Qi^t • b 0 (x) + recoil terms, 

? 8 0 ^ 2 =Q 2^2 4 b 0 U) + recoil terms , 

(C 8 ) 

which, together with Eqs. (C4) and (C5), give 

< 01 3C klo | 0 > = ,< 0 | J d 3 xb 0 (x)-£(x)| 0 ),. 

(C9) 

The right-hand side of Eq. (C9) can be reexpressed 
in terms of the /3 — °» limit of the partition function 
and then further rewritten using Eq. (19) of the 
text, giving 

,< 0 |/^b 0 . i 0 W|O>,-lim lnZ[xj„]jJ 

X-l 


(ClOa) 

= J d 3 x c„ (x) • j^(x) . 

(ClOb) 

Hence, we have 


( 0 1 3 C kl „ I 0 > = Co(xf) • Qi + c 0 (* 2 > • Q 2 , 

(Cll) 

and we can complete the proof of Eq. (Cl) by show¬ 
ing that 

Co(xi) * Qi = V mean fieidUi, * 2 ) + constant, 

(Cl 2a) 

C 0 (x 2 ) * Q 2 = V mean fieidUi, x 2 ) + constant. 

(C12b) 

To prove Eq. (C12a), let us write c 0 (x) in the 
form 

c 0 {x) = cb A) {x t xu x 2 ) + c 0 <B> (x, Xi) 

(C13) 

with Cq B) chosen so that c 0 U) is regular near x=%\ 
and satisfies 

b x c^ A) (x, x u x 2 )]\ x . h =0 . 

(Cl 4) 


Using Eq, (15) of the text, in the /3 — «s limit, we 
get 


(C4) 

(C5) 


f <fxcb A) (x,xux 2 ) •Qibi^x-xi) + f tPxCo B} (x,xi)*Qt& Xi & 3 (x-xt) . (C15) 


The first term on the right of Eq. (C15) can be rewritten, by use of Eq. (Cl4), as 


6 *i/ rf3x Co j4) (x,Xi,x 2 )-Qi6 3 (x-xi) - f d*x[d Xi cb A) (xj x 1 ,x 2 )]*Qi6 3 (A:-x 1 ) = 6 :ei [co j4) (xi,Xi,X2)' Qi] , (C16) 
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while the second term on the right of Eq, (C15) is 
independent of x 2 , Hence, Eq. (Cl5) implies 

V mean field (#1 j X 2 ) ~ Ql * Cofa'l) ~ ^l(Xi) "b l^ 2 (x 2 ) j (Cl7) 

with V t independent of x 2 and V 2 independent of x\ . 
But since translational, rotational, and local SU 2 
gauge invariance imply that both terms on the left- 
hand side of Eq, (C17) depend only on the relative 
distance xi - x 2 , the terms Vi and V 2 on the right 
must be constants, proving Eq. (C12a). A similar 
proof, using 6^, gives Eq, (C12b), 

According to Eqs. (Cll) and (C12), a consistent 
mean-field approximation to the source wave equa¬ 
tions is given by 


2913 

iB 0 jpi = c 0 (xi) + recoil terms 

= [V r m«nfiew(xi, x 2 ) + constant]^ + recoil terms , 
id 0 ip 2 = c 0 (x 2 ) • Q 2 ^ 2 + recoil terms 

= [ V mn Add(xi, x 2 ) + constautfy 2 + recoil terms , 

(Cl 8) 

These are just the usual one-body wave equations 
obtained from the potential theory of two sources, 
interacting through a potential Vmeanfieid(xi,x 2 ), in 
the limit that the sources are well localized. 
Hence, the formalism of Sec. I reproduces all of 
the expected potential theory results- 30 


*R. Giles and L. McLerran, Phys. Lett. 79B , 447 (1978); 
S. L. Adler, Phys. Rev. D 17, 3212 (1978); S. L. Ad¬ 
ler, i&id. 20, 3273 (1979). For a review, see S. L. 
Adler, in The High Energy Limit , proceedings of the 
of the 18th International School of Subnuclear Physics, 
“Ettore Majorana^, edited by A. Zichichi (Plenum, 

New York, to be published). Further references are 
given here. [In QCD, the quantized gauge field is the 
underlying SU(3) gauge field, while the effective c- 
number sources lie in an unquantized, overlying SU(2) 
xU(l) gauge field. See Appendix A for a detailed dis¬ 
cussion.] 

2 I define £^ ncov to be a scalar, so that 
S U =f d*X r “ V . 

3 C. W. Bernard, Phys. Rev. D £, 3312 (1974). I thank 
L. Dolan for bringing this reference to my attention. 
For a discussion of the generalization of Eq. (5) to the 
case when fermions are present, see D. J. Gross, 

R. D. Pisarski, and L. G. Yaffe, Rev. Mod. Phys. 53, 

43 (1981). Equations (12)—(14) remain valid for mas¬ 
sive fermion sources at rest. 

4 The functional measure in Eq. (12) is understood to in¬ 
clude the exponential of the gauge-fixing term, and 
the compensating Faddeev-Popov determinant (which 
can be represented as an additional functional Integral 
over ghost fields). When the kinetic terms and func¬ 
tional integrals for the source fields producing j M are 
included, S E is properly gauge invariant, justifying 
use of the Faddeev-Popov functional measure. In the 
situation studied in this paper, where the only sources 
present are Infinitely massive sources at rest, the 
source current can always be made time independent 
by an appropriate time-dependent gauge transforma¬ 
tion. In such static-source gauges, the source func¬ 
tional integral can be omitted, leaving the expression 
for the partition function given In Eqs. (12)-(14), with 
Ip =(To*0, J f = 0 ) and with f 0 time independent. The 
static-source formalism is no longer invariant under 
all gauge transformations, but remains invariant under 
the subclass of time-independent gauge transforma¬ 
tions. Since in a stationary state we have d(&j)/dt 
-dCj/dt = 0 , we are assured that the mean-field po¬ 


tential can be calculated from the expectation of the 
scalar potential (Bq) s c 0 . 

6 In a linear system the incremental potential 6 Vmean Held 
can be defined as either (f d 3 x B^ • 6 ^ 0 ) » which gives the 
mean energy change when an increment in source den- 
sity63 0 is brought infrominfinity, oras (6fd 3 x§E J • E J / 
g 2 ), but in general these expressions are not equiv¬ 
alent: only the former can be used for nonlinear 
systems and is renormalization group invariant. When 
we study the nonrelativistic motion of the sources, the 
leading coupling of the sources to the gluon field in¬ 
volves only the values of Bq at the source positions. 
Hence, an average potential calculated from (Jd 3 xb 0 
• 6 ~jo) is the correct starting point for a mean field, 
potential theory analysis of the source motion. See 
Appendix C for further details. 

e From Eq. (17) we can see that the zero temperature 
(/3 oo) mean-field potential is not the same as the 
static potential calculated from the Wilson loop, which 
in the notation used here is 

V static =lim {W[ill - W[ 0 ]}. 

The physical interpretation of Viatic is that it is the 
ground-state eigenenergy of a static qq system. [See, 
for example, the derivation of the Wilson-loop formu¬ 
la given by L. S. Brown and W. I. Weisberger, Phys. 
Rev. D 20 , 3239 (1979).] Since eigenenergies are de¬ 
fined only by continuation back to Minkowski space¬ 
time, it is not surprising that an imaginary source 
occurs when we formally represent IWic by a Eu¬ 
clidean path integral. The motivation for introducing 
V m ean field is that it can be calculated strictly within the 
Euclidean formalism. In a perturbation expansion in 
the external source strength j M , the mean-field and 
Wilson-loop potentials agree in order (j^) 2 , but differ 
beyond this order. In the Abelian case, there are no 
terms of higher order than (j^) 2 , and so the two form¬ 
alisms give the same static potential. In the non-Abel- 
iancase, the formalisms are inequivalent, and give 
different formulations of the confinement problem. It 
appears that the simple effective action approach to 
confinement developed in this paper can be obtained 
only by using a mean value formalism. I wish to thank 
R. F. Dashen for several discussions of these points. 
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(See also Appendix C and Ref. 30 below.) 

7 E. S. Abers and B. W. Lee, Phys. Rep. 9C , 1 (1973). 

8 Since r Is not gauge invariant, the gauge-fixing condi¬ 
tions used in solving for c p must be chosen to be com¬ 
patible with the gauge noninvariance of r. 

9 The use of an effective action in this context was first 
suggested by H. Pagels and E. Tomboulis, Nucl. Phys. 
B143, 485 (1978). 

l0 In particular, c M is not to be used as Minkowski space 
Cauchy data and time evolved, as was implied in Ref. 

1. In canonical gauges, the physical interpretation of 
c u is that it is the expectation of , and is Minkowski 
time independent. Also, in Ref. 1 I used the incorrect, 
renormalization-group noninvariant formula for the 
potential (see Ref. 5 above). 

n S. L. Adler and T. Piran, in High Energy Physics — 
1980, proceedings of the XXth International Conference, 
Madison, Wisconsin, edited by L. Durand and L. G. 
Pondrom (AlP, New York, 1981), p. 958. 

**I. A. Batalin, S. G. Matinyan, and G. K. Sawidi, Yad. 
Fiz. 26 , 407 (1977) [Sov. J. Nucl. Phys. 26, 214 (1977)1; 
G. K. Sawidy, Phys. Lett. 71B , 133 (1977); H. Pagels 
and E. Tomboulis, Ref. 9. See J. Ambjorn and P. Ole- 
sen, Nucl. Phys. B170, 60 (1980) for a discussion of 
corrections to the local effective action approximation, 
and extensive references. Many of these references 
consider only constant color Reids, which has tended 
to obscure the fact that the gauge theory vacuum lead¬ 
ing to the effective action of Eq. (23) is Lorentz in¬ 
variant, with fl)| J> M |0) = E *| 0) = (0|B*|0)=0 in the 
absence of sources. The vanishing of these expecta¬ 
tions is reflected in the fact that the minimization of 
r[c M ] of Eq. (23) leads to a partially indeterminate 
variational problem, solved by any random color-elec¬ 
tric and magnetic fields 2 * and B* satisfying 2 * * 2 * 

+ B* • B* =k 2 . When sources are added, the variational 
problem of Eq. ( 22 ) is fully determinate only in the in¬ 
terior of the “bag”, where D > 0, but remains parti¬ 
ally indeterminate (in the sense described above) in 
the exterior region where D =0. As a result, one 
cannot argue that there are nonvanishing gluon gauge 
potential or gauge field vacuum expectations by con¬ 
sidering the limit of the exterior solution as a weak 
source is turned off; this line of reasoning applies only 
when the variational problem in the presence of sources 
is fully determinate in all of space. 

13 G. *t Hocft, in Recent Progress in Lagrangian Field 
Theory and Applications , proceedings of the Marseilles 
Colloquium, 1974, edited by C. P. Korthes-Altes et 
al . (Centre de Physique Theorique, Marseilles, 1975). 

l 4 Toprove/(i<) ^1, let$*/—1, <£ = 2 /(m Inn 2 ). Asimple 
calculation shows that ip satisfies the differential equa¬ 
tion 

+<pp- - ftg - j ^ 7 > 0 for u > 1 . 
du u^lnu* 

Integrating up from u =1 (where ip =0), this implies 
that ^ is positive for u > 1 . 

1B In this case, one cannot neglect the surface term (as 
was done in the text) in the integration by parts leading 
from Eq, (33) to Eq. (38) but rather, one must work 
directly from Eq. (33). For the charge orientations of 
Eq. (31b), simple estimates (see H. Pagels and 
E. Tomboulis, Ref. 9) show that W has a positive in¬ 


finite infrared divergence at equilibrium, correspond¬ 
ing to a vanishing partition function£. Hence, the 
configuration with nonvanishing color flux at infinity is 
automatically excluded from the physical spectrum. 
Note that when the correspondence with QCD is made 
as in Appendix A, charge-conjugation symmetry or 
permutation symmetry will select either the effective 
charge orientations of Eq. (31a) or those of Eq. (31b), 
but not both. In the qq problem, the averaged poten¬ 
tials cf are charge-conjugation odd, selecting Eq. 
(31a). For the qq' sector, the averaged potentials c f 
are symmetric under permutation of the sources, se¬ 
lecting Eq. (31b), and giving a vanishing partition func¬ 
tion contribution. This is the expected result for a 
system which cannot be in a color singlet state. 

I 6 A. Chodos, R.L.Jaffe, K. Johnson, C. B. Thom, and 
V. F. Weisskopf, Phys. Rev. D 9, 3471 (1974). 

i7 If JBefr attains its minimum at F®0, and vanishes there 
as F a , a simple estimate shows that the asymptotic 
behavior of the potential is V meui field # 

This confines for a > 3, but gives a linear potential 
only in the limit a —(See H. Pagels and E. Tom¬ 
boulis, Ref. 9.) 

18 I am assuming a standard canonical quantization, in 
which only the constrained components of bf are ma¬ 
trix valued. Hence, the differentials dbf t and d [bf ] 
in Eq. (A 2 ) are ordinary numbers. The assumption 
of canonical quantization is consistent with the conclu¬ 
sion reached at the end of the analysis, that the mean 
matrix-valued potential cf is Abelian apart from a 
time-independent gauge transformation. This means 
that cf is nonzero, while cf contains only a single 
spatial degree of freedom. The two spatial degrees of 
freedom in bf which are orthogonal to cf can then be 
canonically quantized by the standard Dirac bracket 
procedure. For example, taking the q and q to lie on 
the* axis, the gauge transformation rotating the q 
effective charge to be antiparallel to the q effective 
charge can be chosen to depend on z only, giving cf , 
cf* 0, but cf t y =0. This matrix-valued structure in 
the potentials is compatible with axial-gauge quantiza¬ 
tion. 

l9 The classical limit of the effective action can be read 
off from Eq, (A 2 ) by approximating e~ SB by 

e- 8 Bnl-S E , 

and so is given by the field-strength terms in Eq, (A4), 
acted on by the quark color trace-J tr ff tr 3 , 

20 Color-charge-algebra solutions of this form have been 
discussed by I. Bender, D, Gromes, and H, J. Rothe, 
Z.Phys. 5C, 151 (1980). 

21 In the formulation of Ref. 1, there arose the issue of 
how to fix the integration constants K^ t) in the La¬ 
grangian for the overlying algebra. The present analy¬ 
sis corresponds to taking the K t 's all equal, which 
differs from the rule which I had originally postulated, 

22 The effective Lagrangian analysis of theqq binding 
problem has the following Feynman diagram interpre¬ 
tation: Working in Coulomb gauge, the effective La¬ 
grangian for the Coulomb gluons arises from Feynman 
diagrams which may be characterized as a central 
“blob, ” containing one or more closed gluon loops, 
from which n ^ 2 Coulomb gluons emerge. The effec¬ 
tive Lagrangian contribution to qq binding is obtained 
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by stringing such "blobs” between q and q lines, 
attaching each emerging Coulomb gluon to either the q 
or the f line. This procedure yields the usual re¬ 
normalization-group improved one- Coulomb-gluon ex¬ 
change graph, and its nonlinear generalizations, which 
are responsible for the weak field-strength modifica¬ 
tion in the effective action which leads to confinement* 

23 Previously in this paper, I have taken N f to be 0, 

m G. 't Hooft, in The Whys of Subnuclear Physics , pro¬ 
ceedings of the International School of Subnuclear 
Physics, Erice, 1977, edited by A. Zichichi (Plenum, 
New York* 1979), pp. 943-971. 't Hooft restricts his 
discussion to the case of analytic running coupling 
constant transformations. Nonanalytic transforma¬ 
tions similar to those of Eq. (B 8 ) have been recently 
investigated by Y, Frishman, R. Horsely, and 
U. Wolff, Phys, Lett, (to be published) and Weizmann 
Institute report (unpublished). 

2 b N. N. Khuri and O. A. McBryan, Phys. Rev. D 20, 881 
(1979). 

2fl Simi lar coupling-constant logarithms have been found 
in three-dimensional QCD (which is related to the 
behavior of the four-dimensional theory at high- 
temperature phase transitions) by R. Jackiw and 
S. Templeton, Phys. Rev. P 23, 2291 (1981), and in 
chiral perturbation theory by H. Pagels, Phys. Rep. 
16C, 219 (1975). In using the modified expansion to 
evaluate Euclidean Green's functions, it may be im¬ 
portant to keep the — ie in the Feynman denominators 
even after continuation to the Euclidean section. This 
gives a definite prescription for circling the spacelike 
pole in ot R and chooses a definite branch of the space¬ 
like cut in lna A . The rearranged power series will in 
general contain imaginary contributions to the Euclid¬ 
ean Green's functions in each order, but (under the 
conventional assumption that the Euclidean Green's 
functions in QCD are real) these will cancel when the 
entire series is summed. Hopefully, the rearranged 
series will give real contributions to the Euclidean 
Green’s functions which converge fast enough to give 
useful estimates (as, for example, is the case In the 
Wilson-Fisher expansion in critical phenomena when 
applied in 3 or 2 dimensions). Good convergence of the 
rearranged series would be an indication that the infra¬ 
red behavior of QCD is effectively controlled by a weak 
coupling regime. 

27 There appears to be a close analogy between trans¬ 
formations of the radial coordinate in the theory of 
Schwarzschild black holes In general relativity, and 
transformations of the running coupling constant in 
QCD, with the concept of maximal analytic extension 
playing a key role in both cases. In both theories the 
natural coordinate (or coupling) in which one does cal¬ 
culations does not give the maximal analytic exten¬ 
sion. Moreover, the transformations which yield the 
maximal extension have very similar functional form: 
Eq. (B 8 ) relating ot R ~ i to o " 1 closely resembles the 


transformation r* = r + 2Af In \r/2M -11 which is used 
to remove the coordinate singularity at the horizon in 
black hole physics. 

28 A second interesting analogy is the fact that the leading 
logarithm effective action 

roc f d 4 *F 2 (x) lnF 2 (x) 

has the same structure as the quantum-mechanical 
entropy 

S - —fe B Trplnp, 

which has many special and useful formal properties 
[see A. Wehrl, Rev. Mod. Phys. 50, 22 l (1978)]. Per¬ 
haps this analogy can be exploited to understand the 
thermodynamic aspects of hadronic behavior. As a 
simple application of the entropy analogy, suppose that 
in the discussion of Appendix A we had applied the re¬ 
normalization group improvement argument locally in 
theg,^ color space, thus obtaining 

Z ef f (/ 2 ) =£*>otr, tr—[ / 2 In (/Vex 2 )], 

f 2 = -$(E Ai E Ai + B A, B Ai ), 

instead of Eqs. (A10) and (All), which in terms of / 2 
read 

£ e fr( F2 ) = 3 &o(tr ff tr-/ 2 J In (tr^tr^/ 2 /** 2 ). 

Since l e ff yields the same stress-energy tensor trace 
anomaly as does JC e ,r , it is also an acceptable form for 
the effective action density. By some simple algebra, 
we find 

h it (/ 2 ) -C.ff (F 2 ) = 5 Jo(tr < ,tr ir / 2 )tr < ,tr~(p In p), 

P = / 2 /(tr.tr-/ 2 ), tr^tr-p =1. 

Since p is a color density matrix, we can use the posi¬ 
tivity of the entropy to conclude that 

Z e ,T</ 2 )«£ ef f(F 2 ), 

and so the use of 2 e ff would give at least as strong a 
linear potential as is obtained with £ eff . 

29 The discussion of Appendix C is readily generalized to 
the QCD case by taking s (o| • • • | 0 ) s to be an expecta¬ 
tion with respect to the source spatial (but not color) 
wave functions, and including the source color wave 
functions in | 0),. Following Appendix A, the only 
changes are then the replacement of arrows by octet 
color indices, and the inclusion of a factor tr c tr - in the 
inner products involving c 0 appearing in Eqs. (C10)- 
(C18). 

30 In contrast to the mean Held approach, the Wilson- 
loop formula evaluates (0|J7 r |0) directly, without ap¬ 
proximation, in terms of a Euclidean functional inte¬ 
gral with imaginary sources. (See also the remarks in 
Ref. 6 above.) 


Erratum: Effective-action approach to mean-field non-Abelian statics, 

and a model for bag formation 
[Phys. Rev. D 23, 2905 (1981)] 

Stephen L. Adler 

The discussion of Appendix C contains several errors. The conclusion that { 0 |3C kl0 10) =2 V mean field is 
correct, but in QCD this matrix element cannot be expressed in the form of Eq. (Cll). Moreover, in 
general the decomposition of Eq. (C13) is not possible, and so the argument leading to Eq. (C12) is erron 
eous. A corrected version of Appendix C appears in S. Adler, in Proceedings of the Fifth Johns Hopkins 
Workshop on Current Problems in Particle Theory , edited by G. Domokos and S. K, Domokos (Johns 
Hopkins Univ., Baltimore, to be published). 

In the sentence preceding the final paragraph of Sec. I, Eq. (20) should read Eq. (12). 
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We study the statics of quasi-abelian quark and antiquark source charges in the approximation in which leading loga¬ 
rithm radiative corrections are retained in the gauge gluon effective action functional. We show that the partial differential 
equation for the flux function is of degenerating elliptic type, leading to flux confinement within a free boundary which is 
a characteristic. The static potential increases linearly for large source separations, with a logarithmic subdominant term. 


L Introduction . As shown in a recent letter [1], by 
making a mean field approximation and a quark 
source charge approximation, the partition function 
for quantum chromodynamics (QCD) can be reduced 
to a relativistic model in which quarks couple to a 
pair of classical abelian background gauge fields obey¬ 
ing effective action dynamics. For the case of a single 
massive quark flavor, the functional integral formula 
for the S-matrix in this model is 

5 = ext^|exp(ir inv [i]) 

V 

X J d[»//] d [\jj] exp^i J d 4 x \j)(ip - /tt)i//jj , 


ext^ { } indicates the extremum of the curly bracket 
over all values of the quasi-abelian gauge potentials 
A. 3 * 8 . Making the standard rescaling by the coupling 
constants ~+A/g> introducing an effective action den¬ 
sity by writing 

r imW = / d 4 * £ effW- ( 2 ) 

and specializing to the case of static, infinitely massive 
sources, eq. (1) gives a formula for the static potential 

5= lim exp(-i F static T) 

J — 

= ton ext^" jexp (i T f d 3 x (£ ef[ [A/g] -A a Q J a 0 


&= 7 


A 

d = a 


A 4 


» 


^1,2,4,5,6,7 = ^ 



\1,2,4,5,6,7 = 




In eq. (1), the matrices X 3 > 8 are quasi-abelian quark ef- 

A 

fective charges, the functional r inv [/l] is the gauge-in¬ 
variant gluon * 1 effective action, and the notation 


1 Also at Racah Institute for Physics, Hebrew University, 
Jerusalem, Israel. 

41 More generally, when there are light quarks which are ob¬ 
served only through their effect on the massive quark sys¬ 
tem, rjnv [A ] is the gauge-invariant effective action of the 
gluon plus light quark subsystem. 


+ mass terms) 



J~ a 0 = -^ 5 XV. (3) 

which can be rewritten (dropping the mass terms) as 


Static = ~ ext A 


fd 3 x(. e en [A/g] -Apl) 



When solved using the classical approximation to the 
effective action, 


£ sn ~\g- 2 7, 9 ip 2 , (5) 

eq. (4) gives classical abelian electrostatics. The leading - 
logarithm model is defined by including radiative cor- 


©1982 North-HoUand 
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rections to -£ eff to leading logarithm order, giving 

( 6 > 

where Z> 0 (> 0) is the one-loop j3-function confinement 
and jtt is the subtraction point, g 2 = g 2 Qj. 2 ). 

The model of eqs. (4) and (6) was studied for iso¬ 
lated quark sources by Pagels and Tomboulis [2], who 
showed that the infrared energy is linearly divergent. 
The leading logarithm model with a pair of oppositely 
charged point sources (as is appropriate to a qq sys¬ 
tem), 

J a 0 = Q4 a [8\x- Xl )-8\x-x 2 )], (7) 

where q a is the unit internal vector, was studied by 
Adler [3,4], who gave a variational argument * 2 show¬ 
ing that for large source separations, F stat i c is 
bounded from below by a linear potential. In this let¬ 
ter we give the results of a detailed analytical and nu¬ 
merical study of the leading logarithm model, with 
particular emphasis on the structure of the domain 
within which the flux is confined. 

2* Analytic formulation . Introducing scalar and vec¬ 
tor potentials 0 and A by writing 

A% = q a 4>, A“ = q“A r (8) 

the variational problem of eqs. (4)—(8) can be rewrit¬ 
ten as 


V = / 0 , VX£= 0, 

VXtf=0, V-J?=0, 

D = eE, H- eB, e = d£ ff /d(^), (11) 

with the field-strength dependent dielectric constant e 
given in the leading logarithm model by 

e = ^Z> 0 log(7/x 2 ). (12) 

The source-free Euler—Lagrange equation for H can 
be satisfied by taking 

eB - 0, (13) 

giving two branches 

(I) * = 0 , 

(II) e=0 -+B 2 =E 2 -k 2 . (14) 

Near the source charges, asymptotic freedom requires 
that the solution approach a Coulomb-like solution 
with E large and B vanishing; together with continuity, 
this implies that a finite domain containing the source 
charges lies on branch (I). 

Specializing the analysis to branch (I), we are then 
left with a problem in nonlinear electrostatics, 

V‘Z>=/ 0 , VX^ = 0, 


^static ext <M 


/d 3 *[£ eff (7)-0/ o ] 


9 = E 2 -B 2 , E=-V<l>, b=vxa, 


/ 0 = C[S 3 (* - *i) - 5 3 (x - x 2 )\, 

with J2 e ff in the leading logarithm model given by 
4? ef f I(7) = I b 0 9 Iog( 7/ eK 2 ), 

k 2 = iii A /e) exp [-4/(6 0 /)]. (10) 

The Euler—Lagrange and constraint equations implied 
by eq. (9) are 


The analysis of refs. [3,4] used a euclidean version of the 
variational principle of eq. (4), in which 7- ( E a ) 2 - ( B a ) 2 
was replaced by (E a ) 2 + (B a ) 2 . The two descriptions coin¬ 
cide in the interior of the confinement domain, where B a - 0. 


D = e(E)E, e(E) = d£ eH (E 2 )/d^E 2 ). (15) 

Let us work henceforth in cylindrical coordinates 
p = (x 2 +y 2 )ll 2 , z, $, with the source charges located 
symmetrically on the z-axis at z = ±a> and let § be the 
azimuthal unit vector. As shown by Adler [4], eqs. 
(15) can be rewritten in manifestly flux conserving 
form by introducing a flux function <£(p, z), in terms 
of which D is given by 

D = — (1/2tt)V0 XV$ = —(0/27tp) X V<£ 

= VX [(§/27rp)<£]. (16) 

The physical interpretation of <£ follows from calculat¬ 
ing the total flux through a surface of revolution S 
(with element of area d^4) bounded by a circle C of 
radius p and z-intercept z (with element of arc-length 
d/ = d/0), 
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flux through S = J dA*D 

S (17) 

= /(L4-VX [(0/2rrp)<^>] = /d/*(0/27rp)<^> = <^>- 
S C 


From eq. (17) we learn that <£ assumes the following 
boundary values on the axis of rotation and at infini- 


ty* 3 , 




a 

O 

II 

0 

A 

o 

n 

Q. 

|z| >a. 


ii 

0 

a 

O 

II 

Q. 

\z\<a. 



as p 2 + z 

> 

(18) 


which together with eq. (16) guarantee that the equa¬ 
tion = / 0 is satisfied. To get a differential equa¬ 
tion for <£ we rewrite the equation V X J? = 0 in the 
form 

V* {(fi/p) X [(— 9/27rpe [D]) X V<£] } = V‘(V0 X E) 


= £‘(VX V0)-V0-(VX£)=O. (19) 

Expanding out the triple product on the left-hand side 
of eq. (19) and using § * V<£ = 0, we obtain 

V‘[a(p, | V$|)V<£] =0, (20a) 

ff(p,|VS|)=l/p 2 e[Z>], ( 20 b) 

D=\\$\l2i,p, e[D] =e(E{D)\ 

with E(D ) obtained by inverting the equation 

D = Ee(E)=d£ e f f (E 2 )ldE. ( 21 ) 


Eqs. (18)—(21) are the basic statement of the bounda¬ 
ry value problem for <£. Once <£ has been determined, 
the static potential can be calculated by substituting 
V*Z) =/ 0 into eq. (9) and integrating by parts, giving 

^static 

= /d 3 x [ED - £ e{{ (E(D) 2 ) + e cff (E(0) 2 )], (22) 

These boundary values correspond to the convention of al¬ 
ways drawing the surface S so that it crosses the z-axis at a 
point Zq> a. 
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where an infinite constant has been added to eq. (22) 
to render the integral convergent at infinity. By using 
the identity 

J2 C{{ (E(D) 2 ) - £ ef{ (E( 0) 2 ) 

E(D) E(D) 

= f d E'd£ e{{ /dE'= f d E'D(E') 

E{ 0) 2T(0) 

D 

- ED - J E(D’)dD\ (23) 

0 

eq. (22) can also be rewritten in the familiar form 

D 

< 24 > 

0 

Considerable insight into the behavior of the solu¬ 
tions of eq. (20) is obtained by rewriting it to explicit¬ 
ly show the structure of the second derivative terms 
(including those arising from the | V<£| -dependence of 
a). Defining the inward-directed unit normal n and the 
corresponding normal derivative d n , 

ii = Vfc/|V$|, d n =ii-V, (25) 

a straightforward calculation shows that eq. (20) is 
equivalent to 

[d 2 p + d 2 + (a - l)d 2 ] 4> - ap- 1 d p 4> = 0, (26a) 

a = a(p, |V<£|) = 1 + 6 log a/0 log | V<£|. (26b) 

A 

Letting I be the unit tangent to the surfaces of con¬ 
stant <£ and bj be the corresponding tangential deriva¬ 
tive, we have 

d 2 + d? = d 2 + d? + first derivative terms, (27) 

p z n l v ' 

and so the characteristic form of eq. (26a) can be writ¬ 
ten as 

d 2 +ad 2 . (28) 

Thus, eq. (26a) is elliptic, parabolic or hyperbolic ac¬ 
cording as to whether a > 0, a = 0 or a < 0. Before 
proceeding further with the analysis of the characteris¬ 
tics, let us use eq. (26a) to study the structure of z- 
axis translation-invariant solutions, for which <£ = <£(p). 
We then have 0 2 <£ = 0, 0 2 <£ = 0 2 <£, and eq. (26a) re¬ 
duces to 
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a(d 2 p - P -h p y S = 0, (29) 

which on branch (I) (where a ^ 0) has the solution 

$ = $0 + kDq p 2 • (30) 

Eq. (30) describes a uniform flux D = zZ) 0 filling all 
of space; we see that eqs. (20) and (26) do not admit 
solutions describing a translation-invariant bounded 
flux tube. 

The analysis of eqs. (8), (9), (11) and (13)—(30) 
applies generally to any effective action density of the 
form £ C ff(E 2 )\ let us now specialize to the case of the 
leading logarithm model, as given in eqs. (10) and (12). 
Eqs. (20b), (21) and (26b), which implicitly define a 
and a, are conveniently written in terms of a dimen¬ 
sionless function f(w ) defined as the solution to the 
transcendental equation * 4 

"'“/log/, f>l, (31) 

giving 

o(p, |V<£|)= (2 ttk/p|V<I>|)/(w), 

a = w/'(w)//(w), w = d n ®lnt> 0 KP = 2 D/b Q K. (32) 

For small w and large w, the behavior of f(w) is given 
by 

/= 1 +w + O(w 2 ), |w| <€ 1, 

/- (w/ log w)[ 1 + 0(log log w/log w )], w > 1, (33) 
giving for the corresponding behavior of a 
a-w + 0(w 2 ), |w|^l, 

a = 1 — (log w)~ *+ 0(log log wjQog w) 2 ), w > 1. (34) 

Comparing eqs. (34), (32) and (28), we see that the 
differential equation of eq. (26a) is of degenerating el¬ 
liptic type [5], and has a real characteristic at a sur¬ 
face of constant <£ where D cc | V<£| = 0. The second 
normal derivative 6^4? is discontinuous across this 
characteristic, which acts as a free boundary, dividing 
space into two causally disconnected regions. From 


The choice of branch is again dictated by the requirement 
that the solution be continuously connected to the strong 
field, asymptotically free regime. 
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the boundary condition of eq. (18) and the continuity 
of <£, we learn that $ = 0 on the free boundary. Since 
the exterior of the free boundary is completely sur¬ 
rounded by surfaces on which <E> = 0, <E> vanishes identi¬ 
cally outside the characteristic, and therefore the ex¬ 
terior solution lies on branch (II) of eq. (14). At a 
point B on the free boundary where p = p B and where 
the radius of curvature of the free boundary is , eq. 
(26a) can be integrated to give an approximation to 
the interior solution, 

<!>™^b 0 Kp B /R B )(n-±l 2 /R B ) 2 , (35) 

with n and / normal and tangential cartesian coordi¬ 
nates which are zero at B. Since <£ is increasing to¬ 
wards the interior, we must have R g > 0, and so the 
free boundary is everywhere convex. This in turn im¬ 
plies that all points on the free boundary lie within a 
finite distance of the origin, with the free boundary in¬ 
tersecting the z axis at points p = 0, z = ±z fi . Since 0 
= 0 at z = 0 and, from eq. (12), E - \ V<f>| = Kon the 
free boundary, we have 0(p = 0, z = ±z B ) = ±kL 9 with 
L the length of the segment of the free boundary lying 
in the quadrant p > 0, z > 0 of the p, z plane. But 
since the scalar potential 0 becomes infinite at the 
source charge coordinates p = 0 ,z- ±a , this implies 
that z B >a. The only qualitative feature of the solu¬ 
tion which we have been unable to characterize analy¬ 
tically is the detailed structure of the free boundary-z- 
axis intersection; the numerical results strongly suggest 
that the free boundary is smooth, with no cusp at the 
z-axis, but we do not have a proof of this. Turning fi¬ 
nally to the static potential, we learn from eqs. (31) 
and (32) that inside the free boundary E is bounded 
by 

E(D)/K=f(w)> 1, (36) 

which when substituted into eq. (24) gives 

(37) 

Writing d 3 x = d/ d^4, with / the length along and d^4 
the element of area perpendicular to the flux lines of 
D, eq. (37) yields the lower bound * s 


* 5 For a discussion of the removal of the infinite Coulomb 
self-energies from eq. (38), see ref. [3]. 
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r static 

R - \x 1 - x 2 \ = 2a. (38) 

In determining F stat j c computationally, we use eq. 
(23), which in the leading logarithm model can be con¬ 
veniently rewritten in the form 

^static =/ d ^ ic(V<I./27r) 2 (l + £), 

£=(/ 2 -1)/(2/>v). (39) 

We believe that the connection found above be¬ 
tween degenerating elliptic operators and a linearly in¬ 
creasing static potential is a very general one. For ex¬ 
ample, let us briefly consider the case in which 
-G e ff(^ 2 )? rather than attaining its minimum at a non¬ 
zero value of E as in the leading logarithm model, at¬ 
tains its minimum at E = 0 and behaves there as i? eff 
^'E'y. Then for an isolated charge at r = 0 in spherical 
coordinates, one has r~ 2 ~ D ~Ey~^ , giving [2] for 
the infrared energy in a box of side L (and presumably 
for the long-distance behavior of the static potential 
of two opposite charges) 

^infrared ~^ 3 ^~^ (7_3)/(7_1) - ( 40 ) 

Substituting £ eff ^E^ into eqs. (21)—(26) above, we 
find that 

a(D = 0 ) = 1 +d(log£’ 2 - 7 )/d(log£’ 7 “ 1 ) = ( 7 - l)" 1 . 

(41) 

Thus for finite 7 , where the infrared energy grows less 
strongly than linearly with L , the characteristic form 
of eq. (26a) is always elliptic, while in the limit as 7 
where the infrared energy grows linearly, the 
characteristic form again degenerates * 6 from elliptic 
to parabolic at D = 0. 

3. Numerical results. We have developed a numeri¬ 
cal method for solving eq. (20), based on a two-step 

* 6 These estimates do not determine the structure of the con¬ 
finement domain. Very likely, when/? e ff has an infinite 
order zero at E = 0, the confinement domain fills all of 
space. (We wish to thank J. Chayes and L. Chayes for dis¬ 
cussions about this case.) For a renormalization group argu¬ 
ment suggesting that the minimum ofi? e ff in fact lies at 
non-zero E , see ref. [4]. 


procedure in which <£ is updated by a single over-re¬ 
laxed iteration of eq. (20a) for fixed a, and then a is 
updated by using eqs. (20b), (31), and (32), with a 
Newton iteration used to solve the transcendental 
equation for/. The boundary conditions of eq. (18) 
are applied on the rotation axis, while the boundary 
condition <£ = 0 is imposed on the perimeter of the 
computational grid, which must be chosen large 
enough to completely enclose the ultimate free bound¬ 
ary. A detailed discussion of the theoretical and practi¬ 
cal aspects of the numerical algorithm will be given 
elsewhere [6]; we give here some sample results from 
our calculations, done for Q = (4/3) 1 / 2 and - 9/8tt 2 




(b) 


Fig. 1. (a) The flux function 4> on a plane through the z-axis, 
plotted vertically on a linear scale. The base of the figure is at 
0=0. (b) The energy density £, similarly plotted. 
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Fig. 2. The dielectric constant e on a plane through the z-axis, 
plotted vertically on a logarithmic scale. From the top of the 
figure to the base spans 14 decades, corresponding to e m hi 
= 10“ 15 . (When e m i n is reduced to 10“ 35 , the residual struc¬ 
ture along the axis at the base of the figure is eliminated.) 

[corresponding to SU(3) QCD with three light quark 
flavors]. The results shown in figs. 1 and 2 were com¬ 
puted for k = 8, and required roughly 1—2 min of 
CPU time on a VAX 11/780 computer, for conver¬ 
gence on a single-quadrant 25 X 25 computational 
mesh. The figures all show elevation plots of physical 
quantities measured on a plane passing through the ro¬ 
tation axis. Fig. la shows the flux function $ plotted 
on a linear scale starting at $ - 0; the discontinuity of 
eq. (18) was enforced by taking the charges on lattice 
sites where <£ = QJ 2, and taking <£ = Q (<£ = 0) on the 
axis for \z\ < a (|z| > a), as is clearly viable on the 
plot. Fig. lb shows the field energy density £(x) 
plotted on a linear scale starting at £ = 0, with the 
Coulomb energy peaks (which rise by a further factor 
of 100 before being cut off by the finite mesh size) 
clearly visible. From fig, 1 we see that $ and £ are 
nonzero only within an oval-shaped curve, which is 
the continuum-limit free boundary, and which crosses 
the axis of rotation with no visible cusp in £. The 
complete independence of the interior and exterior re¬ 
gions can be seen from fig. 2, which shows e plotted 
on a logarithmic scale ranging from an imposed lower 
limit of e min = 10“ 15 at the base to a maximum (gov¬ 
erned by the mesh spacing) of e ~ 0.1 at the Coulomb 
peaks. A logarithmic plot of £ looks very similar, 


apart from having more pronounced dimples at the 
charge sites. Repeating the computation on meshes up 
to a factor of 4 finer shows good convergence as the 
mesh spacing goes to zero * 7 . We find that in the 
range 0.25 < k 1 / 2 R < 128, the large-distance behav¬ 
ior of F static (/?) is fitted by the formula 

^static = ( 4 / 3 ) 1/2 K-^ + 1.95 kV 2 log (k V 2 R) 

+ constant - 0.40/7?, (42) 

with uncertainties of order 1 in each final decimal 
place. Thus, the computational results indicate that 
the bound of eq. (38) is saturated and that the leading 
correction to the linear potential is logarithmic, in 
marked contrast to the AR + B + CR “ 1 form ex¬ 
pected [7] in the string model and in strong-coupling 
lattice gauge theories. This feature of the effective ac¬ 
tion approach should eventually have testable experi¬ 
mental consequences. In subsequent work we plan to 
map out F stat j c for all distances and to determine the 
ratio of the string tension to Aj^g, in both the leading 
logarithm model and in the extended model in which 
log log renormalization group corrections are included 
in e. 

This work was supported by the Department of 
Energy under Grant Number DE-AC02-76ER02220. 

* 7 More precisely, to eliminate the Coulomb self-energy diver¬ 
gences we study only differences K sta tic(^i) “ ^static^X 
with identical mesh structure around the charges. These 
quantities converge as the mesh spacing is decreased. 
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On page 406, first line below eq. (6), “confinement” 
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We give numerical and analytical results for the static potential of quasi-abelian quark and antiquark source charges, in 
the models in which radiative corrections to leading log order, and to leading log plus leading log log order, are retained in 
the gauge gluon effective dielectric functional. When the scale length of the latter model is fixed to give a second order fit 
to Martin's phenomenological heavy quark potential, the point of tangency with Martin’s curve lies at 0.51 fermi (in the 
center of the cc and bb quarkonium region), and the long- and short-distance limits yield, respectively, a string tension of 
320 MeV, and an asymptotic freedom scale mass of Aj^<7 = 720 MeV [all for color SU(3) with 3 light quark flavors]. Thus, 
effective action methods, using only renormalization group-improved perturbation theory as input, give a reasonable ac¬ 
count of the behavior of the heavy quark-antiquark static potential at all length scales. 


In a recent letter [ 1 ], we gave a qualitative analysis 
of non-linear effective action models for heavy quark 
statics, and showed that in the renormalization group 
approximation they predict both total flux confine¬ 
ment, and a linear static potential at large source se¬ 
parations. As a continuation of our study of the re¬ 
normalization group models, we present here results 
of a numerical computation of the static potential at 
all length scales, from which we extract theoretical 
predictions for several parameters characterizing the 
strong interactions. 

The models under consideration can be written as 
a problem in non-linear electrostatics, 

V-D=y 0 , VXE = 0, (1) 

with/ 0 a source density corresponding to static, quasi- 
abelian quark and antiquark source charges, 

/o = 2[ 63 (*“*i)~ 63 (*~*2)]> ^ 

e = (4/3)l/ 2 , ^-*21=/?, 

and with D related to E by the non-linear constitutive 
equation 

1 Also at the Racah Institute of Physics, The Hebrew University, 
Jerusalem, Israel. 


D = e(E)E . (3) 

In the leading log and leading log log models, e(E) is 
given by * 1 

e(E) = \ b 0 log (E/k), leading log , (4a) 

e(E) = 5 [log (E/k ) + £ log log (E/k)] , leading log log, 

(4b) 

with and | the renormalization group constants 
[for SU(3) quantum chromodynamics (QCD) with 
7V f light quark flavors] 

b Q = (1/Stt 2 ) (ll-|V f ), 

1 = 2(51-f7V f )/(ll-|7V f )2. (5) 

Throughout our calculations we will take iV f = 3, 
giving 

+1 The effective action corresponding to eq. (4a) contains a 
single leading logarithm term. The effective action corre¬ 
sponding to eq. (4b) contains the standard renormaliza¬ 
tion group log and log log terms, plus additional subdom¬ 
inant terms which vanish as \Io£(E/k)\ becomes infinite. 
Alternatively, if the log log model were defined by taking 
the effective action functional to have exactly a log plus 
log log form, the corresponding effective dielectric func¬ 
tional would differ from eq. (4b) by subdominant terms. 
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b 0 = 9/8tt 2 = 0.1140, \ = 0.790 . ( 6 ) 

This fixes all parameters in the model apart from the 
scale mass k */ 2 , which is set equal to unity to define 
dimensionless units for the numerical computations, 
and then is reinserted below and determined by com¬ 
paring with Martin’s fit to the heavy quarkonium 
spectra. According to the standard renormalization 
group analysis [2], eq. (4b) gives the leading two terms 
in an asymptotic expansion of e(E) for strong fields 7?. 
At strong fields, the corrections to the expression in 
square brackets in eq. (4b) are expected to be of order 
unity and, to the extent that they are slowly varying, 
can be absorbed into the scale mass k We will, 
however, be using the formulas of eqs. (4a) and (4b) 
outside the strong field region, in fact for all fields 
j^min < °°, where e(iT min ) = 0. A rationale for 
this extension has been given by Adler [3], who points 
out that renormalization group estimates are formally 
valid whenever the running coupling 

g 2 (E)~ [\b Q ]og(E/K) + ..-]- 1 , 


E/k -f{w) , w = ID/b^K , ( 8 a) 

with f(w) a transcendental function defined by 


w=/log/, 


log model , 


w = /(log/+ £ log log/), log log model , 


( 8 b) 


from which we can calculate the minimum electric 
field 


E miJ K = /(°) = 1 l0 S model - 

= 1.680 log log model. 


( 8 c) 


Once theZ) and# fields have been solved for by the 
numerical algorithm of ref. [ 1 ], the static potential 
can be calculated from the formula 


D 


*W*) = /d 3 * / E(D')dD' 


0 


=f d 3 x \ b 0 K 2 {5 wf(w) + i I f(w) 2 - /( 0 ) 2 ] 


is small in magnitude, which is true both when E/k > 1 
[giving g 2 (E) small and positive] and when E/k < 1 
[giving g 2 (E) small and negative]. This argument sug¬ 
gests that there is a second weak field asymptotically 
free regime, where eq. (4b) again gives the leading be¬ 
havior of e, and where e (or more precisely, the real 
part of e) is negative. Although renormalization group 
estimates cannot be used at intermediate field 
strengths, the statements that e < 0 for E/k ^ 1 and e 
> 1 at E/k > 1 imply that e must cross from 1 to 0 at some 
intermediate field strength 7i min , and this is the essen¬ 
tial feature of our model. The numerical results ob¬ 
tained below suggest that, in fact, the corrections to 
eq. (4b) are of the form 

Pog(tf/ic) + £ log log (E/k) + 0(1)], (7) 

with 0 ( 1 ) representing non-constant terms which are 
effectively of order unity in the entire range E m • < 
E<o °* 2 . 

As discussed in ref. [ 1 ], the numerical procedure 
for solving the model of eqs. (1)—(4) treats theZ) 
field as the fundamental dynamical variable, in terms 
of which E is obtained by inverting the constitutive 
equations of eq. (4). This gives 

* 2 For footnote see next column. 


+ H[y( 2log/(w)) — y(2 log/(0))]} , (9) 

withy(x) the exponential integral 

x t 

y(x)=fjdt, (io) 

— OO 

which is available in the IMSL function library. Some 
useful results concerning the large-7? and small-7? 
limits of ^static can be deduced by analytical methods. 
A simple flux conservation argument [1,6] shows 
that the static potential has a linear lower bound for 
large R , 

* 2 Possible theoretical support for the idea of a weak field 
asymptotically free regime is given by recent work of 
’t Hooft [4] and of De Calan and Rivasseau [5], showing 
that the sum of planar skeleton diagrams has a finite ra¬ 
dius of convergence as a function of coupling constant. 
Assuming that renormalization effects can be taken into 
account by using running couplings in the skeleton expan¬ 
sion, this result implies that in the large-A c limit, where 
planar diagrams dominate, the color dielectric constant 
will be an analytic function of the running coupling for 
small \g 2 {E)\. Renormalization group estimates will then 
apply for small negative as well as small positive g 2 (E), 
and if the N c = 3 theory behaves qualitatively like the N c 
= <» limit, one is led to the model discussed in the text. 
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F static(^) > E mi n Q R + const (11) 

and our numerical results (accurate to a few tenths of 
a percent, and extending out to k ^ 2 R or order 100) 
show that this bound is saturated. Hence we have 

string tension = K ^ 2 [Qf( 0)] ^ 2 . (12) 

A detailed analysis [7] of the short-distance perturba¬ 
tion theory of the leading log and log log models 
shows that 

^staticW = -(e 2 / 4 ^^o)[f + °(f 3 )] . *-*0. 

f=/(V/ w P> W P — 1/ApR 2 , (13) 

with A P given by a numerical integral, the evaluation 
of which [7] yields 

A P = 2.52 k 1 / 2 . (14) 

Since from eq. (8b) we find 

f = 1 /log Wp + 0 [log log Wp/(log Wp) 2 ] 

+ O [(1/log w p ) 3 ], (15) 

A p corresponds to the standard definition of the scale 
mass associated with the static potential * 3 , which is 
in turn related to the scale mass A r associated with 
the force law, 

—^staticC^) = (2 2 /4^ 2 ^ b ){]/log w F 

+ 0 []og]ogw F /(]ogw F ) 2 ] + 0 [ 1 /(log Wp) 3 ]} , 
w F s 1/A 2 ./? 2 , (16) 

by [8] 

Ax = A P /« • (17) 

Since A r is related to the standard strong interaction 
scale mass Aj^g by the formula [8,9] (with 7 E = 
0.5772... Euler’s constant) 

A x/Ams = ex P [7 e ~ 1 + (l/87T 2 Z> 0 )(f -if N { )] 

= 0.967 for N t =3, (18) 

* 3 As discussed in ref. [7], the log log model does not give 
the correct value of the coefficient of the log log term in 
the expansion of eq. (16) (where £ should appear, the log 
log model gives £ - 1), and so it is not good beyond one- 
loop order at short distances. Nonetheless, the model per¬ 
mits a meaningful determination of the scale mass Ap, 
because this involves only a one-loop calculation which is 
independent of the value of the parameter £. 


combining eqs. (14), (17) and (18) yields 

Ams = 0.959 k !/2 . (19) 

Taking the ratio of eq. (19) to eq.(12), the scale mass 
k ^ 2 drops out, and we get 

Apjg/string tension = 0.959/[2/(0)] l > 2 

- 0.892 log model , 

= 0.689 log log model . (20) 

Experimentally, if the string tension is identified with 
that computed from the slopes of Regge trajectories, 
we have 

string tension ^ 400 MeV . (21a) 

Recent determinations of Aj^g give values 

100 MeV ^ Aj^g <! 400 MeV , (21b) 

-* 0.25 < (A^g/string tension) expt < 1 , (21c) 

with the values at the lower end of the indicated 
ranges currently favored [10]. 

Since F^i^R) as given by eq. (9) contains infinite 
additive self-energy contributions, the quantity which 
can be measured computationally is E static (7? 1 ) — 
E sta tic(^2)» where identical mesh structures around 
the source charges must be used at the two separa¬ 
tions Rj ,R 2 . By using standard over-relaxation meth¬ 
ods [11] we have made a sequence of measurements 

of A 2 Static = Static (A) - V^ c (R/2), with k 1 I 2 R 
ranging from ^ 100 to 10 7 , for both the leading log 

and the leading log log models. For k F 2 R smaller 
than 1 the use of jacobian transformations is neces¬ 
sary to get good accuracy; this and other details of 
the computational methods will be described in a pe¬ 
dagogical review article which is in preparation [12]. 
As already noted, the large-distance results show that 
the bound of eq. (11) is saturated. For the smallest R 
values studied (10“ 7 < k ^l 2 R < 10~ 5 ), the measure¬ 
ments of A 2 K static agree with the leading term of the 
asymptotic formula of eqs. (13) and (14) to within a 
few tenths of a percent. The entire sequence of 
A 2 F s tatic y al ues has been fitted to the analytical 
formula 

Static = kV 2 [F(kV 2 R) + C] , (22) 

with F\ og (r) and A loglog (r) as given in table 1, and 
with C an undetermined overall constant. We estimate 
the combined accuracy of the measurements and the 
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Table 1 

Functions F\ 0 g{r) and F\ 0 g i 0 g(r). The coefficients are given in 
table 2. 


r range F{r) 


r< 0.0125 F(r) = ~{Q 2 l4nr\b 0 ) [/(wp)/w P ](l+fl ,/ 2 ) 

wp = l/(2.52/-) 2 , /(w) as in eq. (8b) 

0.0125 < < 0.125 F(r) = K + a(r/ 0.125)^ 

E = P + 7In(1 /r) + 6 [In (l/r )]2 + e[ln( l/r )] 3 

0.125 </-<2 F(r) = K'+ct'\ogr + f3'(\ogr) 2 + y'(logr) 3 

+ 6'(logr) 4 + e'(log/-) 5 

2 « r F(r ) - K" + ( 4/3) U2 f(0)r + a"/r ln 

+ p" Ir + y" log r 


Fitting for A 2 F static to be better than 2% for alii?, and 
better than 1% for very large and very small R values. 

To determine the scale mass k */ 2 appearing in eq. 
(22), we fit the potentials of table 1 to the phenomeno¬ 
logical formula given by Martin [13] 

^static = -8.064 GeV + 6.870 (GeV) 1 - 1 ^ 0 - 1 , (23) 

which is known to give a good description of the cc 
and bb quarkonium spectra. Writing R with 


Table 2 

Coefficients of functions F\ 0 g(r) and F\ 0 g \ 0 g(r) as given in 
table 1. 


coefficient 

log model 

log log model 


5.38 

5.14 

a 2 

0.545 

0.556 

K 

-2.323 

2.569 

a 

-15.118 

-15.717 

0 

-0.305 

-0.305 

7 

0.00368 

0.06964 

8 

-0.00885 

-0.0284 

e 

0.00043 

0.00212 

K ' 

-9.686 

-5.950 

a 

3.518 

3.840 

$ 

0.355 

0.807 

t 

7 

0.256 

0.395 

S' 

0.0439 

0.0683 

e' 

0.0127 

0.0110 

K" 

-10.520 

—7.781 

a" 

0.139 

0.465 

f 

-0.46 

-0.58 

7" 

1.966 

1.604 


/•dimensionless, and differentiating to eliminate the 
additive constant C, we find that the choice 

K l l 2 (r*) = [O.687/(r*) 0 - 9 F'(r*)] {1/1 ' 1) GeV , (24) 


makes the slope of eq. (22) identical to that of Martin’s 
potential at r = k = /**. A plot of eq. (24) shows 
that k 1 / 2 (r*) vanishes as r* -* 0 and as r* °°, and has 
a single maximum in between. At the maximum of 
k ^(r*), the potential of eq. (22) has a second order 
contact with Martin’s curve, giving the closest fit. 

From the formulas of table 1, we find that * 4 


K m 

max 


= 0.2291 GeV at r* = 0.789 log model, 

(25) 

= 0.2274 GeV at r* = 0.589 log log model. 


When reexpressed in physical distance units, the match 
points are 

R* = r*ln\ll x 

= (0.789/0.2291 GeV)X (/ = 0.1973 GeV fermi) 

= 0.68 fermi log model , 


R* = = 

= (0.589/0.2274 GeV) X (/ = 0.1973 GeV fermi) 

= 0.51 fermi log log model, (26) 

which lie in the middle of the heavy quarkonium re¬ 
gion of 0.1 fermi <7? < 1 fermi, thus giving a non¬ 
trivial check on both the fitting procedure and the 
models. Adjusting the additive constants C to bring 
the potentials of eq. (22) into coincidence with 
Martin’s curve at the match points, we get the formu¬ 
las 


V st atic(*) = 0.2291 GeV 

X [F log (0.2291 GeV R) + 9.250] , (27a) 

^staticW = 0.2274 GeV 

x t^ioglogC 0 - 2274 GeV 7?) + 5.582] , (27b) 

for the static potential in the log and log log models, 


44 An interesting feature of the fitting procedure is that al- 
though F stat | c changes significantly from the log to the 
log log model, the values of the scale mass k 1/2 deter¬ 
mined in the two cases agree to better than a percent. 
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Fig. 1. The potentials of eq. (17a) [log model; short dashes], 
eq. (27b) [loglog model; long dashes], and eq, (23) (Martin’s 
fit; solid line). The cc and bb quarkonium region lies between 
0.1 fermi and 1 fermi, and the validity of eq. (23) is restrict¬ 
ed to this interval. 

respectively. These are plotted, together with Martin’s 
curve of eq. (23), in fig. 1. The potentials of eq. (27), 
particularly that for the log log model, are clearly in 
good agreement with Martin’s curve in the heavy 
quarkonium region. 

Substituting eq. (25) back into eqs. (12) and (19) 
we get the string tension and in physical units, 

string tension = 250 MeV log model , 

= 320 MeV log log model, 

A^s = 220 MeV both models . (28) 

To summarize, effective action models, which use 
only renormalization group-improved perturbation 
theory as input (but which employ nonperturbative 
methods to solve the resulting differential equations), 
give a reasonable account of the heavy quark static 
potential at all length scales. This suggests that the 
following directions for further investigation using 
these methods will be of interest: (i) The study of the 
spin—spin and spin—orbit potentials in heavy quark sys¬ 
tems. A formalism for doing this has been set up by 
Hiller [14], and will give parameter-free predictions 
using the potentials of eq. (27) as input, (ii) The use 
of the relativistic effective action model proposed by 
Adler [15] to study binding in light-quark systems, 
and in particular to investigate chiral symmetry 
breaking effects, (iii) The study of whether a rear- 


4 November 1982 

rangement [15] of QCD around a zeroth order ap¬ 
proximation based on the use of the renormalization 
group-improved effective action can be used to give a 
systematic approximation scheme for treating bound 
state problems, which are currently inaccessible using 
the standard perturbative QCD methods. 

We wish to thank H. Pagels for calling our atten¬ 
tion to ref. [13]. This work was supported by the De¬ 
partment of Energy under Grant Number DE-AC02- 
76ER02220. 

Note added. Where the term “string tension” is 
used in the text, what is meant is the square root a 1 / 2 
of the string tension as conventionally defined by the 
formula 

Static W = aR + 0(1) • 

R^oo 
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S.L. Adler and T. Piran, The heavy quark static poten¬ 
tial in the leading log and the leading log log models, 
Phys. Lett. 117B (1982) 91. 

On page 92, first column, the argument for the 
existence of aniT m i n at which e(iT min ) = 0 implicitly 
makes the physically motivated assumption that e(E) 
is a continuous function of E. Continuity of e, togeth¬ 
er with the statements e < 0 for E/k 1 and e > 1 for 
E/k > 1, implies that e vanishes somewhere in the inter¬ 
mediate-field-strength region where E ~ k and where 
the running coupling is large, even though renormaliza¬ 
tion group arguments cannot be directly used there. In 
a recent letter by Elizalde [E. Elizalde, Is the next-to- 
leading log model confining?, Phys. Lett. 115B (1982) 
307], renormalization group arguments are used un¬ 
critically in the intermediate field strength region. This 
leads to a spurious infinite discontinuity in e(E), and 
hence to Elizalde’s erroneous conclusion that log log 
renormalization group corrections spoil the confining 
property of the leading log model. 

On page 94, in the first line of table 1, ui 2 ” should 
read 

In both parts of eq. (26), “/ = 0.1973 GeV fermi” 
should read “1 = 0.1973 GeV fermi”, and in the sec¬ 
ond part of eq. (26), should read 

On page 95, in the first line of the caption to fig. 1, 
“eq. (17a)” should read “eq. (27a)”. 

Ref. [7] should read: S.L. Adler, Short distance 
perturbation theory for the leading logarithm models, 
Nucl. Phys. B., to be published. 

Ref. [ 12] should read: S.L. Adler and T. Piran, 
Relaxation methods for gauge field equilibrium equa¬ 
tions., Rev. Mod. Phys., to be published. 
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Quasi-Abelian versus large-A c linear confinement 

Stephen L. Adler and Herbert Neuberger 
The Institute for Advanced Study, Princeton, New Jersey 08540 
(Received 6 December 1982) 

In ihe quasi-Abelian approximaiion as well as In ihe large-A c limil sources in ihe adjoini 
represeniaiion cannoi be screened. Neveriheless, ihe iwo phenomena are differeni because ihe 
siring lension is proporiional lo ihe square rool of ihe Casimir operaior in ihe firsi case and lo 
ihe Casimir operaior iiseif in ihe second. 


It is possible that the response of a non-Abelian 
gauge theory to the introduction of exiernal color 
sources Can be modeled by the response of a non¬ 
linear Abelian system to appropriately defined exter¬ 
nal charges. If we restrict ourselves to local and 
gauge-invariant systems the model is defined by an 
action of the following type: 

a = §d 4 x[f\F piV l (x)]F piV Hx)+j?'Ar\ . (1) 

By locality we mean that / depends only on F^lx) 
and not on higher derivatives. Systems of the type 
(1) have been investigated lately from an effective- 
action viewpoint. 1 In a limiting case they reduce to 
the models used in bag computations. 2 

The Abelian model encounters one obvious diffi¬ 
culty: The medium cannot screen even zero-triality 
[for SU(3) f ] sources. Therefore, the existence of 
linear forces binding quarks implies the same for 
color octets separated by arbitrary large distances. 

The Abelian vacuum cannot simulate color-octet pair 
creation. 

It was noted some time ago that a somewhat simi¬ 
lar phenomenon occurs in the N = limit of a non- 
Abelian SU(A) gauge theory 3 : At infinite N the pro¬ 
bability of creating a pair of particles in the adjoint 
representation with the color content necessary to 
screen a fixed external charge vanishes. 4 Again the 
existence of linear forces binding charges in the fun¬ 
damental representation implies the same for the ad¬ 
joint representation. 

The purpose of this Brief Report is to point out 
that the above similarity does not work on a more 
quantitative level. 

We start by using an argument due to Lieb regard¬ 
ing Eq. (I). 5 For the static problem we take 


y; xl = 8 M0 < ? [8 3 (x-/e)-8 3 (x + /e)] . (2) 


e is a unit vector. With fixed q and /, a variation of 
the action with respect to A^ix) will determine the 
field Fp V (x). Then the energy of the system, 

E(q,l ), can be computed. If we now scale q and / by 


q ->\ 2 tf 
/ — A/ , 



we obtain 

E(q,l) = -fE(\ 2 q,\l)-»E(q,l) = l 3 e(q/l 2 ) . (4) 

The linear confining and Coulomb cases, respective¬ 
ly, are given by 

E — ql , 

E~qVl ■ (5) 

Equation (5) tells us that the string tension goes as 
the square root of the coefficient of the Coulomb po¬ 
tential. For most applications to non-Abelian statics 
this implies that the string tension is proportional to 
the square root of the Casimir operator in the 
representation of the external sources. 

We turn now to the non-Abelian large-A case. The 
force between the external sources is extracted as fol¬ 
lows 6 : 


F[r]U)=- 


£± 

dl T 


lim In 

r—oo 


■ • 

-f- <XM(»Vr.«)> 

d[r] 



In (6) lr] denotes the representation, d[ r \ is its 
dimension, X[ r ] its character, W T ,i is the parallel 
transporter (untraced Wilson loop operator) around a 
planar rectangle of length T and width /, and ( - - ■ ) 
denotes the expectation value. At infinite A, SU(A) 
is undistinguishable from U(A) and therefore 


f -X[adj]( W) = 

d [adj] 


1 

d [fun] 


2 


X [fun] (WO 



Because of factorization at infinite N Eq. (7) holds 
also for the expectation values and, hence 

F[ad}]U ) ~ 2/ r [f un ](/) . (8) 

Equation (8) is correct independently of /. There¬ 
fore, if we have confinement, the string tension 
scales precisely as the coefficient of the Coulomb 
force, that is, as the Casimir operator and not its 
square root. 

As a check on our reasoning we consider the ap¬ 
proximation in which the interaction between sources 
is assumed to be given solely by the two-point 


27 1960 © 1983 The American Physical Society 
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(Ap(x)Ay(y)) Green’s function. A partial resum¬ 
mation of Feynman diagrams has been claimed io 
give a 1//7 4 singularity in the infrared, leading io a 
linear potential (albeit without flux confinement) at 
large distances. 7 Since the diagrams which were 
summed were all planar one should obtain a result 
consistent with the large-A argument. The use of 
only the two-point function allows Abelian modeling. 
Now, however, the requirement of locality cannot be 
met, and, in fact, a singularity in the propagator 
corresponds to an additional 9^ 2 in the F^ 1 term of 
(1). This precisely alters the scaling argument in the 
correct way. 


To conclude, local nonlinear effective-action 
models for confinemeni (generalized flux-tube 
models) and the large-A limit predict different charge 
dependences for the linear confining poiential. This 
result, together with ihe faci that the gauge group 
center is not felt at infinite A, suggests that different 
confinement mechanisms may be operative at infinite 
A and at finite A. 9 If this were the case, then no de¬ 
cisive lesson regarding A = 3 confinement could be 
obtained from studies of the large-A limit. 

This work was supported by the U.S. Department 
of Energy under Grant No. DE-ACO2-76ERO2220. 
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We analyze chiral symmetry breaking in QCD in Coulomb gauge. Using the Ward identities, 
we derive the renormalized gap equation from the renormalized Dyson equation for the vector 
and axial-vector vertices. We work within the ladder approximation, in which the Bethe-Salpeter 
kernel is a sum of longitudinal and transverse terms, depending Only On momentum transfer. This 
relates the chiral symmetry breaking parameters to the static quark potential. When transverse 
gluon exchange is neglected, our gap equation agrees in the infrared with that obtained by Amer 
et al. from a non-normal-ordered Coulomb gauge hamiltonian, while disagreeing with the gap 
equation obtained by Finger and Mandula using a normal-ordering prescription. The corrected 
gap equation leads to infrared-finite formulas for the effective quark and pion parameters, in which 
integrals for physical quantities converge for an infrared-singular confining potential V c oc q~ A \ we 
present the results of a numerical solution in this case. 


1. Introduction 

Quantum chromodynamics (QCD) is widely accepted as the candidate theory of 
strong interactions. Asymptotic freedom allows a perturbative treatment of QCD at 
high energies [1], permitting the theory to be tested in such processes as e + e~ 
annihilation and electroproduction. Unfortunately, the bound state spectrum of 
QCD is still an unsolved problem. Progress in this direction has been made via 
Monte Carlo techniques in lattice gauge theories. However, the inclusion of fermions 
on the lattice is still problematic, particularly for massless fermions [2]. 

A pertinent question to ask of QCD with massless fermions is whether chiral 
symmetry is spontaneously broken, and by what mechanism. Formal arguments 
using the ’t Hooft anomaly conditions [3] indicate that chiral symmetry in QCD 
must be broken in the Nambu-Goldstone mode. However, these arguments are 
essentially kinematical in nature, and the detailed dynamical mechanism of chiral 
symmetry breaking in QCD remains elusive. 
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A popular approach to the dynamics of chiral symmetry breaking is to write down 
a gap equation for the generated mass 1 9 which is then discussed analytically in 
the linearized approximation [4]. Usually this is done in the Landau gauge, where 
there is no wave function renormalization, yielding a gap equation which is finite 
without renormalization. However, in Landau gauge it is difficult [4] to implement 
renormalization group corrections to the gluon exchange potential. An alternative 
approach, pioneered by Finger and Mandula [5], is to construct the gap equation 
in Coulomb gauge. One advantage of using Coulomb gauge, which motivated the 
work of ref. [5], is that the Coulomb propagator corrections give the complete QCD 
^-function, This permits a simple implementation of renormalization-group- 
improved perturbation theory. A further advantage of Coulomb gauge, which we 
will exploit in sect, 3 below, is that the gap equation can be derived by making the 
same approximation to the quark-antiquark Bethe-Salpeter kernel as is used in the 
phenomenological analysis [6] of charmonium energy levels. This permits the use 
of phenomenological static potentials in the study of chiral symmetry breaking. 
However, a nontrivial problem with the Coulomb gauge is that, in this gauge, the 
wave function renormalization is infinite, and the gap equation requires renormaliz¬ 
ation. 

At first sight it may seem rather strange to use the non-Lorentz-covariant Coulomb 
gauge, which singles out a preferred rest frame, to study a problem involving massless 
quarks. However, as seen below, as a result of dynamical chiral symmetry breaking, 
the effective quasi-particle excitations which arise from solving the gap equation 
have a nonzero mass. Thus, a preferred rest frame is defined: the frame in which 
the quasi-particles are at rest. The existence of this preferred frame gives an a postiori 
justification for the use of Coulomb gauge in setting up the gap equation. 

In constructing the Coulomb gauge gap equation for a pairing-type model, Finger 
and Mandula [5] proceed from a normal-ordered Coulomb gauge hamiltonian. An 
analogous Coulomb-gauge gap equation, based on a non-normal-ordered hamil¬ 
tonian, has been studied by Amer et al. [7]. These pairing models are reviewed in 
sect. 2, using the equivalent formalisms of the Bogoliubov-Valatin transformation 
and the Dyson equation to derive the gap equation. For a pure Coulomb potential, 
the models of refs. [5] and [7] can be shown to correspond to the use of a renormalized 
and of an unrenormalized gap equation, respectively. However, for a general 
phenomenological potential, the two models have qualitatively different infrared 
behavior. For example, the gap equation of Amer et al. exists for a confining 
potential, whereas that of Finger and Mandula does not. Now renormalizations to 
remove ultraviolet divergences should not change the infrared behavior of a theory. 
Thus for a general phenomenological potential, the models of ref, [5] and ref, [7] 
do not simply correspond to the renormalized and unrenormalized versions of the 
same pairing model. There is clearly a paradox here: either the equations of Amer 
et al, do not, in general, give the correct unrenormalized theory, or the equations 
of Finger and Mandula do not in general give the correct renormalized theory. 
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Our primary aim in this paper is to find the correct form of the renormalized gap 
equation in Coulomb gauge. This is done in sect, 3, starting from the renormalized 
Dyson equation for the vector and axial-vector vertex parts. We proceed by making 
the ladder approximation to the quark-antiquark Bethe-Salpeter kernel, which 
introduces phenomenological potentials. Application of the Ward identities unam¬ 
biguously yields the correct form of the renormalized gap equation. From this 
analysis we reach the following conclusions: 

(i) The equations of Amer et al. give in general the correct unrenormalized 
equations. This corresponds to the fact that, since the color matrices are traceless, 
the interaction term in QCD does not require normal ordering. For potentials which 
do not lead to ultraviolet divergences, such as a pure confining potential V c ^q~ 4 , 
V T —0, the gap equation of ref. [7] is the correct one as it stands, 

(ii) The renormalized gap equation differs from the unrenormalized one by a 
polynomial counterterm, as expected from the standard BPHZ renormalization 
algorithm. The Finger-Mandula normal-ordering prescription does not correspond 
to a polynomial counterterm. Thus, it is incorrect, except for the case of a pure 
Coulomb potential. 

A second principal objective of this paper is to study the infrared properties of 
the corrected gap equation. In sect. 4 we summarize an analysis of pion properties 
in the pairing model given recently by Govaerts, Mandula and Weyers [8], We show 
that, when used in conjunction with the unrenormalized or correctly renormalized 
gap equation, all physical quantities are infrared-finite. That is, even for an infrared 
singular confining potential V c oc^“ 4 9 all physical quantities are given by infrared- 
convergent integrals as a result of detailed cancellations between singular terms in 
the integrands. Furthermore, the infrared cancellations are shown to arise in a 
general way from the operator structure of the infrared singularity in the interaction 
hamiltonian. This feature is illustrated by a numerical example in sect. 5, where the 
gap equation and the pion vertex equation are solved numerically for the case of 
a pure q~ 4 potential. 

To recapitulate, the plan of this paper is as follows. In sect. 2 we review the 
pairing models of refs. [5] and [7] and examine alternative renormalization prescrip¬ 
tions. In sect. 3 we determine the correct form of the renormalized gap equation by 
studying the Dyson equation for the renormalized vertex part. We demonstrate in 
sect. 4 the infrared finiteness of physical quantities when the correct gap equation 
is used. In sect. 5 we numerically solve the gap equation for the case of a confining 
potential. Finally, we summarize our findings in sect. 6. 

2. Coulomb gauge pairing model 

There are two equivalent approaches to setting up the Coulomb gauge pairing 
model, both of which are used in the paper of Finger and Mandula. The first is to 
make a Bogoliubov-Valatin transformation to a vacuum containing a qq condensate. 
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This is then optimized variationally, to give an equation (the gap equation) for the 
condensate wave function ^(p). The second is to use the Dyson equation for the 
quark propagator, in the Hartree approximation, to set up an equation for the quark 
proper self-energy .£(p). This is the gap equation rewritten in a different notation. 
In this section we outline both methods, following the notation of Finger and 
Mandula, but without normal-ordering the interaction term in the hamiltonian. 
The Coulomb gauge effective hamiltonian for a single* quark flavor q is 

H eS =qy (-iV)q-2irZ qy 0 ^ a q^qy 0 ^ a q- (2.1) 

a V 

The summation is over the color index a, and a = g /4ir can be taken to be either 
a constant or (as we shall do below) a running coupling a = a(—V 2 ). We wish to 
minimize over trial states containing a coherent superposition of qq pairs. The 
trial wave function is taken to be 

n [l-^(p)rft (a)t (p, S )cf (a)t (-p, S )]|0), p = \p\, (2.2) 

rs yW) p, s, a 

with b (a)t (p, 5 ) and d ia)t (p 9 s) the creation operators for a quark and antiquark 
with momentum p, color index a and helicity s. In eq. (2.2) r is the volume of an 
elementary cell in momentum space, and ty(p)= &*(p) is the momentum-space 
pair wave function. The normalization factor N(W) is determined such that 

(^1^=1, (2.3a) 

to give 

N('P) = n >/1 +'P(p) 2 - (2.3b) 

p,s,a 

The calculation of matrix elements in the vacuum state | W) is facilitated by making 
a Bogoliubov-Valatin transformation to an operator basis which annihilates \ ty), 
as described in appendix A. For the equal time quark Feynman propagator we obtain 

('P\lq a (x,0),q l3 (y,0)]\V)= f 7 ^ 5 e*[^S ( 4 ) (p,p 0 ) 

J (2.7T) J 2 77 

d 'p C »(.-J ^(p) 1 1~^ 2 (p) 

. (2tt) 3 Li + v 2 (p) 2 \ + v 2 (p) y ' p \ al3 ’ 

(2.4) 

where p is the unit vector p/|p|. This enables us to obtain the following expression 
The generalization of the analysis to more than one quark flavor is easy. 



R52 


639 


S.L. Adler, A.C. Davis / Chiral symmetry breaking 


473 


for the expectation of H eff in the state | W), 


5 3 (0) 


d 3 P 


12 pV 2 (p) 
1 + 'P 2 (p) 


77 


,3 j3 ^«(|P-«| 2 ) 


d 3 p tfq 


p-q 


np)V(q) 


+ 


V\p) 


i V ^ (q) 


LV +'p 2 (p)][l +'p 2 (q)] 

1 


1 

4 


1 +V 2 (p) 2 /\l+^ 2 (q) 2 , 


A. 

\P ' 9 


(2.5) 


The optimal condensate vacuum is obtained by minimizing ( X P\H\ X I') with respect 
to ¥(p). 


S'P(p) 


(V\H cS \'I') = 0, 


( 2 . 6 a) 


which gives the “gap equation” 


P*(p) = 


1 


3 77 2 J 


d 3 ^ 


a{\p - q\ 2 ) nq)[ 1 “ V\p)Y~ *(p)[l - V 2 (q)]p ■ q 


|p — q 


1 + 1 f f 2 (q) 


( 2 . 6 b) 


An alternative derivation of the gap equation is obtained from the Dyson equation 
for the quark propagator. Making a non-relativistic ansatz for the proper self-energy 
part 2, 


I = I(p)=pA(p)+y pB(p) , 


the propagator can be written as 


S l4 \p, Po) = 


1 


yoPo -yp-Z 


7oPo-y • p[l + B(p)]+pA(p) 

Po~<»(p) 2 


0){p) = ps]A 2 (p)+\\ +B(p)f . 


After integration over p 0 this gives 


s m (p) — 


dpo .„(4w s pMp)~pU+ B (p)]y p 

— iS w (p, po) =- 


77 


2co(p) 


and comparing eqs. (2.8) and (2.9) with eq. (2.4) we see that 


A(p) 


Ja 2 (p)+v+b(p)Y 


2 np) 

1 + 'p 2 (p) 

1 -v 2 (p) 

Ja 2 (p)+v+b(p)Y i + v 2 (p) 


1 +B(p) 


= sin 2 -&(p), 


= cos 2 i J(p), 


(2.7) 


( 2 . 8 ) 


(2-9) 


( 2 - 10 ) 
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with &(p) the rotation angle of the Bogoliubov-Valatin transformation defined in 
eq. (A.4). In the Hartree approximation the Dyson equation for E is 


2(p) = 


3tt J 


d 3 q 


a(\p~q\ 2 ) 

p-q I 2 


y 0 S O) (q)y 0 , 


( 2 . 11 ) 


which, on substitution of eq. (2.9), yields separate equations for A(p) and B(p), 


P A(p) = 


377 J 


d 3 q 


a(\p-q\ 2 ) V(q) 


P~q I 2 1 + V 2 {q)’ 


pB{p)=—i 
3 77 J 


d 3 q 


a(\p~q\ 2 ) 1 1 ~ V 2 (q) 

\p-q\ 2 2 1 + V 2 (q) 


p * q 


(2.12a) 


(2.12b) 


When eqs. (2.12a) and (2.12b) are substituted into the identity 

p'P(p)=K i - v 2 (p)]pMp)- x P(p)pB(p ), 


(2.13) 


we obtain again the gap equation of eq. (2.6b). 

Eqs. (2.1)—(2.13) are equivalent to the analysis of Amer et al. [7] who, as discussed 
in sect. 1, proceed from the non-normal-ordered Coulomb gauge hamiltonian. The 
calculation of Finger and Mandula [5] instead starts from a hamiltonian analogous 
to that of eq. (2.1), but normal-ordered with respect to the perturbative vacuum |0), 
which yields the following results. For the gap equation, Finger and Mandula obtain 


pV(p) = 


1 


7T J 


,M\P~q\ 2 ) nq)[l-V 2 (p)]+2p> qnp)V 2 (q) 


d q 


p-q 


1 + V 2 (q) 

(2.6b') 

which, in the Green function approach, corresponds to the “Dyson-like” equation 


2(p) = 


3 77 J 


d 3 q 


«Qp-g| 2 ) 

P~q 2 


y 0 [S° ) (q)-S { o ) (q)]y 0 , 


( 2 . 11 ') 


with S 0 the free zero-mass propagator 

S ( o 3 \p)-' = -h-P- (2.14) 

Since pairing is a low-momentum effect, the condensate wave function is expected 
to vanish rapidly at high momenta, 


V(p) 


p~* CO 


-> 0 . 


(2.15) 


Consequently eq. (2.11) and eq. (2.IT) exhibit very different high-momentum 
behavior. For a Coulomb potential 

a (\p~q\ 2 ) =const, (2.16a) 

or for an asymptotically free Coulomb-like potential 

«(l P ~q\ 2 ) = const/log [|p - q\ 2 / A 2 ], (2.16b) 
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the integral in eq. ( 2 . 6 b') converges at high momenta as a result of the overall factor 
*P(q) in the integrand, while the integral in eq. ( 2 . 6 b) has a high-momentum 
divergence given by 

divergent part of eq. (2.6b) = (Z — 1 )p'P(p ), 


where 


Z- 






(2.17) 


Hence, the Finger-Mandula gap equation is ultraviolet-finite, while that of Arner 
et al. has an ultraviolet divergence given by eq. (2.17). However, since the divergence 
in the Amer et al. equation is proportional to the kinetic term on the left-hand side 
of the gap equation, it can be eliminated by the standard renormalization procedure 
of adding a counterterm 


AH = (Z — l)qy * (-iV)q (2.18) 

to the hamiltonian of eq. (2.1). This gives an alternative subtraction scheme to the 
one used by Finger and Mandula. Taking the difference between eq. (2.6b') and eq. 
( 2 . 6 b) gives 


eq. (2.6b')-eq. (2.6b) = [Z(p) - l]p^(p ), 


Z(j>)-i- 


1 1 


p 3tt J 


l3 ot(\p-q\ ) A A 
d q - -p - q 

p-q 


(2.19) 


Thus, the two alternative schemes are equivalent only in the case of a pure Coulomb 
potential (eq. (2.16a)), for which Z(p) in eq. (2.19) reduces to a constant. That the 
Finger-Mandula subtraction scheme corresponds to use of a momentum-dependent 
renormalization constant leads one to suspect that it is incorrect. This conclusion 
is confirmed in the next section, where eqs. (2.1)—(2.13) and (2.18) are derived, via 
the Ward identities, from the Dyson equations for the renormalized vertex parts. 


3. The renormalized Coulomb gauge gap equation* 

In the theory of superconductivity, the gap equation is part of a more general 
system of equations for the electron propagator and the electron-photon vertex part 
[9], in which the Ward identity is satisfied. Hence, in choosing between alternative 
subtraction schemes for the pairing model of sect. 2 , it is natural to proceed in an 
analogous fashion, using the Ward identities to derive the renormalized gap equation 
from suitable approximations to the vertex parts. Specifically, we start from the 
renormalized Dyson equation for the vector and axial-vector vertices, making the 

* In this section, we use q and p to denote four-vectors, whereas in all other sections of this paper, 
the momenta q and p denote the three-vector magnitudes q = |fl|, p = \p\. 
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ladder approximation that the Bethe-Salpeter kernel depends only on the momentum 
transfer. Consistent with this approximation, we exclude quark annihilation graphs, 
so that there are no anomalies, and neglect terms arising from the non-commutativity 
of color matrices on the quark lines, so that we can use the Ward identities of QED 
(rather than the more complicated Slavnov-Taylor identities of QCD) and can omit 
all color indices. 

Our analysis therefore proceeds from the following equations for the renormalized 
vector and axial-vector vertex parts [10], 


r ii (p',p)8y = (Zy ti ) S y + 


H/8y 


d 4 q 

(2tt)' 


[iS F (p' + q)fjp' + q, p + q)iS' F (p + q)] pa 


x K ae yS (p + q,p' + q, q), 

d 4 q 


r „s(p',p)sy = [(Zyjysisy + 


8y 


(2tt)' 


[iS’ F (p' + q)r ii5 (p' + q,p + q)iS F (p + q)] e 


a 


XK a0,y8(p +<l,p’ + q, q) ■ (3.1) 

* 

In eq. (3.1) Sp is the full renormalized quark propagator, K a ^ y8 is the renormalized 
quark-antiquark Bethe-Salpeter kernel, and ( Zy is a shorthand for 


(ZyJ 


Zo 7 o, fx —0 
Zjj> M =7 = 1,2,3. 


(3.2) 


We are thus anticipating the fact that, if we make a non-covariant approximation 
to the ultraviolet tail of K , the fx = 0 and /x = 1,2,3 components of the vertex can 
have different renormalizations. Since anomalies have been excluded, the vector 
and axial-vector vertex parts have the same renormalization constants [11], and 
satisfy the Ward identities 

( P'-prw>p)=s'i 1 ( P ')-s' F i (p), 

(p 1 -prCsip'^p) = JsS'f'ip) + SiTWy, • (3.3) 


Let us now make our fundamental approximation. This is to assume that the 
Bethe-Salpeter kernel K a ^ y8 (p +<?, p' + q 9 q) is a function only of the momentum 
transfer q , with Lorentz vector couplings* on the quark and antiquark lines, 


^ a@,y8 (p + q,p' + q, q) ^a^,y8^,q ) 

- -4iri(y 0 ) Sfi (y 0 ) ay %V c (\q\) 


-47ri(y / )^(y fc ) a7 ^ J . fc -^j|V T (^. (3*4) 


* The introduction of scalar couplings would lead to explicit (as opposed to spontaneous dynamical) 
breakdown of chiral symmetry. 
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This approximation is the usual one made in potential theory treatments of heavy 
quark bound states. In lowest-order perturbation theory, V c and V T are given 
respectively by single Coulomb and single transverse gluon exchange, 


q 


a 


= 2 

<?o q 


(3-5) 


In higher loop order, renormalization group improvements make a in V c a running 
function of q at high momentum, while multiple Coulomb gluon exchange [12] 
produces an infrared-singular confining contribution to V c at low momentum. For 
example, corresponding to the frequently used [6] coordinate space potential 


V(r) 


err 


4 a 
3 r ’ 


(3-6) 


the momentum-space potential (defined as the Fourier transform of V(r) with a 
factor —divided out) would be 



2k 

(*¥’ 



(3.7) 


with a in eqs. (3.6)-(3.7) either a constant or a running logarithmic function of the 
coordinate or momentum, and with the term proportional to k the confining 
potential. 

Before using specific assumptions about the form of V c and V T , we first carry 
the analysis as far as possible using only the approximation to the form of K given 
in eq. (3.4)*. Substituting eq. (3.4) into eq. (3.1), eq. (3.1) into the Ward identities 
of eq. (3.3), and using the Ward identities a second time to rearrange the integrands, 
we obtain from the vector vertex equation 


(p’-prfjp’,p) = s' F - i (p')-s' F -‘(p) 


?/-i. 


= (Zy J(p'- P y + 


d 4 q 

“ r:o// ' 


(2tt)‘ 


[iS' F (p' + q) 


x(S' F 1 (p' + q)-S f \p +q))iS' F (p + <l)]i} a Kp,...(q ) 

d 4 9 


= (Zy>'" + 


— (Zy„ )p M — 


(2tt) 4 

d 4 ? 


S'f (p + y) pc,kap,...(q) 


(2 7 t)‘ 


S'f(p q)pc,kap,...(q) ■ 


(3-8) 


* The derivation of eqs. (3.8)-(3.12) is in fact independent of the choice of gauge, since it uses only 
the assumptions that k depends solely on the momentum transfer q and has vector couplings on the 
fermion lines. 



644 


___ Adventures in Theoretical Physics _ 

478 S.L. Adler , AC. Davis / Chiral symmetry breaking 

Similarly, from the axial-vector vertex equation we get 
( p'-p)%s(p',p) = y 5 S' F - 1 (p) + S' F - 1 (p’)y 5 

= (Zy fi y s )(p’-pY + -^-~^[iS' F (p' + q) 

X(ysS f F (p + q) + S F } (p r + q)ys)i$F(p + #)]/3a£a/3,...(#) 

d 4 q 

= 7s (Zy tt )p ,i + 7 z—^S , F (p+q)f}ak a f } ,A<l) 

L J ( 2 tt) J 

f d 4 a 

+ (Zy^)p^ + —^ 5 f ( p / + q)p a K&Sq) 7s , 

(3.9) 

where the Lorentz vector structure of k has been used in anticommuting the y 5 ’s 
to the outside on the right-hand side. Because the p and p dependence in eqs. (3.8) 
and (3.9) has separated, we deduce that the renormalized propagator must satisfy 

f d 4 a 

S'f 1 (p) = (Zy^p* + ' 4 S F (p + q)p a k a &„(q) • ( 3 - 1 °) 

J \Z 7T) 

Note that the presence of an additive constant (a kinematical mass term) in eq. 
(3.10), which would be allowed by the vector Ward identity of eq. (3.8), is excluded 
by the axial-vector Ward identity of eq. (3.9)1 Introducing the self-energy 2 by writing 

S f F~\p) = y^~Z( P ), (3.11) 

we see that 2 must satisfy the integral equation 

' d 4 

2(p)s y = [y» - (Zy^)] Sy p^ - —^ S F (p + q)p a k a ^ y8 (q ). (3.12) 

J (27r) 

This is the renormalized gap equation corresponding to the ladder approximation 
to the Bethe-Salpeter kernel. 

To make contact with the non-relativistic pairing model of sect. 2, let us now 

make the further approximation of neglecting the transverse gluon exchange term 

<%> 

in k. Thus, k becomes 

% 

k a ^ y b{q) = —• 4 , n'*( To) 5 / 3 ( 70)073 ^c(lfll) • (3.13a) 

Further, let us assume that Z(p) is given by the non-relativistic ansatz 

Z = S(p) = \p\A(\p\) + 7 • pB(\p\) . (3.13b) 

When eq. (3.13) is substituted into the integral equation for the fx =0 component 
of the vector vertex at zero momentum transfer, this equation simplifies dramatically. 
It is solved by 


ro(p,p) = Z 0 y 0 . 


(3.14) 
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To verify this, we note that when both eqs. (3.13) and (3.14) are substituted into 
eq. (3.1), the ^-integral 



[*Sf (p + q)r 0 (p + q, p + q)iSr(p + q)]p a K a p y8 (p + q, p + q, q) 


(3.15) 


reduces to 


i 



d 3 qV c (\q\)I 9 


1 = 



To^FTo^Fyo, 


S'F = b/o(<Io+Po)-y‘ (q+p)(l+B)-\q+p\A] \ (3.16) 

Evaluation of the ^-integral shows that / = 0. Since asymptotic freedom requires 
r 0 (p,p) to approach y 0 at large momentum, we conclude that, in the Coulomb 
gluon exchange model. 


Z 0 =\ . (3.17) 

Substituting eqs. (3.13) and (3.17) into eq. (3.12), the gap equation then simplifies 
to 


^(p) = (2-l)r7+n I d 3 9 Vc(|p-g|)y 0 S (3) (g)y 0) (3.18) 

3 77 J 

which, when reexpressed in terms of by following the analysis of eqs. (2.10)—(2.13), 
takes the form 


\ P \n\p\)=(i-z)\p\n\p\) 

i 


+■ 


77 


d Q V c (\p-q\) 


nwu - - *\mp • q 


l + v 2 (\q\) 


(3.19) 


When the obvious identification 


V'c(|p-,l)= 5 f^ (3.20) 

\p-q\ 

is made, eq. (3.18) (eq. (3.19)) is identical in structure to the gap equation of eq. 
(2.11) (eq. (2.6b)), apart from the addition of a renormalization counterterm with 
precisely the form derived from AH of eq. (2.18). Thus, our rederivation of the gap 
equation from an analysis of the vertex parts shows that the standard renormalization 
algorithm, rather than the Finger-Mandula normal-ordering prescription, gives the 
correct method for eliminating ultraviolet divergences. 
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4. Infrared finiteness of physical quantities 

Having established the correct renormalization prescription, let us now consider 
the infrared behavior of the gap equation. As mentioned briefly in sect. 1, the 
Finger-Mandula gap equation (eq. (2.6b')) has very different infrared behavior from 
that of the Amer et al. unrenormalized gap equation* (eq. (2.6b)) and of the correctly 
renormalized gap equation (eq. (3.19)). Specifically, as p^q the integrand of eq. 
(2.6b') simplifies to 

Vc(\p-q\)V(p), (4-la) 

while the integrand of eqs. (2.6b) and (3.19) reduces to 

V c (|p - q\)[p * (q ~ P ) W(p) +0 ((q~p) 2 )] . (4.2a) 

Hence, when V c is an infrared-singular confining potential diverging as \p~q \~ 4 as 
p^q, the ^-integral in eq. (2.6b') behaves as 

d 3 q\p-q\' 4 ^(p) (4-lb) 

and diverges, while the corresponding integral in eqs. (2.6b) and (3.19) behaves, 
after angular averaging, as 

d 3 q\p ~ q\~ 4 0((q -p) 2 ) (4.2b) 

and converges. Thus , the corrected gap equation and the pair wave function *T exist 
for a confining potential . We will refer to and to any other quantity in the pairing 
model which exists for a confining potential, as being “infrared-finite.” 

A second feature of the correctly renormalized gap equation, related to its 
infrared-finiteness, is that it is invariant in form when the coordinate-space potential 
is shifted by a uniform constant, 

V(r)-> V(r)—^irC. (4.3a) 

In terms of the momentum space potential of eq. (3.7), the transformation of eq. 
(4.3a) takes the form 

Vc(\p-q\)+V c (\p-q\)+(2ir) 3 C8 3 (p-q). (4.3b) 

This is obviously an invariance of eqs. (2.6b), (3.19) and (4.2a), since the coefficient 
of V c (\p — q\) vanishes at p = q in these equations. For reasons explained below, 
we expect physical observables very generally to be invariant under the transforma¬ 
tion of eq. (4.3). This, in turn, means that in the formulas for all physical observables, 
V c must appear multiplied by a coefficient which vanishes at p = q. So by the angular 

* Strictly speaking, Amer et al. [7] give only the linearized form of eq. (2.6b). The full gap equation, 
and the observation that is infrared-finite, appear in a subsequent paper by Le Yaouanc et al. [13]. 


R52 


647 


S.L. Adler , A.C. Davis / Chiral symmetry breaking 481 

averaging argument used in eq. (4.2), we conclude that all physical observables are 
infrared finite. In the remainder of this section, we will illustrate this general statement 
by an explicit, case-by-case examination of various physical observables which can 
be formed in the pairing model. 

Let us begin with the quasi-particle energy co(p). According to eqs. (2.8), (2.10), 
(2.12a) and (3.20), this can be written as 


a>(p)=- 


pA(p) 


1 


sin 2 &(p) 37r J 


d q V c (| P~q\)~ 


sin 2 &(q) 


sin 2 &(p) 


1 


3 77 J 


d 3 ? V c {q) 


sin 2‘d(\p — q\) 
sin 2 &(p) 


(4.4) 


Under the shift in potential of eq. (4.3b), co(p ) changes to 

a>(p)^ <o(p) +*ttC . 


(4.5) 


Hence the quasi-particle energy is not a physical observable and is clearly not 
infrared-finite*. However, since the shift in co in eq. (4.5) is a constant, the excitation 
energy co(p) — co( 0) is a physical observable, and is given by either of the two 
equivalent infrared-finite formulas, 


(o(p)-(o{ 0) = 


77 J 


d 3 q V c (q) 


sin2#(|p-g|) 
sin 2 &(p) 


-sin 2 &(q) 


(4.6a) 


(o(p)-(o(O) = <o(p)-<o( 0), 


co 


(p) = 


1 


77 


^q V c (\p - q\) — 

Lsin 2 v{p) 


sin 2 &(q) 


(4.6b) 


In writing eq. (4.6a), we have used the fact that the gap equation of eq. (3.19) 
implies that 


^(0) = l=>sin 2tf(0) = 1 . 


(4.7) 


The formula in eq. (4.6b) has advantages in numerical work, since sin 2# appears 
only in the radial integral over q , not in the angular integral over q. 

Next we consider the quark condensate (««) for a single quark flavor u. Following 
ref. [5], this can be evaluated in terms of the propagator S 0) (q) 9 


(uu)=('t r \qq\'t r ) = 38 al3 ('I'\^[q l3 (0), ? a (0)]+|{^(0), ^(O)}! 1 ^) 


= -3 



Tr S°\p) = -3 


d 3 p pA(p) 

(2tt) 3 co(p) ' 


(4.8) 


where the color factor, 3, arises from the trace over the suppressed color index. 
Although co(p) and pA(p) are individually not physical observables and are not 

* We thus again differ here with Finger and Mandula, who interpret o>(0) as the quasi-particle mass, 
and circumvent the infrared finiteness problem by using only potentials which are cut off in the infrared. 
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infrared-finite, eqs. (2.8) and (2.10) show that the ratio pA(p)/co(p) can be expressed 
entirely in terms of the condensate wave function and so is infrared-finite, 


pMp) 

w(p) 



^£L=si„2 W , 

1 +'P 2 (p) 


IT 2 J 


CO 


p 2 dp sin 2#(p) 


o 


(4.9) 


Finally, we turn to pion properties in the pairing model. These have been investi¬ 
gated by Govaerts, Mandula and Weyers [8], who use the Bethe-Salpeter equation, 
with the approximated kernel of eq. (3.13a), to compute static pion properties in 
the chiral symmetry limit. Their algebraic calculation is unaffected by changing the 
subtraction scheme for the gap equation from eq. (2.6b') to eq. (3.19), so we only 
quote their final results. These are summarized by the formulas 


2 m q 


(uu), 


-Nf„ = N 2 = 3 



sin 2 'd(p) 


Hp) 

co 2 (p) 


(4.10a) 

(4.10b) 


In eq. (4.10) fx„ is the pion mass, m q the constituent quark mass, the pion decay 
constant, and N the normalization of the pion Bethe-Salpeter wave function. The 
pion vertex part form factor, P(p), satisfies the integral equation 


P(p)=pA(p) 



d 3 q— [ pA(p)qA(q)+p • qC(p) C(q)]P(q), 
a) (q) 


C(p) = 1 +B(p) 


Introducing the abbreviated notation 


, x Hp) 

S(P) 2 


CO 


(p)’ 


Eq. (4.10b) is equivalent to 


-N=/ w = 


L2tt 2 J 


CO 


p 2 dp sin 2d(p)g(p) 


0 


1/2 


(4.11) 


(4.12) 


(4.13) 


and will be infrared-finite if g(p) is infrared-finite. Substituting eq. (4.12) into eq. 
(4.11), dividing by <*>(p) and using eq. (2.10), the equation determining g(p) can 
be written as 


g(p)“>(p) = sin 2&(p) + 



d 3 q V c (\p~q\) 


x[sin 2-&(p) sin 2ft(q) +p ■ q cos 2ft{p) cos 2&(q)]g(q). (4.14) 



R52 


649 


S.L . Adler, A.C. Davis / Chiral symmetry breaking 


483 


To show that eq. (4.14) is infrared-finite, we express the unrenormalized* gap 
equation of eq. (2.6b) in terms of tf(p), 

1 


p sin 2#(/>) = „ 2 


77 J 


d 3 ? v c (\p-q\) 


x[sin 2&(q) cos 2 i d(p) — p • cos 2 r &(q) sin 2#(p)], (4.15) 


and use this to rewrite eq. (4.4) as follows, 

1 


m(p)= 


3 77 2 J 


d 3 ? Vcflp-fll) sin 2^(^)fsin 2&(p) 

\ si 


cos 2 2&(p) 
sin 2-&(p) 


cos 2 &(p) \ . „ N 1 

—— „ . I p sin 2w( p) -*■- 

sin2#(p)r 3 


7I- 2 J 


d 3 <7 V c (\p-q\) 


x[sin 2&(q) cos 2i J(p) —p ■ q cos 2-&(q) sin 2#(p)] 


= p cos 2#(p) 


377 J 


d 3 ? Vcdp-^l) 


x[sin 2#(p) sin 2 d(q) + p-q cos 2-&(p) cos 2i?(^)]. (4.16) 

Substituting eq. (4.16) into eq. (4.14), the equation determining g(p) finally becomes 


g(p)p cos 2#(p) = sin 2i?(p) 


3tt 2 J 


d 3 ? V c (|p-,|) 


x[sin 2&(p) sin 2i J(q) +p • q cos 2-&(p) cos 2#(g)] 
x[g(9)-g(p)], (4.17) 

which is manifestly infrared-finite. 

A useful insight into the infrared-divergence structure of the pairing model is 
obtained by writing the propagator of eq. (2.8) in the form 

R + (p) R-(P) 


S (4 \P, Po) = 


Po- "(0) “ Mp) “ «(0)] po + w(0) + [o,(p) - w(0)] ’ 


(4.18a) 


with the residues f? ± (p) given by the infrared-finite expressions 


K ± (p) = r 


1 

2 


7o± 


7o± 


pMp) 

"(p) J 

2 np) 

i+^ 2 (p>j 


• p 


=f 5 7 • p 


1 +B(p) 
a>(p) 

1 l-^ 2 (p) 


P l+^ 2 (p) 


= l[yo±sin2tf(p)]=Fyy • p 


cos 2 , &(p) 
P 


(4.18b) 


* Use of the renormalized gap equation of eq. (3.19) in the following analysis would add a polynomial 
counterterm to eqs. (4.16) and (4.17), but would not change their infrared structure. 



650 


Adventures in Theoretical Physics 


484 S.L. Adler , A.C. Davis / Chiral symmetry breaking 

The only infrared-divergent quantity in eq. (4.18) is the term co(0) in the 
denominators. This structure is a reflection of the fact that as a result of confinement, 
an infinite amount of energy is required to create a single quasiparticle state from 
the vacuum. 

The fact that the infrared divergences take the form of eq. (4.18) has a simple 
operator interpretation. To see this, we consider the extension of the hamiltonian 
of eq. (2.1) to a general phenomenological potential V(r), 


H 


d 3 xqy ■ (-iV)q-l 


d 3 x d 3 y 4 + (x)^A a tf(x)3 V(|x-.H)tf T (j 02 A a tf(j 0 . 


\ a 


(4.19a) 


When the interaction term in eq. (4.19a) is rewritten in terms of the momentum 
space potential V c of eq. (3.7), we get 


H = 


d 3 xqy • (-iV)q 


with 


P a (q) = 


—!-3 d 3 qp a (q)p a (-q)v c (\q\), 
(2tt) J 

(4.19b) 

xe-**q\x)±\ a q(x). 

(4.20) 


Hence the infrared-divergent part of the interaction term has the operator structure 



d 3 qV c (\q\)p a (0)p a (0), 


(4.21) 


and so is proportional to the color-squared operator F 2 = p a (0)p a (0). Similarly, 
under the shift in potential of eq. (4.3), the change in the hamiltonian is 

Hh>H+2ttCF 2 , (4.22) 

and is again proportional to the color-squared operator. Because color-squared is 
governed by a superselection rule, any eigenstate if/ of H with energy E and 
color-squared C 2 (ifs) is also an eigenstate of the shifted H of eq. (4.22), with the 
energy shifted to E + 2irCC 2 (if/). By the same reasoning, the linear infrared diver¬ 
gence of eq. (4.21) appears solely as an energy level shift 



d 3 qVc(\q\)C 2 W 


(4.23) 


The contribution of this term to the energy difference <w(0) between the single 
quasi-particle state (C 2 = f) and the vacuum (C 2 = 0) is 


co(0) 



d 3 q Vc(\q\) + infrared-finite , 


(4.24) 


in agreement with eqs. (4.4) and (4.18). Moreover, comparing eqs. (4.22) and (4.23), 
we see that the infrared divergence can be completely eliminated from all quantities, 
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unphysical as well as physical, by making the potential shift of eq. (4.3) with 


C = 


1 


(27r) J 


d 3 tf Vc(lfll), 


(4.25a) 


corresponding to the use of a modified momentum-space potential 


Vc(\q\)^Vc(\q\)~& 3 (q) 


d 3 <7V c (|*|). 


(4.25b) 


By this method, we can get manifestly infrared-finite analogs of eqs. (2.12a,b) for 
A and B , permitting a numerical computation to be carried out without the introduc¬ 
tion of the pair wave function i P. The ideas just outlined will play an essential role 
in an extension of the numerical solution of sect. 5 to include transverse gluon 
exchange, since when retardation effects are included there is no analog of the pair 
wave function ^ and one must work directly with the gap equation in the form 
given in eq. (3.12). 


5. Numerical solution for a pure confining potential 


To verify the infrared-finiteness properties discussed in the preceding section, we 
have solved the Coulomb gauge pairing model numerically for the case of a pure 
confining potential, 



(5.1a) 


Eq. (5.1a) of course does not correctly represent the high-momentum behavior of 
V c , which is dominated by single-Coulomb gluon exchange. However, at high 
momenta the transverse (V T ) term in eq. (3.4) is expected to be as important as the 
Coulomb (V c ) term, and both should be included in any realistic analysis of the 
high-momentum regime and of renormalization effects. We hope to extend our 
analysis later on to include high-momentum components of the Bethe-Salpeter 
kernel. 

Since the potential of eq. (5.1a) does not lead to ultraviolet divergences, we work 
with the unrenormalized gap equation of eq. (2.6b), and with the corresponding 
equation for the pion vertex g given in eq. (4.17). Introducing the angular integration 
kernels 


1(2)(P, ?) = 


d (p- q)V c (\p~q\) = 


4k 


/ 2 2\2 » 
(P ) 


I(d(p, q) 


d (p- q)p ■ q V c (\p-q\) 


-1 


p 2 + q 2 T , ^_ 

P <? 


p + q 
p-q 


2 pq 


(5.1b) 
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Eqs. (2.6b) and (4.17) reduce to one-dimensional integral equations 


pv(p) 



g) 1 fr (g)[ 1 ~ ^ 2 (p)]~ 7 (d(a ^ 2 (g)] 

I + V 2 (q) 


9 


(5.2a) 


g(p)p cos 2&(p) = sin 2tf(p) + 


3 77 J 


1 CO 


0 


d q g 2 [sin 2iJ(p) sin 2-&(q)I m (p, q) 


+ cos 2i J{p) cos 2 ■&{q)I m {p, q)][g(q) -g(/>)]. (5.2b) 


To solve these equations numerically, we use relaxation methods appropriate for a 
nonlinear problem, as discussed in a recent pedagogical review by Adler and Piran 
[14], The equations are reduced to discrete form on a mesh containing node and 
half-node points, with p values on the node mesh and (to avoid singularities of 
/( U 2 ) at p — q) with q values on the half-node mesh. As a function of a particular 
node value ip = *P(pj) 9 eq. (5.2a) clearly takes the form 

^ C , + ~ ( i 1 r>t - C 2 + (l-^ 2 ) C 3 = 0. (5-3) 

with C | i2 ,3 functions of the node values ^(p,), I The basic iteration used for ip 
begins with two Newton iterations of eq. (5.3), starting from the old value of ip as 
the initial guess. The Newton iterations yield an improved value of ip which is then 
used to update the old value through use of the over-relaxed Gauss-Seidel algorithm. 
(The procedure is the same as that used to solve the classical abelian Higgs model 
in ref. [14].) Once W has been iterated to convergence, sin2#(p / ) and cos2#(pj) 
are computed from eq. (2.10) and are substituted into the discrete form of eq. (5.2b). 
This yields a linear discrete problem for the node values g(pj), which is again solved 
by iteration using the over-relaxed Gauss-Seidel algorithm. 

The numerical solution for *P(p) is plotted in fig. 1, using momentum units in 
which 1 = k 1/2 ^350 MeV. For small momentum, ^ decreases linearly as 

W(p) ^ 1 -otp, a »5 . (5.4) 

The value of the small-momentum slope of W can be related to an effective 
quasi-particle mass m*, as follows. Consider first the propagator for a free massive 
Dirac particle, 



7oPo~y ’ p-m 


_ 7oPo~y p+m _ 

[>o-v / P 2 + m 2 ][p 0 + J p 2 + m 2 ] 


R(p) 

- . ■ + analytic at p 0 

Po-'JP +m 



(5.5a) 


R52 


653 


S.L. Adler , A.C. Davis / Chiral symmetry breaking 


487 



Fig. 1. The gap function 'P(p) for the pure confining potential of eq. (5.1), plotted versus momentum 
in units with k 1/2 = 1. The numerical computation employs a jacobian transformation p = v 2 /{ 1 - u), 
with a mesh of 200 points uniformly distributed in the interval 0^ v ^ 1. 


with the residue R(p) at the positive frequency pole given by 


b) _ 7 . P 


2 Vp 2 + 


m 


2m 


+ 0 (p 2 ). 


(5.5b) 


Let us compare this with the propagator in the pairing model as given by eq. (4.18), 


1 


_ R + (p) 

7oPo-y * P~2 Po~co(p) 


+ analytic at p 0 = w ( p ) , 


(5.6a) 


with the residue at the positive frequency pole now given by 


*+(/>) = i 


2 np) 


+ To 


, 11 -V 2 (p) 

27 • P 


2H + '? 2 (p) ' /0 J 21 *p 1 + 'P 2 (p) 

= 1(1 + To) — 57 ' P a +°(P 2 ) • 


(5.6b) 


The small-momentum expansion of eq. (5.6b) clearly has the same form as that in 
eq. (5.5b), with an effective mass m* given by 


m*=-~|x350 MeV^70 MeV . 
a 


(5.7) 


From the solution for p ), we can also evaluate the quark condensate ( uu ) by 
evaluating the integral in eq. (4.9), with the result 

(uw)~ (-95 MeV) 3 . 


(5.8) 
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Fig. 2. The pion vertex function g{p) corresponding to the gap function of fig. 1. 


The numerical solution for g(p) is plotted in fig. 2, again using momentum units 
in which k 1/2 = 1. Using this solution, we can calculate the pion decay constant 
by evaluating the integral in eq. (4.13), with the result 

A-llMeV. (5.9) 

Experimentally, the constituent quark mass m*, the quark condensate { uu ) and 
the pion decay constant have the values 

m* xpt = 300 MeV, 

(^)expt^(-230MeV) 3 , 

/.ex pt ~95MeV. (5.10) 

Thus, the pairing model with a pure confining potential gives values for these which 
are consistently too small. Since the high-momentum part of V c has the same sign 
as the confining term (cf. eq. (3.7)), its inclusion should have the same qualitative 
effect as increasing the string tension k 1/2 , which in the model of eq. (5.1) increases 
m*, (uu) and f„. Hence, improved agreement with experiment is likely to result 
when high-momentum components of the Bethe-Salpeter kernel are included in the 
calculation. 

Finally, in fig. 3 we plot the excitation energy (o(p) — w(0), as calculated from 
the numerical solution using eq. (4.6b). Both the numerical work and analytic 
estimates obtained from eq. (4.6) show that the excitation energy vanishes at small 
momenta as p 2 (up to a possible factor of |log p 2 1 ), and at large momenta approaches 
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Fig. 3. The excitation energy a>(p)-a>(0) corresponding to the gap function of fig. 1. 


the excitation energy for a massless free particle to within a finite additive constant, 


co(p) 


<w(0) 


1 

- *P+T—2 

p — > oo 77 


d 3 tf V c (|fl|)[sin 2#(g) - 1] +o(- 

\P 


(5.11) 


Hence although the Coulomb gauge pairing model is not Lorentz-covariant, the 
high-momentum behavior of the excitation energy in this model nonetheless joins 
smoothly onto a relativistic dispersion law. 


6. Conclusions 

By use of the renormalized vector and axial-vector vertex equations and Ward 
identities we have constructed the correctly renormalized gap equation for chiral 
symmetry breaking in Coulomb gauge QCD. Making the ladder approximation to 
the Bethe-Salpeter kernel relates the chiral symmetry breaking condensate, and the 
other chiral parameters, to the static quark potential. This allows a phenomenological 
potential to be used in the study of chiral symmetry breaking. We have demonstrated 
the infrared-finiteness of all physical parameters, and have elucidated the structure 
of the infrared divergences. Neglecting the effect of transverse gluons we have 
numerically evaluated (mm), and the effective quark mass for a pure confining 
potential, Vc{q) — 1/ q 4 . The values we obtain are rather low compared with those 
deduced from experiment. However, inclusion of the high momentum tail should 
give better results. 

Inclusion of transverse gluons and the Coulomb tail in our gap equation will 
permit a more detailed investigation of the relationship between chiral symmetry 
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breaking and confinement in QCD. Further, extending our analysis to finite tem¬ 
peratures will allow a study of the relationship between the deconfinement transition 
and the chiral symmetry restoration transition. Work on these questions is in progress. 


Appendix A 


BOGOUUBOV-VALATIN TRANSFORMATION 

The Bogoliubov-Valatin transformation for the pairing model relates the operators 
b , d which annihilate |0) to a new basis set B , D which annihilate \ ty), 

1 


B {a \p,s) = 


[b^(p,s)+snp)d wr (-p,s)], 


D {a \p,s) = 


J\ + ^ 2 {p) 

1 

Vi + 'P 2 (p) 


(a)+, 


[d ia \p,s)-s^(p)b w \-p,s)]. 


(a)+. 


(A.l) 


The quark field can be reexpressed in terms of this basis, giving 

d 3 P 


q {a \x,0) = 


M,(p,s) = 


I [ B {a) (p, s)M l (p, s) ^+D^\p, s)M 2 (p, s ) e~n , 


(2tt) 3/2 

1 


(a) + , 


Vi + 'P 2 (p) 


(l+y 0 V)U(p,s), 


M 2 (p,s) = 


Vi + 'p 2 (p) 


(l-yo V)v( P ,s), 


(A.2) 


with U , V helicity spinors which satisfy 


To sV(-p,s)= U(p, s) , 

E U(p,s)U\p,s)=Y, V(p, s) V\p, j)={(l -7 ■ pyo) ■ 


(A3) 


It is often convenient to characterize the Bogoliubov-Valatin transformation by a 
rotation angle d(p) defined by 


sin &(p) = 


V(p) 


vFhFqj) 


cos &(p) = 


1 


Vi + ^ 2 (p) 


(A.4) 
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We give a critical discussion of gap equation models for chiral symmetry breaking, and 
formulate an extended Coulomb gauge model which includes one gluon exchange (but does not 
resolve the problems pointed out in our critique). 


§ 1. Introduction 

Quantum chromodynamics (QCD) is now widely accepted as the theory of the strong 
interactions, and gives a concrete realization of the prophetic idea of Nambu and Jona- 
Lasinio 1) that the pion is an almost massless fermion-antifermion bound state. A number 
of recent pairing model calculations, 2),3) based on approximations to QCD, have shown that 
chiral symmetry breaking, and the accompanying generation of a Nambu-Goldstone pion, 
necessarily occur when the instantaneous potential has a confining piece. However, the 
quantitative results for a phenomenological pure confining potential are not good, leading 
to values for the quark condensate [—< uu >] 1/3 and the pion decay constant f K which are 
too small by factors of 2-5. We show below that assuming a dominant confining 
potential is in fact problematic, since the chiral breaking model requires a Lorentz vector 
instantaneous potential, whereas recent quark spectroscopic data 4) shows that the 
confining potential is predominantly Lorentz scalar. One natural way to try to improve 
the model is by including the leading high-momentum components of the quark-antiquark 
potential, which arise from one gluon exchange and are Lorentz vector in structure. In 
this article we work out (but do not attempt to solve) the equations for this extension of 
the model. In §2 we review the general gap equation formalism, critically discuss 
confining-potential models, and show that including the instantaneous Coulomb potential 
without including transverse gluon exchange is inconsistent. We formulate an extended 
model including both Coulomb and transverse gluon exchange, and give the integral 
equations for this model in §3. In §4 we conclude with a brief discussion. 

§ 2. General formalism and discussion of gap equation 

potential models 

We sketch a method 3) for obtaining the gap equation, including renormalization 
counter-terms, in a general gauge in which the vertex renormalization is not finite. The 
basic idea 5) is to use the Ward identities to derive the renormalized gap equation from 
suitable approximations to the vertex parts. Specifically, we start from the renormalized 
Dyson equation for the vector and axial-vector vertices, making the ladder approximation 
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that the Bethe-Salpeter kernel depends only on the momentum transfer. Consistent with 
this approximation, we exclude quark annihilation graphs, so that there are no anomalies, 
and neglect terms arising from the non-commutativity of color matrices on the quark lines, 
so that we can use the Ward identities of QED and can omit all color indices. With these 
simplifications, the renormalized vector and axial-vector vertex parts satisfy the integral 
equations 

Em(P\ p)s7 — (Zy f j l )s7 J r fff'^y US' F (p' + q) r M (p' + q, P + q) iS' F (p + q)]p a 

xKa0,7s(p + q,p' + q, q), 

f^{p\ P)s 7 ^{Zy fi y^)s 7 J r f jf^y [iS'Ap' + q) t^{p' + q, P + q)iS' F (p + q)]p a 

:xKawip + q , p' + q , q) (1) 

with S 'f the full renormalized quark propagator and with K a ?, 7 s the renormalized quark- 
antiquark Bethe-Salpeter kernel. Since anomalies have been excluded, the vector and 
axial-vector vertex parts have the same renormalization constants, and satisfy the Ward 
identities 

(p'-prrAP\P) = s' F 1 (p')-s' F ~ l (p) t 

{p'-prr^{p\p) = rS F ~ 1 {p)+s' F ~ 1 {p')7^ (2) 

We now make the fundamental approximation of assuming that the Bethe-Salpeter 
kernel Ka^sip + q> p' + q, q) is a function only of the momentum transfer q , 

Kaft,7s{p + q, p' + q , q)^ kap, 7 s{q), (3a) 

and has Lorentz vector couplings on the quark and antiquark lines so that there is no 
explicit breaking of chiral symmetry, 

k ap, 7 's{q) ( 7s) 7'7=-( 7s) aa' ka'P, 7 s{q) , 

kaP,7S'{q){7h)s's= -{ 7 s)w'kaft', 78 {q). (3b) 

Substituting Eq. (3) into Eq. (1), further substituting Eq. (1) into the Ward identities of 
Eq. (2) and using the Ward identities a second time to rearrange the integrands, we find 
that Eqs. (1) ~(3) imply that the renormalized quark propagator must satisfy the integral 
equation 

S' f ~ 1 (P) = (Zym)P m + J^y S' F {p + q)pakapr'{q) . (4a) 

Introducing the self-energy £ by writing 

S / F ~ i (p) = 7Mp M -£(p) i (4b) 

and writing out all Dirac indices explicitly, we see that £ must satisfy the integral 
equation 

£(P)s7= [Ym~ (Z7m)\s7P M ~ f(P + q) Pakap,7s{q) . 


(5) 
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This is the renormalized gap equation corresponding to the ladder approximation to the 
Bethe-Salpeter kernel. 

As discussed in Ref. 3), if one assumes that ka^rsio) contains only an instantaneous 
potential 


kapasiq) = —4xi(ro) sAro)ar-irVc(\q\) y 



then Eq. (5) is readily reduced 21,31 to the gap equation for a non-relativistic pairing model. 
If one takes Vc to be the pure confining potential 



<7=string tension, 



the gap equation has a non-trivial solution, indicating a spontaneous breakdown of chiral 
symmetry; but as noted above, use of the phenomenological value tf 1/2 = 400MeV gives poor 
results for the parameters characterizing chiral symmetry breaking. However, there is a 
serious inconsistency in assuming the gap equation to be dominated by a 
phenomenological confining potential. The derivation of the gap equation requires the 
instantaneous potential to have the Lorentz vector couplings of Eq. (6), since a Lorentz 
scalar instantaneous potential 


kafi,r*(q) = -4^f(l) w (l)ary Vs(lal) 



would manifestly break chiral symmetry. On the other hand, experimental data on heavy 
quark spectroscopy 41 shows that the confining potential in heavy quark systems is 
predominantly Lorentz scalar. Moreover, theoretical arguments 61 based on the behavior 
of electric flux tubes also suggest that in the phenomenological confining potential 

V{r) = ar~Y~j , (9) 


only the Coulombic piece -(4/3 ){a/r) is Lorentz vector, while the-confining piece or is 
Lorentz scalar. We conclude that there is a puzzle here, and quite likely an indication 
that the approximations leading to the gap equation are not valid for the confining part 
of the potential. 

To get a possibly more realistic model, within the framework of the approximations 
leading to Eq. (5), one should clearly include the Coulombic piece (which is Lorentz 
vector) in Vc, by replacing Eq. (7) by 



However, as shown in Ref. 3), Eq. (10) leads to the non-covariant counter-term structure 


“Zr/’=r U = 0 ’ ( 11 ) 

Zyj, fjt = j = 1,2,3 , Z = log divergent, 

whereas the 0 and j components of the vertex are expected to have the same logarithmic 
divergence even in a non-covariant gauge. Thus, Lorentz covariance of the counter-term 
requires inclusion of transverse gluon exchange along with the Coulomb gluon term, by 
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writing 

k aP,7S k ap,7s(, Q) T k ap,7s(q) , 

kle, r s(q) = -4xi( 7j) m( 7 k) «■( Sjk --^pr-)y VY (<?), 

va « )= ^A¥- ™ 

A simple calculation shows that when Eqs. (12), (10) and (6) are substituted into the gap 
equation of Eq. (5), all divergences are removed with a covariant counter-term with 

Z=l jp" jd 3 q-^~^p+finite . (13) 

One further point must be addressed to formulate a model with Coulomb gluon 
exchange included. The coordinate-space potential of Eq. (9) is undetermined up to an 
additive constant, and this corresponds to the freedom to add to the Fourier transform V c 
a multiple of S 3 (q ). We will choose this multiple so that 1/ (q 2 ) 2 is replaced by 

[1/(q 2 ) 2 ]reg = 1/ (q 2 ) 2 ~ S 3 (q) fd 3 q'/(q' 2 ) 2 , (14a) 

which by construction satisfies 

/><?[1/(q 2 ) 2 ]reg= 0. (14b) 

The infrared subtraction in Eq. (14) guarantees that the confining piece represents a linear 
potential which vanishes at r — 0; without the subtraction, the Fourier transform of 
Eq. (10) gives a linear potential which equals a linearly divergent constant at r~ 0. Use 
of Eq. (14a) yields a gap equation which is manifestly infrared finite, even when retarded 
potentials are included. [When retarded potentials are neglected, Eq. (14a) makes the A 
and B functions of Ref. 3) infrared finite, whereas if one uses Eq. (7), only the gap function 
W (which has no analog in the retarded case) is infrared-finite.] Of course, given the 
problems with the Lorentz structure of the confining potential, one can question whether 
it should be included at all! If it is omitted, and only the one-gluon exchange potential is 
retained, Coulomb gauge is a poor choice: For the pure gluon-exchange problem, it has 
long been known that the retarded integral equations take a much simpler form in a 
covariant gauge, 7) especially in Landau gauge where the vertex renormalization Z is 
finite. 


§ 3. Equations of the retarded model 

To summarize, the retarded model has a gap equation given by Eq. (5), with 
kae,rs= -4;n(ro)s/i(ro)ary{yff[ ~S 3 (q) 

qjqk \ 4 a 
q 2 / 3 qo 2 ~q 2 ' 


4jc i(7j)sp(7k) a7\ Sjk 


(15) 
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We assume for £ the general Ansatz 

£ = pA(p, po) + r 'pB{p, po) ~ yopoD(p, po ), 

P = \p\ • 

After rotation to the Euclidean section where po—ico, £ has the form 
£ E — pA[p y oj]+ r'pB\p> a)]— irocoD[p , co], 

A[p y cd\ = A(P, icu) y etc., 
and the quark condensate is given by 

< uu> = 3 


After some algebra, the gap equation can be reduced to the following coupled 
equations for A, B and D on the Euclidean section, 


pA[p, co] = 



3o/2 \ 2qA[q, 

[(q-pYYI d\q,K\ 


2pa\p, ?r 

d\fi,Z] J 


2 qA[q, £] [~ 4 a .4_ a _ 

d[q,%] l_3 (q~pY 3 (£-(o) 2 + (q-p) 2 


3 (£ + co) 2 + ( q ~ p) 2 


? 


pB[p, (o\ = p(Z- 1) 





3g/2 \ p2qC[q, g] 
[(q~p) 2 Y Y d[q,t} 


2 pC[p, g] ' 
d[p,£] . 


2gC[g,£] |~4 an 4 qp(l + /; 2 )-(p 2 + q 2 )v 
d[q,%\ L 3 ( q~pY 3 a p 2 + q 2 ~2pqn 


x 


1 


1 


(Z-co) 2 + (q-pY (Z + coY + iq-p) 


coD[p,co]=co(Z-l)+^prf 0 d{ 


X 


4 r 1 

3 a [^-coY + (q~pY 


_1_ 

(Z + (oY + (q-pY _ 


with 


E[p, cd\ — l + D[p, a>\, C\p, cd\=l+B\p, eg], 
d\p, co] = co 2 E 2 [p, co\+p 2 {C 2 \p, co\ + A 2 \p, a)}}, 
V = p-q/ ( pq )- 


(16) 

(17) 

(18) 

integral 


(19a) 

(19b) 


In writing Eq. (19), we have made use of the following reflection symmetries, 
A, B, C, D, E are invariant under cy-> —eg. 


(20) 
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§ 4. Discussion 

Getting a quantitative realization of the Nambu-Jona-Lasinio model within QCD has 
turned out to be more difficult than one would have hoped. Coulomb gauge gap equation 
models based on an instantaneous confining potential give chiral symmetry breaking, but 
are at variance with the observed Lorentz structure of the phenomenological confining 
potential. Extending the model to include Coulomb gluon exchange does not cure the 
Lorentz structure problem and requires the inclusion of transverse gluons as well; this 
leads (after angular averaging) to a set of coupled two-dimensional integral equations, 
which will be much harder to study analytically 8) and to solve numerically than the 
one-dimensional integral equations of the instantaneous potential model. Moreover, the 
retarded equations are renormalization scheme dependent, since Eq. (19) contains the 
finite part of the renormalization constant Z as a parameter. Thus comparison of the 
extended model with experiment will have to take the renormalization scheme 
dependence 91 of <uu) into account, a complicating feature which again was absent in the 
instantaneous potential models. The situation is sufficiently complex that, without a 
resolution of the Lorentz structure problem, the motivation for a further elaboration of the 
Coulomb gauge gap equation model seems much diminished. 
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I formulate a successive over-relaxation (SOR) procedure for the Monte Carlo evaluation of the Euclidean partition 
function for multiquadratic actions (such as the Yang-Mills action with canonical gauge fixing). A convergence 
analysis for the quadratic-action (Abelian) case shows that as thermalization proceeds the mean nodal fields relax 
according to the difference equation arising from the standard SOR analysis of the associated classical Euclidean 
field equation. Hence, SOR should accelerate the thermalization process, just as it accelerates convergence in the 
numerical solution of second-order elliptic differential equations. 


As has been much emphasized, l the Euclidean par¬ 
tition function is a fundamental tool for studyingquan- 
tum field theories. For the case of a boson field 
theory 2 containing spin-0 scalar and spin-1 gauge 
fields, denoted collectively by the partition func¬ 
tional inverse temperature£ is given by the func¬ 
tional integral 3 


z= fd<t> { J f [d<p] exp(-S), 

Cdt (d 3 x£ B . 

Jq j 


(1) 


In Eq. (1) is the Euclidean action density, 
including source terms, and the path integral ex¬ 
tends over periodic paths, with <p{ 0) = <p(i 3) = <j> r 
I will restrict my attention in the following dis¬ 
cussion to the case where £ £ is a multiquadratic 
form (that is, it is at most quadratic in each 
individual field component), and will assume that 
the Euclidean action S is bounded from below. 

This restriction excludes interacting spin-0 fields 
from consideration (renormalizability for scalars 
requires a <p 4 term in the action), but allows <p 
to contain any number of non-Abelian spin-1 
gauge fields, since the outer-product form of the 
gauge-field self-interaction is easily seen to imply 
a multiquadratic action. 4 * 5 Of course, when gauge 
fields are present, the partition function as 
written in Eq. (1) is formally infinite, as a result 
of integrations over gauge transformations which 
leave the action invariant. In reducing Eq. (1) to 
a discrete form for Monte Carlo evaluation, there 
are two natural strategies for dealing with the 
gauge infinities. The first, introduced by Wilson 6 
and extensively studied 7 over the past few years, 
consists of using a discrete procedure in such a 
way that an exact, but compact gauge-invariance 
group remains, which can then be safely included 
in the Monte Carlo integration. 7 While this ap¬ 
proach has many interesting features, it suffers 


from the drawbacks that (1) it is expressed in 
terms of unitary-matrix link variables, and has 
no natural discrete analogs of the gauge potentials 
and gauge fields, and (2) the multiquadratic form 
of the action is lost. A second natural strategy, 
which I will pursue in this paper, is to use the 
Faddeev-Popov method 1 to break the gauge in¬ 
variance. In particular, if one chooses the canon¬ 
ical gauge fixing 6 


6 1 = 0infe 4 : - 00 <x u ... ,x 4 <°° , 


b z ~Q in R 3 : 
b 3 ~ 0 in R 2 : 
b 4 ~ 0 in R t : 


x l = 0,-oo<x a ,x 3 ,x 4 <« , 

*i = ** = °»~ 00< *3»*4< 00 , 

X\ ~X 2 ~x 3 ~0 r -°0<X 4 <°0 



for each gauge potential b** in <f>, the gauge de¬ 
generacy is completely broken, with a Faddeev- 
Popov determinant which is constant. The func¬ 
tional integral can then be made discrete by 
taking the nodal values of the gauge potentials as 
the variables, and applying the standard replace¬ 
ment 9 of derivatives by finite differences to the 
action S. Denoting the set of node variables which 
are integrated over by {<(>}={<f>(i), * = 1,... ,JV}, 
this procedure yields a multiple integral of the 
form 

*=[n /"<*<#> w] e- s 1(011 , (3) 

with S a multiquadratic form which is bounded 
from below. Thus, for any node variable <Hfe), S 
can be decomposed as 

= A„[<#>(£) — C J* +Bj, A„>0 (4; 

with A ki B kJ and C h functions of the subset of 
node variables {<j>} h s {<f>(i), * = 
k +1,..,, . 

Since in typical applications the dimensionality 
N of the multiple integral is very large, the 
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numerical estimation of Eq. (3) requires use of 
the Monte Carlo method. 7 ' 10 Starting from any 
initial configuration {^J, one generates a se¬ 
quence of successive configurations, or Markov 
chain, feah • ■ ■ > {<£*}> • ■ • by repeated ap¬ 

plication of a transition probability W[{<£}-{<£'}]. 

The transition probability W is chosen so that in 
the limit as M becomes infinite, the configura¬ 
tions in the chain are distributed according to the 
equilibrium probability density 

JU{<#>}] = e- SCUn . (5) 

Sufficient conditions 11 on W to guarantee an asymp¬ 
totic equilibrium probability distribution are the 
normalization condition 

[n fd<t> (O'J Wt{<#)}-{<#>'}] = 1 for all {<#>}, (6a) 
the ergodicity condition 

^J{<^}] > 0 ) i > J{<#)'}]>0-W[{<#,}- {<#,'}] >0, (6b) 

and the detailed-balance condition 

'}] ■ JUfo'Mfo'M*}] • (6c) 

In numerical work it is generally most convenient 
to change only a single node variable at a time. 
When specialized to this case, the form of W, for 
a step in which the node variable <£ft) is changed, 
is 

W=w[{<j>} h ;<l>fc)-<l>faY] , (7) 

with w required to be ergodic and to satisfy the 
normalization and detailed-balance conditions 

f d<(>ft)'»[{<(>}*;<(>ft)= (8a) 

-PbJWa* <f> ft)M{0}»; <f> ft) - 4> ft Y] 

{<#>}*» 4 > ft)* V[{<£} fc ; <P ft)' <£ ft)]. (8b) 

As is well known, the conditions of Eq. (8) do not 
fix w uniquely. The choice used in most Monte 
Carlo studies of gauge theories, motivated by the 
intuitive idea 7 of successively thermalizing the 
individual node variables, is 

“>[{<£}*; <pte)~<p m > 

r . (9) 

-at, 

which makes the distribution of new values <£ ft)' 
completely independent of the old value <p ft) being 
replaced. For a multiquadratic action, where the 
dependence of S on <£ ft)* is known explicitly from 
Eq. (4), the transition probability of Eq. (9) be¬ 
comes 

“>[{<£}*>■ <f> ft) ~ <P ft)' ] = M fc ) -1/ . do) 
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This evidently corresponds to choosing a Gaussian 
distribution of the new feth-node value around a 
central value C fc , where C h is the value of <£ft) 
which minimizes £[{<£}*> <£ ft)]. 

As motivation for the generalization of Eq. (10) 
which I am about to discuss, let us briefly consi¬ 
der the problem of minimizing the discrete action 
functional S[{<f>}]. This can also be accomplished 
by an iterative procedure, the simplest form of 
which consists of starting from an initial con¬ 
figuration {<£„}, and then successively replacing 
each node value <£ft), k-l t .. . by the value 
C fc [{<£} fc ] which minimizes S. Since S is nonin¬ 
creasing under this relaxation procedure, in the 
limit of an infinite number of steps the minimum 
of S (assuming it exists and is unique 12 ) will be 
attained. However, it is well known 9 that the 
procedure just outlined is not the optimal point- 
iterative algorithm for minimizing S; much more 
rapid convergence to the minimum can be ob¬ 
tained by using the successive over-relaxation 
(SOR) method in which <£ft) f is given by 

<t> ft)' = wC fc + (1 - w)<£ ft) 

= C 1 + (l-«)fo(ft)-Cj, (11) 

with w a parameter called the relaxation param¬ 
eter. Convergence is guaranteed provided that S 
remains nonincreasing at each step of the itera¬ 
tion, which requires 

0 « sKH,, 4>(ft)]-s[{<i> }*,<)>(*)'] 

= /!*[<#> (ft) - <J> (fc)'][<#> (ft) +<#> (ft)' - 2CJ 
= ^(ft)-^(ft)']*^-l) , (12) 

giving the restriction 

0<cu<2. (13) 

Whenw = l, Eq. (11) reduces to <£ ft)' = C ht corres¬ 
ponding to the simple minimization procedure in 
which the new value <j> ft)* is independent of the 
old value $ ft). When w * 1, the new value <p ft)* 
clearly retains a memory of the old value <£ft). 

In practice, optimum convergence is obtained by 
doing several iterations with w = 1, and then doing 
many iterations with a value w = w opt close to 2, 
adjusted to maximize the rate of final approach of 
S to its minimum. 

Let us now return to the problem of evaluating 
the partition function of Eq. (3), and ask whether 
there is a parametrized, over-relaxation gen¬ 
eralization of the Gaussian transition probability 
of Eq. (10). A simple investigation shows that 
such a generalization does exist, and is given by 
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<#> (fe) - *<*>'] = j- [zzhfi] - (1 - ">* <*>]*}. < 14a ) 

which can be rewritten as 


w[{<j>} h ; <f> ft) - <(>(*)'] - costffl)- 1 ' 2 expi-AJcoshe^ (*)' -C fc )+sinhG(<£(fe) -C fc )] a }, w - l=tanhG . (14b) 

To verify Eq. (14), we note that it obviously satisfies the normalization condition of Eq. (8a), while since 

A h [tp (ft) ~ C k ]* + A k [coshfl(<f>(*)' - C h )+ sinhfl <0 (fe) - C fc )] 2 

~A h cosh 2 G{[<£ (&) -C fc ] 2 +[<£ (fe) F -C h ] 2 } + 2A h coshfl sinhG[<£ (&) - Cj[<£ (fe)' - C fc ] 

~A h [<p(k )' - C fc ] 2 +A fc [coshG(0 (fe) - C fc )+sinhG<0 (fe)' -C fc )] 2 , (15) 

it also satisfies the detailed balance condition of Eq. (8b). Hence, the transition probability of Eq. (14) 
provides an SOR method for the Monte Carlo evaluation of the Euclidean partition function for multi¬ 
quadratic actions. 

To determine whether SOR accelerates the thermalization process, let us analyze in detail the case 
where the action S is a quadratic (as opposed to a multiquadratic) form, corresponding to an Abelian 
gauge theory with external sources. Let <j> ik) M denote the value of the feth-node variable after M complete 
iterations, let 

{<#,}"={<#, (l)", (ft -1)", * (fe +1)"-\ W u - 1 } 

denote the set of node variables which are passive when the ftth-node variable is being altered during the 
Mth-iteration sweep, and let p[{(p};N(M -l) + k-1] be the joint probability distribution of the node vari¬ 
ables after JV(M -1) + k -1 individual node replacements. Then we evidently have 

p [{<#>}?,<#>(*)";JV(M-i)+*]= f d<#>(fe)"- 1 4{<#>}l!';<#>(fe)"" 1 -<#>(Jfe) i ']p[W,<#>(*)"" l ;w(M-i)+fe-i], (16) 

■'.(6 

which tells us how the joint probability distribution evolves from step to step. Integrating Eq. (16) with 
respect to <l>(k ) M , and using the normalization condition of Eq. (8a), we learn that 

d<#>(jfe)"p [{<#>}", Hk) M ;N(M - 1)+ k]= r - 1)+ k - 1], (17) 

■'■'.(6 

which means that the joint probability distribution for the subset of node variables {<£}* [with <£(ft) integrat¬ 
ed out] is unchanged during the iterative step in which <l>(k) is altered. This in turn implies that the mean 
value of any node variable <Mfe), defined by 

<Mfc) = [n£ <Wi)] JT dMk)Mk)P[{<l>k ... ], (18) 

changes only during an iteration step in which <j>(k) is altered, and so is uniquely specified by the notation 
<P( kY 1 y which gives its value after M complete iterations. To study the evolution of the mean values, we 
multiply Eq. (16) by <p(k) M and integrate, giving 


4>(k)* = ffi/: 12 JC d<p(i J^jptW,<*>(*)“;W(M -1)+ k] 

x f d<p(k)* {[<p(k)“ -vC„-(l -w)<p(ky'- 1 ] (1) +[wC i + (1 

oC 

x <p(k)^]P[{<p}^<p(k^-\N(M -1)+ k -1]. (19) 


Comparing with Eq. (14a), we see that the contribution of the term labeled [ ] (l) vanishes, while using 
Eq. (8a) the contribution of the term labeled [ ] (2) simplifies to give 
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Up to this point the analysis is completely general, 
and applies to multiquadratic as well as quadratic 
actions. Specializing now to the case of quadratic 
actions, for which C h is a linear functional of its' 
arguments, Eq. (20) becomes 

*(*)*-wc„[{^}"]+ (i - «)«( k)*-', 

W={0(1)\ • • • M* - !)",♦(*+D" 1 ,. • •, • 

( 21 ) 

Thus, under SOR thermalization for a quadratic 
action, the mean nodal values evolve according 
to Eq. (21), which is just the difference equation 
encountered in the SOR minimization of the action 
S. Since SOR is known to accelerate the minimi¬ 
zation process, Eq. (21) implies that it will ac¬ 
celerate convergence of the thermalization pro¬ 
cess as well. Although the precise statement of 
Eq. (21) can be made only for quadratic actions, 
the general conclusion reached here, that SOR ac¬ 
celerates thermalization, is very likely to carry 
over to the general multiquadratic case as well, 
much as SOR accelerates the minimization 4 of 
multiquadratic as well as quadratic actions* 

As compared with the conventional 6 * 7 lattice 
gauge theory approach, the strategy for evaluating 
the partition function outlined above may have 
several advantages. First, since the potentials 
remain as the variables, there are natural discrete 


I 

analogs of the gauge potentials and gauge fields, 
which should permit the study of such questions 13 
as the behavior of the effective action for weak 
fields. Second, since the mean node variables for 
the Abelian theory thermalize according to the 
SOR equation encountered in minimizing the dis¬ 
crete action S , and since this equation is just the 
conventional 9 discrete version of the classical 
Euclidean field equation derived from the con¬ 
tinuum S, the Abelian theory will never give a 
confining potential for static sources. Thus, if 
confinement is found in the non-Abelian case, it 
should not be as an artifact of the discrete pro¬ 
cedure. Finally, the SOR method outlined above 
may well be computationally faster than the lattice 
gauge theory method, both because of the possi¬ 
bility of acceleration of the thermalization pro¬ 
cess, and because the Gaussian distribution of 
Eq. (14a) can be obtained from an array of pre¬ 
stored, normally distributed random numbers by 
the calculation of a single square root and a rel¬ 
atively small number of arithmetic operations. 
Detailed numerical experiments will, of course, 
be needed to see if these conjectured gains are 
realized in practice. 
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We study overrelaxation algorithms for the thermalization of lattice field theories with multi- 
quadratic and more general actions. Overrelaxation algorithms are one-parameter generalizations 
of the heat-bath algorithm which satisfy the detailed-balance condition; the parameter is the relax¬ 
ation parameter o>, 0<o><2, with co= I corresponding to the heat bath. First, we show that the 
o >—*-0 (extreme underrelaxation) limit of the overrelaxation algorithm is equivalent to the Langevin 
equation approach. We analyze the thermalization of a free-field action, and show that for co~2 
an oveirelaxed Gauss-Seidel algorithm yields a critical slowing down which is independent of 
wavelength, and has a correlation time which is a factor N smaller than that for an unaccelerated 
Jacobi iteration, with N the linear dimension of the lattice in lattice units. For a general nonmulti¬ 
quadratic action, we give a generalized overrelaxation algorithm which satisfies detailed balance 
with respect to an effective action which is explicitly computable in terms of the original action. 

In the case of SU(/i) lattice gauge theory we use this construction to formulate an overrelaxed al¬ 
gorithm which has exact lattice gauge invariance, and which satisfies detailed balance with respect 
to an effective action differing from the Wilson action only by terms of relative order a 2 in the 
continuum limit, with a the lattice spacing. 


I. OVERRELAXATION AND ITS RELATION 
TO THE LANGEVIN APPROACH 


The generic lattice field-theory problem is that of 
evaluating the Euclidean partition function 

z= f d[<f>]e-e s u*H , (1) 

with S the Euclidean action on a lattice and with 
fdW an integration over discretized lattice variables. 

In the Monte Carlo method for evaluating this integral, 
one generates a Markov chain of configurations (0 ( ), 
i=l,2,... by application of a transition probability 
W[ (0) —j► \<f>' J ], which is chosen to satisfy the detailed- 
balance condition 

( 2 ) 

as well as normalization and erogodicity conditions. 1 
These conditions guarantee that in the limit i—+<x> t the 
ensemble of configurations (^,) is distributed according 
to the equilibrium probability density exp( — PS[{<I >\]). 
In what follows, I will refer to the problem of generating 
such an equilibrium distribution of configurations as the 
thermalization problem. If we now consider the /?—> oo 
(zero-temperature) limit, only the configuration which 
minimizes S contributes to Eq. (1). Hence the zero- 
temperature limit of a thermalization algorithm will be 
an algorithm for the minimization problem of finding 
configurations \<f>\ which satisfy 

=0 . ( 3 ) 

101 


b<f> 


*[[*!] 


Conversely, we may expect that by generalizing methods 
which have been useful in solving the minimization 
problem, we can get useful algorithms for the thermali¬ 
zation problem. 

Following this line of reasoning, a number of years 
ago I showed 2 that for the special case of multiquadratic 
actions (which includes 3 the classical Yang-Mills action), 
the standard Gauss-Seidel overrelaxation algorithm for 
the minimization problem can be generalized to an over¬ 
relaxation algorithm for the thermalization problem. 
Since this earlier work forms the starting point for the 
analysis of the present paper, I proceed now to briefly 
summarize it. A multiquadratic action is one which, for 
any node variable <l> ki can be decomposed as 

]= s [{<!>]]= — Q A k >0 , 


(4) 

with A k , B kt and C k functions of the remaining node 
variables 


= . (5) 

A Gauss-Seidel iteration for the minimization problem 
consists of the successive replacement of each node vari¬ 
able <f> k by the value C k which minimize the action as a 
function of that single variable, with the other variables 
{<!>\^ k held fixed. Although this procedure gives the 
largest single step reduction in S, it is in fact not the 
most efficient procedure when coherent effects over the 
entire lattice are taken into account. A better minimiza¬ 
tion algorithm, which is no more demanding computa¬ 
tionally, is the overrelaxed Gauss-Seidel algorithm 

<f> k ~ =0> Ck +(1 m tyk > ( 6 ) 
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with co the “relaxation parameter.” Convergence is 
guaranteed provided that 5 remains nonincreasing at 
each step, which requires 

¥=k> * k ]-sUt) ¥=k* *'k] 


— A k (<f> k —<t> k ) 2 



giving the restriction 
0<<y<2 . 


(7) 


( 8 ) 


When co= 1, Eq. (6) reduces to the Gauss-Seidel prescrip¬ 


tion <f>' k —C k , in which the new value <f>’ k has no memory 
of the old value $ k . In practice, optimum convergence 
is obtained by doing several iterations with co— 1, and 
then doing many iterations with a value of co close to 2. 

Let us now turn to the thermalization problem for the 
action of Eq. (4). The thermalization analog of the 
Gauss-Seidel iteration is the heat-bath algorithm, in 
which a heat bath of temperature is touched in suc¬ 
cession to each node variable <f> k , with the other vari¬ 
ables \<j>\^ k held fixed. In Ref. 2, I showed that the 
heat-bath algorithm for Eq. (4) admits a one-parameter 
generalization, analogous to Eq. (6), in which the nor¬ 
malized transition probability IF is given by 


W[\* U.**-**] = 


PA k 

1/2 


PA k 

ttco(2—co) 

exp 


co(2—co) 


[<j>' k -coC k (1 co)<f> k ] 2 


(9) 


When co— 1, Eq. (9) reduces to the heat-bath algorithm, since the new values <f> k are distributed according to the equi¬ 
librium action and are independent of the old values tj> k . To see that Eq. (9) satisfies detailed balance for general co , let 
us introduce a hyperbolic angle 6 defined by 


co— l=tanh0, ---cosh0, -- — m ,, n ~ — sinhfl , 

M 2-co)] W2 M 2-co)]' /2 

in terms of which Eq. (9) takes the form 

W[ \<f>l^k’^k —*0]t ] = (PA k cosh 2 d/ir) W2 exp\ —PA k [cosh6(<f> k —C k )+sinh Q(<t> k — C k )] 2 J 
Detailed balance now immediately follows from the fact that 

(<l> k -C k ) 2 + [cosh0(<^ -C k )+sinh 0(.<t> k -C k )] 2 =cosh 2 0[(<^ -C k ) 2 +(<^ -C k ) 2 ] 

+ 2cosh0sinh0(0* — C k )(<f>’ k —C k ) 

= symmetric in $ k ,$ k . 


( 10 ) 


( 11 ) 


( 12 ) 


Since the transition probability of Eq. (9) is a Gauss¬ 
ian, it can be conveniently represented as a stochastic 
difference equation. Let n by a fictitious “time” index 
which increases by one for each update of the entire lat¬ 
tice, and let 7] n k be a set of Gaussian noise variables dis¬ 
tributed according to 

wuvn=n 

n,k 

and hence which obey 

<Vn.kVn'.k l >v = 7 ^n,n^k.k' ■ (14) 

Writing <f> k =<f> k , $ r k =$ k +l , Eq. (9) is evidently 

equivalent to 


sition probability of Eq. (9) to each node of the lattice, in 
some specified sweep order.] Let us now rewrite Eq. (15) 
by using the fact that <f> k —C k is proportional to 

05 /d<f> k y 

WA k m —C k >=0^[ wx j, W>* ’ 

(16) 

and by defining e k according to 

~^^k[\^ik\MhkU^k > (17) 

giving 


V^7 T 


**,k 


74 


(13) 




co{2 — co) 

WA k 


1/2 


Vn,k 


(is) r k + '-r k --e k P 


05 

Wk 


2 


1/2 


* 1/2*, 

€ k Vn,k 


(18) 


with C k and A k functions of the ^” +1 for those nodes 
which precede <f> kt and of $ for those nodes which fol¬ 
low <l> ki in the sweep of the lattice. [This just corre¬ 
sponds to the fact that an updating of the whole lattice 
is accomplished by the successive application of the tran- 


Apart from the extra factor of (1— <u/2) 1/2 , which ap¬ 
proaches unity as co— >0, Eq. (18) is just the discrete form 
of the Langevin equation with variable step size e k and a 
Gauss-Seidel interpretation of dS/d<f>, and approaches 
the corresponding Langevin stochastic differential equa- 
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tion as € k <x.co—>0. Hence the Langevin equation ap 
proach corresponds to the extreme underrelaxation lim 
it 4 of the overrelaxation algorithm of Eq. (9). 


II. CRITICAL SLOWING DOWN 
FOR A FREE-FIELD ACTION 


N 


5 = 


2 + 

*I * ■ ■ - > ‘d ~ 1 

+ (<£,- 1> / 2 + 1 , . . - ,i d —^i v i v . . - ,i d )2 
+ . . . ,i d + l — > 


( 20 ) 


In this section we give a detailed theoretical analysis 
of the performance of the overrelaxed minimization and 
thermalization algorithms, motivated by the fact that 
numerical studies by Whitmer, 5 Creutz, 4 and Brown and 
Woch 6 suggest that overrelaxation can improve the 
correlation time, as well as the speed of thermalization, 
in Monte Carlo simulations. We consider for simplicity 
the case of a single massless scalar free field ^ in d di¬ 
mensions; the inclusion of interaction and mass terms is 
not expected 7 to change the qualitative conclusions 
reached below. The node variable is thus 


with homogeneous (Dirichlet or Neumann) boundary 
conditions applied at the edges of the lattice. 8 Introduc¬ 
ing the notation 

)=^/|,... j d | fixed, / p ±i » 

we can now write the dependence of S on a given node 

as 

M - 1 

S independent of <f>j . (22) 


<t>i 


i * ■ 






and the action is taken as 


(19) From Eqs. (4) and (11) of Sec. I, we see that an overre¬ 
laxed transition probability for the update <f>j —can 
be constructed as 




2 ((cosh0[^(i‘ + 1 ] + sinh0[^(z* + 1)—0/ ] ] 




+ (cosh0[<£(i —1 )—<£/]+ sinh0[<£(i —l )—<£/] ) 2 ) 


(23) 


with the normalization constant J\f independent of <f>j and Rewriting the <(>) dependence by completing the square, 
and then substituting Eq. (10), Eq. (23) can be reexpressed as 


*0/]—^ ex P 



cosh# 

4 d 

, 



2 d<f>j— 2 [^( f ’u + l) + ^(/„ — 1)] 


#*-i 


+ sinhfl 


2d<f>j — 2 t^+n+^-l)] 




+ <l>i , <t>j -independent 


=jVexp 



4/3 


4 

dct)(2—co ) 



d$' I — (l—a>)d$ I — ±co 2 [<£(/„ +!)+<£(/„ — 1)] 


M=1 


+ <f>j , <t>'j -independent 


(24) 


Applying the procedure of Eqs. (13)—(18), Eq. (24) can be rewritten as a Gauss-Seidel stochastic difference equation 


d<f>j + -(1 — m)d^ I — \co 2 [^ n (V + 1 )+^ n+ ('/i” 1)] = — a7 hn > 

\ 

11 1/2 

dco{2 — co ) 


(ll.nlr.n' 28; a — 


40 


(25) 


, 8/ /' = 8. ., • • • 8 . 
1,1 q.'i 


•d'‘d 


It will be informative, in what follows, to also analyze the corresponding Jacobi stochastic difference equation, in 
which the old values 0"(z’ M — 1) are used for the earlier nodes in the sweep, instead of the updated values <j> n + *(/ — 1), 

2 [^%+i )+<l> n+ 'u ll -i)] = -a7]i„ , ( 26 ) 

#*—1 


Although Eq. (26) is not equivalent to the iteration of an algorithm which satisfies detailed balance with the action of 
Eq. (20), it has been extensively studied by Batrouni et a/., 9 and so furnishes a useful point of comparison. 

To solve Eqs. (25) and (26), we proceed by introducing a Green’s function 


R55 


671 


37 


OVERRELAXATION ALGORITHMS FOR LATTICE FIELD THEORIES 


s~m,n _/^n.n 

U. . t .t = Cj/ /' , 

'l-.'rf M 

which satisfies the stochastic difference equation, 
Gauss-Seidel case: 

d 

n + 1 ,n‘ 


M =1 

Jacobi case: 


dGtf - (1 - ffl WG^r - 2 l G r n ' <*'„ + 1 > + G /' n - 1 )] 

#*“1 


461 

(27) 


(28a) 


(28b) 


and the boundary condition 

G°p =0 . (29) 

Then the solution of Eqs, (25) and (26) can be written as 
= — 2 Gfo* °"V/',n' + ^7 » (30) 

where <j>] is the solution of the o-—0 (noise-free, or zero 
temperature) iteration, 

Gauss-Seidel case: 

d 

d4>l + x -(\-a>)d<l>l-\a> 2 [*"<i„+l> 

+ $ n + 'U tl -\)]=0 , (31a) 

Jacobi case: 

d 

d<t> n I +l -(l-<o)d<f>’!-±a> 2 + D 

+ £%-l>]=0 (31b) 

with the initial condition <£/=<£/. Since Eq. (30) implies 
that 

<«W7. « 2 ) 

we see that introduction of the Green's function has per¬ 
mitted us to separate $1 into a mean value term and in¬ 
dividual noise contributions. The rapidity of thermaliza- 
tion is determined by the rate of decay of <f>] with n, 


while the correlation time (the number of updates re¬ 
quired to evolve from one thermalized configuration to 
an independent one) is determined by the rate of decay 
of G”/? with n. For a general Monte Carlo calculation 
the thermalization and the correlation times are 
different, but they will turn out to be equal for the over¬ 
relaxed quadratic action case studied in this section. 

Since we are really interested only in the asymptotic 
limit of small mesh spacings or large lattices, we do not 
attempt to solve the difference equations (28) and (31) 
directly. (An alternative method, working directly from 
the iteration matrix for the difference equations, and 
yielding similar conclusions, has been given by Goodman 
and Sokal. 10 ) Instead we follow the method of Gara- 
bedian 11 and convert the discrete equations to an 
equivalent continuum problem, for which the corre¬ 
sponding partial differential equations can be solved by 
standard methods. Let us denote the mesh spacing by a 
and introduce continuum variables x^t by the 
correspondence 

f dx^^+a 2> d/dx fi <^a~ l A i , 

''/i 

t<-+an, J dt+-*a 2» d/dt+-*a 1 A, , (33) 

n 

a rf + 1 8(jc 1 —x \) ■ ■ ■ S(x d — xj)8(t — r')^8 f>r 8 n , 

with A the finite difference operator. Treating first the 
Jacobi iteration case, we rewrite Eqs. (28b) and (31b) as 


a-'da-'{G? I i'’ n -GW)-±coa- 2 2 [G/>"'(/„ + 1 >+G£"'(i„ -1 )-2G?f ]=a d ~'a ~ d , 

M =1 

fl- | d B - , (^; +1 -^j)-fwi- 2 2 [*%+n+*%-i>-2*;]=o. 

Making the correspondence 


(34) 


(35) 


Gffi <->G(x y x ) 

and referring to Eq. (33), we see that Eqs. (34) are the discrete analogs of the continuum parabolic partial differential 
equations 


G(x,x';t,t ')— 2 d G(x,x';t t t ') = — a d l 8(xi — x \) ■ ■ ■ 8(x rf — x’ d )b(t — t’) , 

coa 0 1 dx/ co 

“JA d _ 

<f>(x,t )— 2 7— 2 ^(x,f )=0 , 


coa dr 


l 


( 36 ) 
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with the initial conditions 

G(x,x';0,/')=0, t’> 0, 0(x,O)= smooth interpolation of <$ . 


(37) 


Turning next to the Gauss-Seidel iteration case, we follow the Garabedian analysis and anticipate the fact that the op¬ 
timum co is related to the mesh spacing a by 


co — 


1 + Ca ’ 

with C a constant of order unity. Substituting Eq. (38) into Eqs. (28a) and (31a), these can be rewritten in the form 
dCa-\Gtf Un ' -Off )-a~ 2 2 [G/"’ n ’(^ + 1 )+Gp n ‘(i fi -1 ] 


(38) 


a*- 1 


+«“ 2 2 1 G£ + { ' n \i^-G I n ' + { ’ n \i^-\)-[G I n '' n \ip)-Gp n \^-l)]\=j;a d - l a- d -%j£ ntn ' , 
#*-i 


co 


dCB“ 1 (^; + l -^J)-fl“ 2 2 [*% + l> + *%-l>-2*7]+fl- 2 2 1 *" + l (/ /l >-* ,, + l « /l -l> 


(39) 






— — 1 )]) =0 . 


Again making the correspondence of Eq. (35), we see that Eqs. (39) are the discrete analogs of the continuum hyper¬ 
bolic partial differential equations: 


dc4-G(x,x';t,t')- 2 ~^GU,x’;t,t')+ 2 —|—G (x,x';t,t’)= — a d ~ ‘8(jc, -x \) ■ • ■ Ux d -xj )8U-t') , 
dr ^ _ j dxp of dx „ co 


dc^-<i>(x,t)- 2 


a 2 -r 


~ dx 2 


$(x,t)+ 2 


' a 2 


(40) 


m-i dtdx » 


<l>(.x,t)=0 , 


with initial conditions 12 as in Eq. (37). Now making the change of variable 

d 

A* = l 

some straightforward algebra shows that Eq. (40) is transformed into the canonical hyperbolic form 

^C^-G(x,x';s,s')+v~tG(x,x';s,s')— 2 ~^~^-G (x f x , ;s t s t )— —a d ~ l b(x x —x\ ) * * * 8(x rf — x^)8(s — s') , 


(41) 


as 

a t 


4 05 2 




O) 


rf a 2 r. 




a 2 r 


(42) 


a*. 4 3,^' -,6x, 2 


<^(JC,5) = 0 , 


with the boundary conditions 
G(x,x';5,s # )=0 , 

^(x,5)=smooth interpolation of <$ 


We assume that the boundary conditions are such that 
there are no normalizable zero modes. (This implies that 
the corresponding Laplace equation with inhomogeneous 
boundary conditions has a unique solution.) Then we 
have 


imposed on the surface 

d 

5=12^. (43b) 

#*—i 

Let us proceed now to solve Eqs. (36) and (42) by sep¬ 
aration of variables. Let rp m (x) be a complete set of 
eigenfunctions of the ^-dimensional Laplace operator 


d 

V 2 = 2 


#*—l 



(44) 


&ix x -x\)' ■ ■ 8(x rf -*;)= 2 'l> m (xty* m {x’) , 


(45) 


with the minimum eigenvalue k x of order L " 1 
Jacobi iteration case, we expand 

G(x,x';r,f')= 2 , 

m 

$ix,t)='2,$ m (t)$ m (x) , 


In the 


(46) 


subject to homogeneous boundary conditions on the with the coefficients G m and <f> m obeying the differential 

edge of the cube 0 which bounds the lattice. equations 
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2d d 
coa dt^ m 


G m UV,n + * m 2 G m (x';f,n 


co 

, d d < 47 > 
^Lfl m (t)+k m 2 $ m (t) =0 . 

coa at 

Expanding the initial condition on <j> as 

#x,0 >= , (48) 

m 

Eqs. (47) are readily integrated to give 
G(x,x';t,t ')=2 /-* (x')<? kmU *0(f —t’) , 

m 

(49) 


^(x,0=2^|°V m U)e '* m ‘, A m = 

m 


-kt 


coak 


m 


2d 


We turn next to the Gauss-Seidel iteration case. We 
now expand 


G(x,x'\s,s')= 2 G m (x';s,s')i[> m (x) , 


m 


&x,s) = 2<t> m (s)tp m (x) , 


(50) 


m 


with the coefficients G m and <l> m obeying the differential 
equations 

dC^G m (x';s,s')+~-^jG m (x';s,s') 

+/t m 2 6 m (x';j,s’)=-a <!- V*(x')5(j -s') , 

CO 

2 (51) 

dC^ m (s)+~^ m (s)+k m 2 ^ m (s)=0 . 


ds 


4 ds 


The general solution for <f> m (s ) has the form first given 
by Garabedian: 11 


<t>n,<-s)=a m e PmS +b m e <? " ,J 


Pm =2[C — (C 2 —k m 2 /d) l/2 ] , 
Q m =2[C + (C 2 —k m 2 /d) W2 ] , 


(52) 


with the coefficients a m ,b m implicitly (but not explicitly) 
determined by matching to the initial condition on $ of 
Eq. (43). The values of a m ,b m in fact do not matter; all 
we need for what follows is that <j> decays as a function 
of time at least as fast as 


exp[-f min m (Re/v,,Re? m )] . 

To solve the equation for G m , we write 

G>{x';s,s'), s >s’ , 
G^(x';s,s'), s <s' , 


(53) 


G m (x # ;$,$')= 


(54) 


with 


G>’<(x';s,s')=a>’<(x';s')e PmS +b >■ <(x';s'X? 


. « t ^ mS 


(55) 


Continuity across s =s' requires 
G>(x , ;j , F J , )=G<(x # ;j r f j r ) , 


(56) 


while the 8 function on the right-hand side of Eq. (51) 
requires the first derivative discontinuity to be 


|^-[G m >(x';s,s')-G m <(x';s,s')] 


$ =5 


0) 


(57) 


To solve these, let us make the ansatz 


G<=0 


(58) 


and then show that this does in fact satisfy the boundary 
condition of Eq. (43). Assuming Eq. (58), a little algebra 
shows that Eqs. (56) and (57) are satisfied by 


G>(x'-,s,s')=--a d -'r m (x') 1 - - - , 

d °> Qm Pm 

(59) 

as can be verified by inspection. Hence G(x,x';s,s') is 
given by 

A ^ 

G(x,x';s,s')= —— a d ~ x 

d co 

m 

P m ts's) Ints'-S) 

X £ - - - 6(s — s* ) (60) 

f?m Pm 

and corresponds to taking a solution to the hyperbolic 
equation Eq. (42) which has support only inside the for¬ 
ward light cone: 

1/2 


2 (Xp—x'p) 2 

#*-l 


<—T7t(5 — s') . 


“ jl/2 


(61) 


Now the Schwartz inequality implies 


+ 2 (Xp-X'^K 


#*—1 


d 

1/2 

d 

2 i 


2 (x^—Xp) 2 

#*—i 

, 


tl=\ 


1/2 


= d X/1 


2 {Xp —Xp ) 2 


#*—1 


1/2 


(62) 


and combining the inequalities of Eqs. (61) and (62), we 
see that inside the forward light cone we have the ine¬ 
qualities 13 


0<s-s'Tl 2 (x /i —x , /i )= 
#*—i 


t-t' 


t—t'+ 2 (Xp—Xp) . 


(63) 


#*—i 


Thus for f'>0, the surface f=0 lies entirely outside the 
forward light cone, and hence the initial condition that 
G vanish at f=0 is satisfied by Eq. (60), even though the 
differential equation is not separable in the x,t coordi¬ 
nate system. From Eq. (60) we learn that G also decays 
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as a function of time at least as fast as Eq. (53). 

Let us now determine (following again Ref. 7) the 
value of C which maximizes the decay exponent 

A GS =rnin m (Rep m ,Re$ m ) , (64) 

where GS denotes Gauss-Seidel. For k m 2 large enough 
so that k m 2 /d > C 2 , we have 

Rep m = Re$ m = 2C ■ (65) 

On the other hand, for values of k m 2 small enough so 
that (C 2 — k m 2 /d) xn is real, we have 

rnin(Rep m ,Retf m )=p m >p x = 2[C - (C 2 -k x 2 /d) X/1 ] . 

( 66 ) 

Hence 

rn\n m {Rtp mt Rzq m ) 

=min|2C,2[C-Re(C 2 -^ 1 2 /rf) 1/2 ]J , (67) 

and this expression is maximized for 

C opl =k\/d wl . (68) 

At the optimum C we have 

A gs =R tp m =R tq m =2C opI =2 k , /d 1/2 (69) 

and all modes have the same time decay exponent. By 
contrast, for the Jacobi iteration the decay exponent [Eq. 
(49)] varies quadratically with wave number 

k m =co J ak m 1 /d (70) 

and becomes very small at the largest wavelengths, giv¬ 

ing for the most slowly decaying mode 

Ay =&jak 2 /d . (71) 


Comparing Eqs. (69) and (71) we have 
A y coj k x a 

Since /C|~ L~ x and L/a =N , with N the dimension of 
the lattice in lattice units, we get our fundamental result 

^GS 


d x/1 N . 


co 


(72b) 


Hence the overrelaxed Gauss-Seidel algorithm dramati¬ 
cally improves both the rapidity of thermalization and 
the correlation time as compared with the Jacobi algo¬ 
rithm, and makes critical slowing down independent of 
wave length. This improvement becomes even more pro¬ 
nounced when compared with Langevin-Jacobi pro¬ 
cedures, for which (as shown in Sec. I) one has coj « 1. 

We conclude this section with two checks on the 
analysis given above. First, the Jacobi case analyzed 
above is just a continuum version of the model for the 
correlation length studied by Batrouni et al . 9 In units 
with a = 1, they find 



1 

e(p 2 +m 2 ) 


(73) 


which with the correspondences e~coj (cf. Sec. I) and 

A A 0 

p +m ~k m becomes 



(74) 


and so our result agrees with theirs. Second, as a check 
on the reasoning leading to Eq. (60), we have explicitly 
evaluated the time dependence of the Gauss-Seidel 
Green’s function for the L — > oo limit in which the ip m 
are infinite-space mode functions. Details of this calcu¬ 
lation are given in the Appendix; the result is 


— , , ,, 4 2a 

G(x,x \t,t ) =- 


d -1 


g(/»,/V>= 


d co (27r) rf 

1 

4(C+i7 | l/rf 1/2 ) 


f d d le iI ' u ~ x ’ ) ga f t-t t ) , 

(75) 


Xexp 


(/"f + d 1 ) 2 
(/“) 2 + C 2 d 


X(C-/7«/d 1/2 > 


0(t) , 


with /“ and l L the components of / parallel and perpen- 
dicular to the fixed vector (1,1,. . ., 1). The presence of 
the factor 6(t — t’) implies that G(x,x';t,t') vanishes at 
t=0 for t ' >0, and so Eq. (60) does indeed satisfy the ini¬ 
tial condition of Eq. (37). For C ~k x /d 1/2 , Eq. (75) im¬ 
plies that the decay exponent is ~k x for wave numbers 
|/ | larger than k v in agreement with Eq. (69). [For 
wave numbers | / | smaller than k x Eq. (75) is no longer 
relevant, since the difference between infinite space and 
finite box mode functions becomes significant.] 


III. A GENERALIZED OVERRELAXATION 
ALGORITHM, AND APPLICATION TO SU(/i) 
LATTICE FIELD AND GAUGE THEORY 

The results of Sec. II indicate that overrelaxation 
should be of computational value for the thermalization 
problem, and so we proceed next to construct overrelax¬ 
ation algorithms for the Yang-Mills action (which, as 
noted above, is multiquadratic in the components of the 
gauge potential), and for the Wilson lattice gauge action 
(which is not multiquadratic). The construction employs 
the following generalization of the overrelaxation algo¬ 
rithm of Sec. I: Consider a field theory with field vari¬ 
ables which can be divided into two disjoint classes 
with functional integration measure 



i i 


and with the general (nonmultiquadratic) action 

S=Si[[^Uy')]+S 2 nV'j] • (76b) 

For this theory, consider an updating |^j—>|^'] in 
which only the (^] variables (or some subset of them) is 
changed, and let *?[(^),{^'),(^);0] be any auxiliary* 
functional of the indicated field variables and the relaxa¬ 
tion parameter 0 which is symmetrical under the inter¬ 
change [<£)«-*[<£'). For this updating, we take the tran¬ 
sition probability to be 
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)]-^[[^),[tAi;0]exp(-^cosh 2 05i[[f ),[!/>)] 


with the normalization *A/[[^]>[^J;0] given by 

N * 

■M- l [\<t>\,\rl>\;0]= n f C?fexp(_/3cosh 2 05,[[f J, [t/-J ]-/3sinh 2 05, [[^ j, [ t/-J ]-/35[[^J, [f . 

1 


(77a) 


(77b) 


Then W satisfies detailed balance with respect to the effective action 
- N * r / N r 

•N[WUO]=Il f d+Ml + U\*UO]/ H f d+ ■ 

1 / 1 


(78) 


The proof follows directly from the fact that 
w [ [ * I — [ </>' I ] e xp (- ps [ [ * |,( t/- J ] - In (JVIJ <j> J, [ t J; 9 ]/Jf[ [ t/- J; 9 ])) 

=JV[[t/-J;0]exp(-/3cosh 2 0(5 1 [[^],[t/-J] + 5 1 [[^J,[t/-J])-/35[(^J,[^] > [t/-J;0]-/35 2 [[t/-]]) 
= symmetrical in ■ (79) 


The multiquadratic case of the generalized algorithm is recovered by taking 

U 

with the linear functionals of the subset of variables (^}. The overrelaxation algorithm of Secs. I and II then cor¬ 
responds to the choice of auxiliary functional 

^[|^i,|fi,|tAi^]-sinh0cosh02^/tif MtflMytMUWi’M] 

‘j 

+^[|^i,|tAi]^[itAi]^[if MV']])* (8D 


for which the transition probability W of Eq. (77a) becomes 


—[f)] = JVexp [—/? 2 (coshflL ; [{^' J, ( rj>\ ] + sinh&L, [ (^ ], j t/>] ]) 

‘j 

X Aij[[il>) ](cosh0Lj[|^'J,|t/-J ] + sinh0Z, y [[0),[^] ]) 
in terms of a relaxation parameter co related to 6 as in Eq. (10), this can also be written as 

13 


(82a) 


W[ (0) —*■ (<^'j ] = JVexp 


co(2—co ) 




ij 


xAijlimLjlW i,[tAi]-(i-^)^[i^j f itAi]) 


(82b) 


Since by the linearity of L we have 

cosh#!,;[(<£'], [xp] ] + sinh#L ,[(<£], \xp\ ] = L t [[co$kidip r + $inhd$\ f \tp\ ] , (83) 

the normalization is now given by 

J\f~ l = JJ J* d<f>'exp{ —/?£,[(cosh0<£' + sinh0<£), (t/f) ]A ij [\tp\ ]Lj[(cosh00' + sinh0<£], \tp\ ]) 

l 

=(cosh0) N * n f ^<^'exp(— /3L i [\<l>'\,\r(r\]A i j[\tp\]Lj[\<l>'\,{tp\])=mdependent of |<£] , (84) 

l 

and so the second term on the right-hand side of Eq. (78) vanishes, giving in the multiquadratic case 
5 eff [(^j, |i^J;0]=S‘[(^J, (t/fj ]. In the application of the generalized algorithm to the Wilson lattice gauge theory 
given below, the second term on the right-hand side of Eq. (78) will be nonzero, but is arranged to be a higher-order 
correction in the continuum limit as compared with the original action S. 
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Let us now apply this algorithm to the Yang-Mills action 

I3S = f d^TriF^F^), =F^TK Tx(T i Th = \b iJ , (85) 

with the field-strength F^ v related to the potential by 

F,„=a il A y -d y A l , + ig 0 [A l „A v ]. (86) 

To formulate a discrete version of Eq. (85), we set up a cubic lattice with unit cell of side a, and associate the potential 
variables with the centers of the links. Then for a plaquette in the x^-x v plane with center ( x cfii x cv ), as shown in Fig. 
1, we have, for the field-strength component F at the center of the plaquette, 

F P = F llv (x cll ,x cv )=a- l [A v (x cll + ±a,x cv )—A v (.x cll — ±a,x cv )-A ll (.x cll ,x cv +±a)+A ll (.x ctl ,x cv -±a)] 

+ 'Soyt A p < - x c t i’X cv -fa ),A v (x clt + f a,x cv )] 

+ ig 0 ±[A tl (x cll ,x cv +±a),A v (x ctl -±a,x cv )] + (Ha 2 ) , (87) 

where the dependence on coordinates other than x and x v is not shown explicitly. Summing over plaquettes, and 
noting that each plaquette is shared between two unit cells, we have for the discretized action 

ps=^a 4 Tr(F P 2 ) . (88) 

p 

Consider now an update in which the potential A f on a single link / is changed. By Eq. (87), for each plaquette PD/, 
the field strength F P is a linear functional of A {t while for all other plaquettes the field strength has no dependence on 
Af. Hence if we let (^} be the set of potential components Aj t and \if/} be all other potential components, then in 
terms of these variables the action of Eq. (88) has precisely the form of Eqs. (76b) and (80). Moreover, if we choose 
any canonical gauge fixing (or if we do not gauge fix), then the integration measure has the form 

dp = J1 f dA/Yl f dtp (89) 


required by Eq. (76a). Thus the conditions for validity of the algorithm of Eq. (82) are satisfied, and so an overrelaxed 
algorithm for the update A { ^A\ is 


W[A t -*AI]=J\rexp 



2 a 4 Tr( cosh 6Fp + sinhflFp ) 2 

PDl 


=jVexp 


-1 

o)(2—o)) 


2 a*Tr[Fp-(.l-a>)F P ] 2 , 

PDl 


(90) 


Although Eq. (90) exactly satisfies detailed balance with respect to the discretized action of Eq. (88), it is not exactly 
gauge invariant, and this limits its usefulness in simulations where maintaining exact gauge invariance is important. 
To get a computationally useful algorithm, we must construct an analog of Eq. (90) within the framework of Wilson's 
lattice gauge theory. 14 Because the lattice gauge theory action is not a multiquadratic form, it is not possible to con¬ 
struct an overrelaxed algorithm which exactly satisfies detailed balance with respect to the Wilson lattice action. 




Aft (*cftt x ev + 1 °) 



Ay (*«„+ l a, x e „) 


tfv- 


U , 










z 


ft 


FIG. 1. Plaquette and potential variables in the plane 
used to formulate Yang-Mills lattice field theory. 



FIG. 2. Unitary matrices associated with the links of the 
plaquette of Fig. 1, which are used to formulate SU(n) lattice 
gauge theory. 
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However, this is a stronger requirement than is needed, since the lattice action is in any case only an order-a 2 approx¬ 
imation to the continuum action, and any member of the equivalence class of local, gauge-invariant lattice actions 
which differ from the Wilson action by relative order-a 2 terms in the continuum limit is equally suitable as a lattice 
action. We will show that it is possible to construct an exactly gauge-invariant lattice gauge theory transition proba¬ 
bility by the procedure of Eqs. (76)-(78) above, which satisfies detailed balance with respect to an explicitly comput¬ 
able effective action differing from the Wilson action only by terms of relative order a 2 in the continuum limit. 

To carry out this construction we rewrite Eq. (90) as 


W[A t 


Aj]=JVexp 


f — sinh0cosh0 2 a 4 TT(Fp-\-F P ) 2 —(cosh 2 0—sinh0cosh0) 2 a^riFp) 2 

PDl PDl 


— (sinh 2 0—sinh0cosh0) 2 ct 4 Tx{F P ) 2 

PDl 


(91) 


and look for lattice gauge theory realizations of a^TriFp) 2 , a 4 Tc(F P >) 2 , and a 4 Tr(F / ,-+F / ,) 2 . Consider the plaquette P 
drawn in Fig. 1; in Fig. 2 we have redrawn this plaquette with the links labeled by the SU(n) matrices to which they 
correspond in lattice gauge theory. Let us assume that the link potential being updated is Af = A ti (x Cf x cv — ±a), or in 
terms of lattice gauge theory variables, Uf = U fi _. We define 


U P = U tl _U v+ 


Vp + U v _, 




u^u v _=u 


P \ u, 


.u; 


(92) 


Then the lattice gauge theory analog of Eq. (91) is 


W[U,^Ui]=JVex p 

— sinhdcosh# 2 Po 

1 —-Re TriUpUp ) 


PDU, 

rt 


— (cosh 2 0—sinh0cosh0) 2 Po 


PDU 


/ 


1 — —Re TrUp 
rt 


— (sinh 2 0—sinh0cosh0) 2 Po 


PDU 


I 


1-—ReTr U P 
rt 


=jVexp 


1 —co 


co(2—co) 


2 Po 


PDU 


/ 


1 — —Re Tr(U P U' P ) 
rt 


-j- 2 Po 

m PDU, 


1- — ReTrt/ P . 

rt 


1 —co 
co 


2 Po 

PDU, 


1 — —Re Tr l/p 

rt 


(93) 


with Re denoting the real part, with jV fixed by the re¬ 
quirement 

/ d[Ui]W[U, — Ui] = 1 , (94) 

and with the parameter fixed in terms of rt and the 
bare coupling g 0 by the usual relation 15 



2 rt 


We note that Eq. (93) is independent of the cyclic order¬ 
ing of the link factors in U P , as long as U P and U P are 
ordered in the same way; in other words, by cyclic in¬ 
variance of the trace we have 

Tr[( uu v+ uu v _ >< t/;_ U v+ UU v _ )] 

=Tr[( u v _ u U v+ U^ + X u v _ c/;_ U v+ U^ + )] 
=Tr[( uu v _ u v+ )( uu v _ t/;_ U v+ )] 

= ■ * ■ . (96) 


I 

The gauge invariance of Eq. (93) follows from the fact 
that since U, and Uj have the same behavior under 
gauge transformation, so do U P and U P : 

Up^-UgUpUg-', Up^UgUpUg-' , (97) 

and hence again by cyclic invariance of the trace the 
quantities Ty{U p U p ), Tr U P > and Tr U P are exactly gauge 
invariant. Finally, since the Uj dependence of Eq. (93) is 
of the form Tr(UjU) t with U a linear combination of 
SU(n) matrices, in the case n = 2 the efficient SU(2) algo¬ 
rithm of Creutz 16 can be applied to the generation of 
links Uj distributed according to W[U,—>Uj]. 

The argument that Eq. (93) is an acceptable algorithm 
now runs as follows: Comparing with Eqs. (76)-(78), we 
see that Eq. (93) has precisely the form of the general¬ 
ized algorithm, with (^J corresponding to U h with \xfj\ 
corresponding to the other links in the plaquettes 
P'DUiy and with S\ corresponding to those terms in the 
Wilson action involving plaquettes P’DU,. Hence Eq. 
(93) exactly satisfies detailed balance with respect to an 
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effective action, which differs from the Wilson_ action by 
a term proportional to \n( t A/[U lt {if;\;d]/J\/[{il/\ t d]), 
with jV the average of over £//. Since Eq. 

(93) only involves couplings of the link / to links in pla- 
quettes P containing /, and since the entire construction 
is manifestly lattice gauge invariant, the effective action 
is local and lattice gauge invariant. Suppose that we Can 
show that Eq. (93) differs from Eq. (91) by terms of rela¬ 
tive order a 2 (absolute order a 6 ) in the continuum limit; 
then to leading order (absolute order a 4 ) the normaliza¬ 
tion factor MUbWU 0 ] * s independent of U h since by 
translation invariance jV in Eq. (91) is independent of 
. It then follows that ln(J\f[U h \tp\',d]/JV[\tp\;d]) is 
of order a 6 in the continuum limit, and the equilibrium 
effective action for the algorithm of Eq. (93) is a member 


of the equivalence class of acceptable lattice actions. 

To verify that in the continuum limit Eq. (93) reduces 
to Eq. (91) up to an error of order a 1 , we start from the 
continuum limit of the individual link variables, 

U H- =exp[ig 0 aA tl (x cll ,x cv -±a)+O(.a i )] , 

U v+ =exp [tg 0 aA v (x +ta,x cv ) + 0(.a i )] , 

(98) 

Up+ =exp[ —ig 0 aA tl {x ctl ,x cv +ja) + O(a i )] , 
U v _=txp[-ig 0 aA v {x cll -\a,x cv )+OU i )} , 

with TrO(a 3 )=0 since the IT s are all SU(n) matrices 
and hence have unit determinant. For the products of 
adjacent links which appear in U P , we have 


4 > +8 

txp[ig 0 aA ll {x cll ,x cv -\a)]txp[tg 0 aA v (x Cfl + \a,x cv )]=e + + , 

4 >_ +8 

exp[-ig 0 a^ M (x CM ,x cv +|a)]exp[-/g 0 a^ v (x CM - \a,x cv )]=e 


(99) 


with 

< &+= i go<*A fi {x Cfi ,x cv -±a) + ig 0 aA v (x Cfi + ±a,x cv ) 

~ ygo 2 ^ 2 t T a >> A v^ x c P + jOfX cv )] , 

( 100 ) 

_ = - '8o a A n (^^cv+ \<* >-i A Men-~ i<*>x cv ) 

yg() ® [ A fi^ x cii* x c v - ! - ^ A v^ x cfi ■ 

The errors 5 + and 8_ satisfy 8 + =0(a 3 ), 8_=0(a 3 ), 
Tr8 + =Tr8„ =0. Moreover, since 5_ =8 + (a —a) and 
<!>_ =<I> + (a —► —a), we have that 5 + +5_=0(a 4 ) and 
that the commutator is odd in a. Hence for 

the plaquette product U P we have 

u P =u fi _u v+ u tM+ u v _ 

=exp(<I> + + <!>_ + ^[<P + ,<I>_ ] + 0(a 4 )) , (101) 

with y[<I> + ,<I>_]=0(a 3 ), with Tr0(a 4 )=O and [refer¬ 
ring to Eq. (87)] with 

<&++<&-.=igo<* 2 Fp ■ ( 102 ) 

For a general altered set of potentials A ' we have 

=exp|<l>' + + <!>'_ + + 0(a 4 )] , (103) 

again with |[<l>' + ,<l>'_]=0(a 3 ), with TrO(a 4 )=0, and 
with 

4*' + + 4> '-=®o a2 ^ • (104) 

From these equations we find 


I 

Tr U P =n -\g 0 2 a*Tr(.F P ) 2 +CHa 6 ) , 

Tr U' P =n — yg 0 2 a 4 Tr(Fp) 2 +O(a 6 ) , 

2 (105) 

Tr {U P U'p) = n --Lg 0 2 a*Tr{Fp+F'p) 2 + b+Cna 6 ) , 
A=Tr((<l> + +<l>_+<l>' + +<l>'_)(i[<l> + ,<l>_] 

which when A=0 can be combined with Eqs. (95) and 
(93) to give Eq. (91). (Because of the identity 
Tr(a[a,y])=0, terms analogous to A do not appear in 
Tr U P and Tr U ' P *) Since the error term A is potentially 
of order a 5 , to complete the derivation we must show 
that A=0. Now repeated use of the identity 

Tr(a[0,y])=Tr([r,a]/3) , (106) 

which follows from cyclic invariance of the trace, shows 
that A can be reduced to the form 

A = fTr|[<I>V —<I> + ,<!>'_—<I>_](<t> + + <!>_)] . (107) 

In general A^0, but for the special case in which A' 
differs from A by the change of only the single link vari¬ 
able A fi (x Cfiy x cv —\a) or equivalently t/ M _, we have 
<£'_=<!>_ and A vanishes. Hence we have verified that 
Eq. (93) is a suitable overrelaxed algorithm for lattice 
gauge theory, for the case in which a single link at a 
time is updated. 

To conclude this section, let us compare the small-o> 
continuum limit of the overrelaxation algorithm of Eq. 
(93) with the continuum limit of the lattice Langevin al¬ 
gorithm of Batrouni et al .; according to our analysis of 
Sec. I, these should correspond. Taking the continuum 
limit of the overrelaxation algorithm from Eq. (90), we 
have 
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Tr( Fp—F p + coFp ) 2 =Tr 


a ( A\ ~ A \)+ jigo[(A{ — Ai ), i4 ad j acent ] +o)F P 


(108a) 


with A adjacem the potential on the leg of the plaquette adjacent to and following A h Referring to Eq. (98), we recall 
that in the continuum limit g 0 a^ is the effective expansion parameter; approximating Eq. (108a) to leading-order ac¬ 
curacy in this expansion gives 

12 


Tr(F' P -F P +coF P ) 2 ~Tr 


( A i — A i )+ coF p 
a 


Jj_J_ 

2 a 2 


(A/ j —A/) 2 +—(A — A / )coFf> AI -independent . (108 b) 

' a 1 


In four dimensions there are six plaquettes P containing /, and so 

2 Tr {F' P -F P +coF P ) 2 ~-\{Ap-A{) 2 +-{Ap-Aj)co 2 F> + ^/-independent 


PDl 


a 


= 3 


a 


PDl 


±U!>-Aj)+f2,F{. 

a 6 PDl 


+ .4/-independent . 


Substituting Eq. (109) into Eq. (90), dropping .4/-independent terms and approximating 2—<y«2, we have 

[ . 6 f 1 — l 2 

W[Ai^> Al]?zJ\fexp ’ 1 1 


4 CO 


Ma;}-aI )+f 

a ® PDl 


(109) 


( 110 ) 


which can be rewritten as the stochastic difference equa 
tion 




6 


1/2 

1 



(111) 


Now the lattice Langevin algorithm of Batrouni et aL , 
in the notation used above, takes the form 




( 112 ) 


Ff =iT J 


^2 

PDl 



2 n 


Ti[TKU P -Ul)] + € W1 Vj 


Substituting 


U 


i 




(113) 


into Eq. (112) and working to leading order in the ex¬ 
pansion in powers of g§aA y we get, for the continuum 
limit, 


ig Q a{Ap-Aj)Ti~-iTl 


€ 



2n 


2 igo al H 

PDl 


+ <*) 



(114) 


which on substituting P ( g 0 2 /(2n)=\ and factoring away 
the generators T ] becomes 


MA;J-Aj)=- 

a ' 


7 

g0 PDl 


8 o' 


1/2 


a 


l^j 


(115) 


Equations (111) and (115) have precisely the same struc 
ture, and give the identification 


6 _ 

CO= - j€ , 

8o 


(116) 


again showing that the Langevin approach corresponds 
to the small-o> limit of the overrelaxation algorithm. 


IV. DISCUSSION 

In closing I comment briefly on the comparison be¬ 
tween the acceleration strategy pursued above and that 
proposed by Batrouni et aL 9 Let us adopt as the “figure 
of merit*’ for an acceleration scheme the ratio of its in¬ 
verse correlation time k to that for an co = 1 Jacobi itera¬ 
tion. As we have seen in Sec. II, for an optimally over¬ 
relaxed Gauss-Seidel iteration, the figure of merit is then 
N, the length of a side of the lattice in lattice units. By 
contrast, Batrouni et aL employ a Langevin method 
based on the Jacobi algorithm, and propose a method of 
Fourier acceleration in which the Langevin step size is 
taken to have a momentum dependence which compen¬ 
sates the critical slowing down at long wavelengths. In 
principle, their method can yield (up to logarithms) an 
inverse correlation time of A~ ea~\ with a the lattice 
spacing and e the small parameter which governs the 
Langevin step size. Thus, recalling Eq. (71), for the 
method of Batrouni et aL , the figure of merit can be as 
large as 

—— ~e(L/a) 1 =eN 1 . (117) 

ak i 

For lattices of moderate size, where SW — 1, the overre¬ 
laxation method should be competitive with Fourier ac¬ 
celeration, but for very large lattices the Fourier method 
wins out, irrespective of the step size e. Clearly, an op¬ 
timal algorithm would combine the advantages of both, 
by permitting a step size of unity, as in the overrelaxed 
Gauss-Seidel approach, while replacing the factor 
in Eq. (69) by a wave number of order a~ l . 
One possible way to try to achieve an improved algo¬ 
rithm is to combine overrelaxation with a mesh-doubling 
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lattice refinement scheme, as is done in the case of the 
minimization problem by the “hyper-overrelaxation” al¬ 
gorithm of Press 17 or the mesh-refinement-interpolation 
scheme of Adler and Piran. 7 A closely related approach 
is the “multigrid” Monte Carlo method advocated by 
Goodman and Sokal. 10 I hope to pursue these issues in 
future work. 

Note added 

In Sec. II we determined an optimum value of co —let 
us call it 6) b -defined as the value of co which minimizes 
the correlation time r. By definition, this value of co 
maximizes the asymptotic rate of decay of the correla¬ 
tion between two lattice configurations, as the “time” 
separation AT = AMa between the two configurations 
becomes infinite (a = lattice spacing, AAf=number of 
iterations separating the two configurations). However, 
in an actual Monte Carlo calculation this asymptotic de¬ 
cay rate is not the quantity which directly governs er¬ 
rors. What one does in a Monte Carlo calculation is to 
perform some total number M of iterations, but to only 
take every mth iterate as a member of the ensemble of 
configurations used for measurements, where m~r/a. 
Taking more configurations than this increases the 
amount of effort spent in measurement without improv¬ 
ing the statistics, since the additional configurations are 
not statistically independent, while taking fewer 
configurations than this needlessly dilutes the statistics. 
Hence the quantity to be optimized is the absolute corre¬ 
lation between two configurations separated by m itera¬ 
tions, not the asymptotic rate of correlation decay. 

This optimization problem also arises in the overrelax¬ 
ation solution of differential equations, and the solution 
is as follows: For the iterations i =0,1,. . . one uses 
overrelaxation with a sequence of relaxation parameters 
co x with co 0 = \ and with co;—>co b for large /. In the case 
of iterations based on “odd/even” or “checkerboard” or¬ 
dering, as opposed to the “typewriter” ordering used in 
Sec. II, the optimum <y ( ’s can be computed explicitly in 
terms of i and co h using Chebyshev polynomials. In the 
Monte Carlo application, one would use a “sawtooth” 
pattern of co' s, returning co to 1 for the initial iteration 
after each configuration selected for measurement, and 


then stepping through the first m members of the Che¬ 
byshev or other optimal sequence. Taking co=co b for all 
iterations can actually make the correlations worse after 
a finite number of iterations than simply using co= 1, 
while the simple expedient of taking co 0 = 1 and 
co. > t =co b already guarantees monotonically decreasing 
correlations. For a brief and lucid discussion of these is¬ 
sues see Hockney and Eastwood, 18 while for a detailed 
theoretical analysis see Vargas. 19 A simple, explicit, 
“checkerboard” iteration version of the calculation of 
Sec. II has recently been given by Neuberger, 20 and the 
Chebyshev method is directly applicable to Neuberger’s 
scheme. 
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APPENDIX: EVALUATION OF THE INFINITE-SPACE 

GREEN’S FUNCTION 

We evaluate here the time dependence of the Gauss- 
Seidel Green’s function of Eq. (60), in the infinite-space 
limit in which the mode functions are 

W*> = \ d/ 2 e fk '*, k * = 2 VS ’ (A1) 

Substituting into Eq. (60), and noting that because of 
translation invariance there is no loss of generality in 
setting x'=/'=0, we have 


4 2 a d ~ X r „ e~pi-p*-n/2_ -qi -qx- n/2 

G{x y 0;t,0) = ^- — - - — f d d ke ik -* ----«f + L x -n), 

d co (277 r J Q —p 

d 

n = ( 1,1,. . . , I), p =2[C —(C 2 —k 2 /d) l/2 ] , (A2) 

q=l[C+(C 2 -k 2 /d) x/2 ] J ^ 2 =2^ 2 . 

We wish to evaluate the Fourier transform g ( l f t ) defined by 

GOc,<UO)= 4--^-4 f . (A3) 

d co (2irY J 

Taking the inverse Fourier transform of Eq. (A2), the x and k integrations in the d — 1 directions perpendicular to n 
can be done immediately, leaving [with n=</ 1/2 n, x =x n, / =/ n, k =k*n, (/ i ) 2 =/ 2 — 1 2 \ 
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„ j- ^ „ -pi -pd Xn x/2 a -q\-qd Xn x/ 2 

g(l,/ 1 ,t)= f ^-e~ i,x f dke ikx - --- 0U+\d W2 x) , 

" 2ir J 


q-p 


P 

Q 


= 2(CT\C l -[k 2 +(l L ) 2 ]/d\ U2 ) . 


1/2 


To do the x integration, we make the change of variables 
t +\d x/2 x =u , 


471 


(A4) 


(A5) 


giving 


g (l,I\t)=-^f dk e ~ iik ~ l)2,/d ' n i u , 


trd 


— oo 


/„= rdu e ak ~ i)2u/d ' n g qu =—L- 

J n q —p q —p 


o 


1 


1 


p -i{k -l)2/d 


1/2 


q —i (k —l)2/d 


1/2 


(A6) 


pq —i(k -imp +q )/d 1/2 -4{k -l) 2 /d ' 


Substituting p and q from Eq. (A4), and setting k —l = w, we are left with the single integral 


g{l,l l ,t) = 


1 


vd 


1/2 


/" dw 


-lwll/d Xn . 


1 


— OO 


4[l 2 +(l L ) 2 -\-2lw]/d —iw%C /d l/1 


(A7) 


Since the denominator has a single zero in the lower half of the w complex plane, for f <0 we can close the contour up 
to get g=0, while for t >0 we can close the contour down to get the answer quoted in Eq. (75) of the text. 
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Let u —Mu+Nfbe a general linear iterative process for solving the system Lu "/, with L ~Z T and 
with 1 -A/+TVZ. Provided that T= 7 (L —ML ~'A/ T ) ’’ is a positive-definite matrix, it is shown that 
one can explicitly construct a corresponding stochastic algorithm which satisfies the homogeneous-state 
condition with respect to the probability distribution exp( — pS), where S~ ]ru T Lu—f T u. When 
M J L ™ LM , the algorithm also satisfies the detailed-balance condition. 

PACS numbers: 11.15.Ha, 02.70.+d 


Over the last few years there has been considerable in¬ 
terest in the idea of the construction of accelerated 
Monte Carlo algorithms by analogy with acceleration 
schemes for solving deterministic systems of equations. 
For the physically interesting Case of quadratic or multi¬ 
quadratic actions, I showed 1 a while ago that one can 
construct an exact stochastic analog of the classical suc¬ 
cessive overrelaxation method, and this observation has 
had a number of applications. 2 Recently, several authors 
have proposed applying more powerful methods for 
deterministic systems, such as the multigrid iteration 3 or 
fast Fourier transform direct-solution techniques, 4 to the 
problem of accelerating Monte Carlo calculations. In 
this Letter I show, in the case of a quadratic action, that 
all of these proposals are closely related, and in fact are 
special cases of a theorem relating the most general 
linear iterative process to a corresponding stochastic al¬ 
gorithm. 

Theorem ,— Let L “ L J be a real, symmetric /x/ ma- 

P(u— ► u) “(/?/*) //2 (detr) 1/2 exp[ — (u' — Mu 

r-}(L“ 1 -A/L“ 1 A/ T )" 1 -r T , 

which satisfies the homogeneous-state condition 

fdue-f sM P(u—u , )~e- fiSb, '\ ( 6 ) 

The transition probability of Eq. (5) satisfies the 
stronger detailed-balance condition [which is sufficient 
but not necessary for Eq. (6)] 

e -#Hu) p(u _*„*)-« -AS<-Vfo u ) (7) 

if and only if M J L “ LM , or equivalently TV -TV t . 

Remarks .— (l) The overrelaxation 5 and multigrid 6 
methods are special cases of the iteration of Eq. (4) with 
and with spectral radius p(M) < 1, while direct 
solution methods correspond to taking M “0, TV“L 
The matrix T generalizes the temperature rescaling 
found, in the case of stochastic overrelaxation, in Ref. 1. 
The particular significance of stochastic overrelaxation. 


trix and / a real /-dimensional vector, from which we 


construct the action 

S(u) - tw t Zw“/ t m, (1) 

with variational equations 

Lw-/. (2) 

Let us consider the general linear iteration for solving 
Eq. (2), 

u‘-Mu+Nf, (3) 

where M and TV are a splitting of L defined by 

1-TI/+TVL. (4) 


Then provided that T as defined below is a positive 
definite matrix, corresponding to Eqs. (3) and (4) we can 
construct a stochastic process with normalized transition 
probability 


(5) 


within the general framework of the theorem, is that one 
Can show that it is the unique limiting Case of the algo¬ 
rithm of Eq. (5) in which the node variables are updated 
one at a time. 

(2) Equation (5) can be equivalently written as a sto¬ 
chastic difference equation. Letting 0 be the real, or¬ 
thogonal matrix which diagonalizes T, 

0 T r0 “ y “diagonal, (8) 

we have 

u-Mu +TV/+ (2/3 1/2 ) ~ l Oi 7 , (9) 

with I},- Gaussian noise variables normalized according to 

irurij) “2(y _1 ),y. (10) 

(3) The positivity condition on T can be reexpressed in 


-NfVBTiu'-Mu-Nf)], 


Reprinted with permission. 
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a number of equivalent forms. Rewriting T as 
r-±L m (l-S T S)-'L u \ 

( 11 ) 

S-L~ m M J L u \ 

we see that T is positive definite if and only if S T S has 
no eigenvalues larger than 1. This requires 

\\S\\ 2 ~p(S T S)~p(LML~ i M T )<\, (12) 

with p( ) (as above) the spectral radius and || || the spec¬ 
tral norm. 7 Convergence of the iteration of Eq. (3) re¬ 
quires 

p(M) -pU/ T ) -p(S) -p(S T ) < 1, (13) 

but since p(5) < ||S|| for a non-Hermitian 5, Eq. (13) 
does not imply Eq. (12). 

(4) Even when positivity of T can be demonstrated, 
the practical implementation of the algorithm depends 
on the ease of constructing the matrix 0 of Eq. (8) 
which diagonalizes T, or equivalently, of find the matrix 
T = Oy ~ ! ' 2 which factorizes T“ 1 according to 

r “i-7T T , (14) 

so that the generalized noise Can be constructed as 

( 77 * 77 )) "* 25 , 7 . (15) 

Conversely, in cases (such as those discussed in Refs. 
1-4) in which a stochastic differential equation has been 
constructed with Eq. (l) as its equilibrium action, the 
factorization of Eq. (14) is explicitly established and this 
guarantees the positivity of T. 

(5) Although the theorem applies only to quadratic 

actions, it is directly relevant to the Yang-Mills action 
and other multiquadratic 8 interacting theories. More 

generally, most practical methods for dealing with non¬ 
linear problems are based on generalizations from linear 
methods, 9 and so our theorem can be expected to have 
implications for the development of Monte Carlo algo¬ 
rithms for nonlinear problems. 

Proof of the Theorem. — We assume in intermediate 
steps of the proof of existence of M but since the final 
results only involve M y the case where M is not invertible 
Can be obtained by continuity as a limit from the case 
where Af -1 exists. Making the shifts u mt v-\-L~ 1 f 
u' mt v , + L~ i f \n Eqs. (6) and (7) permits one to factor 
away the explicit f dependence; hence it suffices to prove 
the theorem in the case /“0. Doing the integral in Eq. 

(6) by completing the square gives us 

(detr/detf) 1/2 e ~^ u '\ 

S(u’) =u' T Tu'-(M T Tu') T r~ 1 M T ru', (16) 

T= \ L + M T rM. 

Equating the left- and right-hand sides of Eq. (6), we get 


two conditions: (i) overall normalization, 


detT“detf; 

(17) 

(ii) w' T • ■ ■ u term in exponent, 


r-1 L=rMr~ l M J r. 

(18) 

From the definition of r we have 


f“ tL=M t VM\ 

(19) 

if we multiply Eq. (18) by M on the right and use Eq. 
(19) to eliminate M J TM we get 

TM- \LM = TMT *(f-H) 


-YM- f TMT~ X L. 

(20) 

This implies 


T=LML 

(21) 

the determinant of which gives Eq. (17). 
tion off by use of Eq. (19) gives 

The elimina- 

r =LML ‘ 1 ( \LM “ 1 + M J V) 


- yL + LML-'M^T, 

(22) 


which can be immediately solved to give the result T 
quoted in Eq. (5). For T, we then get 

r=HL~'-L' 1 M T LML~')~\ (23) 

If in addition to the stationary-state condition we im¬ 
pose the detailed-balance condition on P , we get the ad¬ 
ditional constraints: (iii) u T • • • u term in exponent, 

T“f; (24) 

(iv) u T • • • u ' term in exponent, 

TM-M t r (25) 

Substituting Eqs. (24) and (25) into Eq. (16) for f, we 
get 

r-jL+ni/ 2 , (26) 

which implies 

(27) 

The comparison of Eq. (27) with Eq. (5) then implies 
L ~ x M t ^ML or equivalently 

M J L =LM. (28) 

Since by Eq. (4), N~L “ 1 — ML _1 , Eq. (28) is also 

equivalent to 

7V-7V t . (29) 

This work was supported by the U.S. Department of 

Energy under Grant No. DE AC02-76ERO2220. 
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We study the overrelaxation algorithm developed by one of us (S.A.) for the SU(2) gauge theory in 
four dimensions. We find improvement in the decorrelation time for the plaquette by a factor of 2 to 3. 
However, our results suggest that the Monte Carlo application of overrelaxation behaves quite differ¬ 
ently from the classical differential-equation analog. In particular, the optimum value of the overrelaxa¬ 
tion parameter co may depend on what order parameter one measures. In addition, there are order pa¬ 
rameters that do not appear to improve by overrelaxation. 

PACS numbers: 11.15.Ha 


Statistical systems close to criticality are difficult to 
simulate. This is because most simulation algorithms use 
local updating which results in a highly correlated se¬ 
quence of configurations. Information about the update 
spreads amongst the variables via a diffusion process 
and, hence, to decorrelate the system over length scales 
of the order of the correlation length £ takes on the order 
of 

t ~£ 2 ( 1 ) 

steps, r in Eq. (1) is usually called the decorrelation 
time. 

Several methods to decrease r have been suggested. A 
very popular recent method is to use overrelaxed algo¬ 
rithms. 1 Here, in analogy with methods invented for 
differential equations, an overrelaxation parameter 
co € Il,2) is introduced into the simulation. The Markov 
process is defined so that for any co one obtains the 
correct asymptotic distribution, co ~ 1 corresponds to the 
usual (nonoverrelaxed) updating, co > 1 gives overrelax¬ 


ation. In this Letter, we test a specific overrelaxation al¬ 
gorithm for pure-gauge-theory simulations 2 suggested by 
one of us (S.A.). It is simplest to implement for the 
SU(2) gauge theory and so we restrict ourselves to this 
case in d ™4 dimensions with periodic boundary condi¬ 
tions on a hypercubic lattice. 

The aim is to generate independent configurations dis¬ 
tributed as 


P(Uj) ~exp[ —/3S(L//)] , 


with 


S(E//)« Z 

POUi 


1 — i tr(t//.) 


( 2 ) 

(3) 


Here, the U\ s are link variables and Up is the ordered 
product of the link variables around a unit lattice square 
(plaquette). It was shown in Ref. 2 that an overrelaxa¬ 
tion parameter can be introduced into the Markov evolu¬ 
tion by the generation of new values Uj from U / accord¬ 
ing to the probability function 


W(Ui~+ Uj) “JVexp 


1 — co 


. , £ /8ll — f tr(C//>C/»] - — £ 0[1- rtr(C/»] 

co{2 — co) pou, co pou, 


1 ~ co 
co 


£ fill - t tr(C//>)] 
POU, 


(4) 


with JV fixed by the normalization condition 


f diu;] un -1. (5) 

The replacement C//—► U\ is accepted with probability 


f 


P A "min 


1, 


W(U/— U,)expl-fiS(U/)l ' 
W(U/-> U/)exp[-0S(U/)l 



It is easy to see that this procedure satisfies detailed bal¬ 
ance with respect to the desired Boltzmann distribution 
of Eq. (2). 3 
Pa can be simplified to 


P 


A 


min* 


1, 


jsr 



and M can be explicitly evaluated [for the SU(2) case] 
to be 


M-Ae~ R 


I\(,2k) ’ 


( 8 ) 


with A a numerical constant, I\ the modified Bessel 
function of first order, 

k * (DetA/) 1/2 , (9) 


M 


2co pou t 


U s 


-1 + 


1 ~ CO 

2 — co 



( 10 ) 


Reprinted with permission. 
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where Up — UjUs , and 

R -t~ r Z Z /3[l — T tr(C//>)]. 

co\2 —co) pdu, co pdu, 

( 11 ) 


The simulation procedure is to preselect a U{ with Eq. 
(4) and then to use Eq. (7) to decide whether or not to 
make the replacement C//—► U}. Since W of Eq. (4) is 
linear in U\ one can generate it by a heat-bath algorithm 
(we use the method of Kennedy and Pendelton 4 ). Also, 
the whole procedure is readily vectorizable. Finally, al¬ 
though the formulas above [Eqs. (8)—(11)1 look formid¬ 
able, they are easy to implement in a real simulation 
since R and M are readily available in the course of a 
standard heat-bath update. The increase in computer 
time for each update over the co = 1 case was about 20% 
to 25% in our study. The main extra work is in the gen¬ 
eration of I u which can be vectorized. 

Suppose one is measuring the expectation value of 
some operator 0. Consider first the case when 0 is the 
plaquette operator j tr (Up). This operator probes the 
field on scales of the order of a unit lattice spacing. 
When £<Cl (i.e., is small), the decorrelation time will 
be small and no improvement will be necessary since one 
update is sufficient to decorrelate on length scales of or¬ 
der unity. On the other hand, when £» 1, the plaquette 
is a very local operator and again should decorrelate 
quickly. The worst case for the plaquette is when £~-l. 
Generalizing from this, if an operator gets dominant con¬ 
tributions from variations of the field over length scales 
of the order of /, then its decorrelation time will be max¬ 
imum when What this argument suggests is that 

the optimum value of the overrelaxation parameter co 
may have to be determined separately for each operator 
one wants to measure. This is very different from the 
differential-equation case where there is a unique mean¬ 
ing to the “best value of co” based on optimization of the 
decorrelation of the longest-wavelength modes. 5 Indeed, 
there may even be operators that are sensitive to many 
length scales whose decorrelation time cannot be im¬ 
proved by overrelaxation. 

Let us measure the expectation value E(I t p) — ( 0 ) of 
0 from N successive configurations after 7 0 thermalizing 
updates of the lattice. I labels the iteration number, 
7 = /o+ l,/o+ 2,. . .,/o+A. One defines the autocorrela¬ 
tion function C(J,&) as 

<iaj 


Z)(7,/3) 


/©+/v —/ 

Z lE(l,p)-E(p)] 

/-/ o+l 


x[£(/+J,/3)-£(/3)I. (12b) 

Here E(fi) = 7V ~'2/*-/C +1 E(I>P) is the average value 


of E in the simulation. 

If the system is dominated by a single decorrelation 
time r, then for large 7, 

C(7,j8)=e‘ y/tW . (13) 

In Fig. 1 we show C(5,/3) and C(10,/3) for the plaquette 
as a function of fH at co**\ on 4 4 , 6 4 , and 8 4 lattices. 
Note that £ is a monotonically increasing function of /?, 
and that for fixed 7, r is a monotonically increasing func¬ 
tion of C [see Eq. (13)]. Hence, Fig. 1 is a practical 
realization of the argument presented above which pre¬ 
dicted a peak in r as a function of £. Indeed, it is known 
that in the SU(2) theory, £~-l near fi — 2.2 and this is 
just where the peak is. Incidentally, this peak is also 
correlated with the peak in the specific heat and the loca¬ 
tion of the crossover from strong to weak coupling. 

Figure 1 also shows the analogous situation for the 
magnitude of the Polyakov loop. Note that now, C(5,/3) 
and C(10,j6) become nonzero only for larger values 
and the “turn on” value of /3 is different for different lat¬ 
tice sizes. Note also that there is no peak in C(j y f$) for 
the Polyakov loop as a function of 0. This implies that 
the Polyakov loop gets contributions from all length 
scales on the lattice up to the lattice size. One might 
suspect that such operators might not have their correla¬ 
tion time decreased by overrelaxation methods, because 
as shown in Refs. 2 and 5, optimized overrelaxation 
speeds up the decorrelation of the longest-wavelength 
modes while simultaneously slowing down the decorrela¬ 
tion of the shortest-wavelength modes. 

To study overrelaxation, we first studied C(5,j6) and 
C(10,j6) for the plaquette at =2.2 on lattices of size 4 4 , 
6 4 , 8 4 , and 12 4 . The overrelaxation was done by one 



FIG. 1. Autocorrelation function CU) at 7 — 5 and 7 — 10 
for the plaquette (Plaq.) and the magnitude of the Polyakov 
loop (| Pol. |) at 0 )= 1 as a function of /?. Circles, plusses, and 
squares represent data for 4 4 , 6\ and 8 4 lattices, respectively. 
Note the peak in C near /3=2.2 for the plaquette and its ab¬ 
sence for the Polyakov loop. 
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FIG. 2. The bottom two plots show C(5) for the plaquette 
on 8 4 and 12 4 lattices at 0 = 2.2 as a function of co. Note the 
sharp improvement in decorrelation time near ©=1.025 for an 
8 4 lattice and near ©=1.05 for a 12 4 lattice. The upper plot 
shows C(10) for the Polyakov loop. There is no improvement 
in decorrelation time in this case. 


sweep of the lattice with co = 1 followed by four sweeps 
during which the overrelaxation parameter was increased 
linearly to its target value. 1000 such sequences of five 
sweeps were done at each 0 value starting from an or¬ 
dered configuration (£// —1 V/) and C(5,0) and C(1O,0) 
were computed from the last 4000 lattices generated. 
The idea of changing co in this sawtooth fashion was 
motivated by the Chebyshev acceleration schemes in 
partial-differential-equation theory. 6 The results are 
shown in Fig. 2. Only results for an 8 4 and 12 4 lattice 
are shown because we did not see any improvement for 
smaller lattice sizes. The optimum value of co is deary 
near © = 1.025 for an 8 4 lattice and near ©“1.05 for a 
12 4 lattice. If r is estimated from Eq. (13), the improve¬ 
ment is a factor of 2 or 3 in decorrelation time for the 
average plaquette. Figure 2 also shows results for a 
similar simulation for the magnitude of the Polyakov 
loop on an 8 4 lattice at 0 = 2.5. Here, there does not 
seem to be an improvement. 

We have also computed, at (5 = 2.2, the autocorrelation 
function for the following two order parameters: The 
average adjoint plaquette, 

£ i(0) = {■ (Itr(£//>)] 2 ), (14a) 

and the plaquette-plaquette correlation function at dis¬ 
tance unity, 

E 2 U 3) - i (trlC/|>(x)]tr[C/|>(x + /i)]>. (14b) 

Here fi denotes a lattice axis direction and the correla¬ 
tion of Eq. (14b) is computed for face-to-face plaquettes. 
Since these are both order parameters dominated by 
short-distance effects, they should improve if Eq = y 
x (tr(t//>)). This is indeed true and is shown in Fig. 3. 


E 1 e 2 



FIG. 3. C(5) for the order parameters E\ and Ei [see Eq. 
(14) of the text] at 0 = 2.2 on an 8 4 lattice. Note the improve¬ 
ment near o> = 1.025 in all cases. 


Finally, one must address the issue of acceptance 
rates. Since the optimum value of co is close to unity, the 
acceptance rate at the optimum co is close to that at 
©“1. In Fig. 4, we show the acceptance rate relative to 
© = 1 as a function of 0 at co = 1.4 and as a function of co 
at 0=2.5 on an 8 4 lattice. We found that the accep¬ 
tance rate was almost independent of lattice size. 

In the classical 6 (or free-field 2 ) analysis, the optimum 
co is related to the lattice size for large lattices by 

<Uopt ~ l + C/L ' ° 5) 

with C an appropriate constant. Thus, as the lattice size 
L increases the optimum co also increases, approaching 
©opt™2 in the large L-limit. This trend can be seen in 
the two lattices for which we have determined © opt for 
the plaquette, but Eq. (15) only roughly holds: The cal¬ 
culation of C from C = L(2/© op t — 1) gives C* == 8,11 for 
the L =8,12 simulations. Since C/L is of order unity for 


0 = 2.5 co = 1.4 



FIG. 4. The left-hand plot shows the acceptance at 0=2.5 
as a function of o> on an 8 4 lattice. The right-hand plot shows 
the acceptance at © = 1.4 (relative to the acceptance at ©=l) 
as a function of 0, also on an 8 4 lattice. 
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these lattices, the 1 /L 2 corrections to Eq. (15) can be ex¬ 
pected to be important, and so the variation in the fitted 
C values is perhaps not surprising. The value of C ob¬ 
tained is considerably larger than the classical value 
C «3 for similar boundary conditions, and in accor¬ 
dance with our discussion above, is likely to vary with 
the order parameter being probed. 

In summary, we have presented a study of overrelaxa¬ 
tion which shows that it can be a useful tool in the study 
of certain order parameters. Left unanswered are the 
questions of how to determine a priori which measure¬ 
ments benefit from overrelaxation and which do not, and 
how the systematics of optimizing co differs from the 
classical case. It will be important to resolve these is¬ 
sues, by further empirical study or theoretical analysis, 5 
before overrelaxation methods are applied to large-scale 
simulations in gauge theory. 
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ALGORITHMS FOR PURE GAUGE THEORY 
Stephen L. ADLER 
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In this talk I review recent progress on algorithms for pure gauge theory, concentrating mainly on overrelaxation 
methods, with some brief comments on Fourier accelerated Langevin and multigrid algorithms at the end. 


1. THE CRITICAL SLOWING DOWN PROBLEM 

Let m be the mass gap or characteristic mass scale of 
the physics being studied, and let a = L~ x be the lattice 
spacing. The ratio of the characteristic length m~ 1 to a 
is called the correlation length 

4 = m~ 1 /a = L/m , (1) 

and becomes infinite in the continuum limit L —► oo. In 
a Monte Carlo calculation, noise introduced at x,t must 
difFuse to x\ \x’ — x\ ~ £ to yield an independent lattice 
configuration; this requires a characteristic "decorrelation 
time" or “autocorrelation time" r. In the free field case 
r more generally 

r ~ V , (2) 

with z a dynamical critical exponent. Let W be the 
computational work needed to get an independent con¬ 
figuration. For an ideal “fast" algorithm 

W ~ L d x logs of L , (3a) 

but from eq. (2) we in fact have 

W 

— rxj -v L z as L — ► oo , (36) 

and so if z > 0, W is much larger than ideal for large 
lattices. This is the “critical slowing down problem," 
ways of alleviating it are the subject matter of this talk. 

2. RELATION BETWEEN THERMALIZATION AND 
MINIMIZATION 

In Monte Carlo we study the partition function 

* = / m ^ S(U) , ( 4 ) 


and corresponding Green’s functions, by generating a se¬ 
quence of configurations {(/*} distributed according to 
the probability distribution 

P{U) = e-^ u > . (5) 

Regarding /? as T -1 with T a “temperature," let us con¬ 
sider the T —► 0 or /? —► oo limit of eq. (4). In this limit 
fluctuations in U are frozen out, and z is dominated by 
the minimum of S(U) at which 

Su S(U) = 0 . (6) 

Any Monte Carlo algorithm, in the /? —► oo limit, be¬ 
comes an algorithm for the minimization problem of 
eq. (6); conversely, we may expect that certain algo¬ 
rithms for eq. (6) can be generalized to finite /? to provide 
useful new Monte Carlo algorithms. 

з. THE GAUSSIAN OR FREE FIELD MODEL 

To develop the relation between thermalization and 
minimization, it is instructive to consider in detail the 
Gaussian or free-field model. 

3.1. Gaussian minimization 

Let L = L T be a real, symmetric l x l matrix and 

и, / real ^-dimensional vectors, and consider the action 

S(u) = - u t Lu — f T u , (7) 

for which the variational equation £ u 5(u) = 0 gives the 
linear system 

Lu = f . (8) 

A general linear iteration for solving eq. (8) is defined by 

U „+1 = Mu n + Nf , (9) 
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with M y N a “splitting" of L which obeys 

1 = M + NL . 


( 10 ) 


The condition of eq. ( 10 ) guarantees that if u n+1 


Uoo, 


then 


Uoo — Mu oo + Nf 


( 11 ) 


is equivalent to Lu^ = /, so that is the desired 
solution of eq. ( 8 ). Subtracting eq. (11) from eq. (9), 
we have 


K+i - u «>) = M(u n - u^) ; 


( 12 ) 


hence errors in the initial guess decay as M n , and the 
dominant error decays as p n , where 


p = “spectral radius" = \Mi\ , 
Mi = eigenvalue of M . 


(13) 


The iteration converges if p < 1 which requires Mi — 
1 -e t -, 2 > > 0 and the dominant error clearly decays 

with characteristic time 


Terror decay ~ £ min 


(14) 


But from eq. ( 10 ), 1 — M = NL and hence we expect 

£min ~ L min = smallest eigenvalue of L , (15) 

i.e., the smallest eigenvalue of L gives the slowest de¬ 
caying errors. For example, in the dimension d free 
field case, where L is the discretization of Vj, we have 
L min ~ a 2 ~ L~ 2 , which implies r err(M . decay ~ L 2 , or 
z = 2 . 

3.2. Gaussian thermalization 
The minimization algorithm of eq. (9) for the ac¬ 
tion of eq. (7) can be generalized into a Monte Carlo 
algorithm as follows . 1 Let P(u —► u') be the normalized 
probability for the transition from u = u n to u* — u n+1 , 
given by 

P{u->u')= (f)' /2 (det r) 1/s x 

exp [-(«' - Mu - Nf) T p T(u' -Mu- JV/)] , 

(16a) 

with r = r T a generalized inverse temperature matrix 
related to L and M by 


r = i (L- 1 -ML- 1 M T y 1 . 


(166) 


I du e-0 s < u > P(u -» «') = e -^ s(u ') , (17) 

which is necessary for Monte Carlo; P satisfies the 
stronger detailed balance condition 

e -0s(u) p(u _» „') = e -0S[u') p^ u , 5 (18) 

which is sufficient but not necessary for Monte Carlo, if 


M t L = LM 


N — N T . (19) 


3.3. Stochastic difference equation form of Gaus¬ 
sian thermalization 

Let O be the real, orthogonal matrix which diagonal¬ 
izes T, so that O t TO = 7 = diagonal . Then eq. (16) 
can be written as a stochastic difference equation 

u' = Mu + Nf + O , (20) 

with rji Gaussian noise normalized as (^i)^ = 2£^. It¬ 
erating eq. ( 20 ) n times we see that the autocorrelation 
( u N+n u N ), averaged over ujy, behaves as 


(u N+n u N ) V ' UN ~ M” . 


( 21 ) 


Hence, as first shown by Goodman and Sokal , 2 in the 
Gaussian case the stochastic decorrelation time r is the 
same as the deterministic error decay time T„ rar decay , 
and thus from eq. (14) we have 


r ~ £ 


mm * 


( 22 ) 


3.4. Local algorithms—minimization 
Consider now the specialization of the iteration of 
eq. (9) which acts only on a single node variable u k . As 
a function of this node, 

S(u) = A k (u k - C k ) 2 + B k , 

Ak, Bky Ck independent of uk , (23) 

and for stability we assume A k > 0 . The simplest lo¬ 
cal minimization algorithm is the Gauss-Seidel iteration 
Uk —► u' k — Ck, in which one replaces the old value 
of u k with the value Ck which minimizes the action. 
The Gauss-Seidel algorithm discards the information con¬ 
tained in the old value of it*; a more efficient algorithm, 
when coherent effects over the whole lattice are taken 
into account, is to replace u k by a linear combination of 
Ck and the old value, 


A straightforward calculation shows that P(u —► u’) sat 
isfies the homogeneous state condition 


Uk -► u' k = u Ck + (1 - w)u k - 


(24) 
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Equation (24) is called the overrelaxed Gauss-Seidel algo¬ 
rithm, and ut is called the overrelaxation parameter. The 
iteration of eq. (24) will cause the action to decrease if 

0< 5(...) — 5(tt£, ...)== 

(25) 

A k {u k -u' k y (£-i) , 

which requires 0 < u < 2 . Theory shows that conver¬ 
gence is optimum for a value utopt which scales, for large 
Lt as 

, (26) 

1 -r l 

with Copt a constant. 

3.5. Local algorithms—thermalization 
Since the algorithms of sect. 3.4 are special cases 
of eq. (9), they will have thermal generalizations. The 
thermal version of the Gauss-Seidel algorithm is the heat 
bath, for which P{ tt* tt£) is 


for the overrelaxed algorithm has been calculated . 4 ’ 6 ' 6 It is 
convenient to Fourier analyze and look at the submatrix 
M k for wave number k; in Neuberger's 6 calculation, using 
red-black ordering with periodic boundary conditions, 
is a 2 x 2 matrix and the analysis is particularly simple. 
For the Gauss-Seidel or heat bath algorithms one finds 



r lT 1 ~ e k = const X a 2 k 2 , 

(29) 

where 

kfnin — O(l), kfnax " C?(L) , 

(30a) 

and so 

"Hong wavelength ~ ^ > 

(306) 


^ahort wavelength 1 > 


and the critical exponent z defined by eqs. ( 1 ) and ( 2 ) is 


z = 2 . For the overrelaxed Gauss-Seidel and heat bath 
algorithms, there is an optimum value u ; = uj^ t for which 


P(u k ->u’ k ) = (e^) 1/J e-^*K - c *> 1 
= const x , 


(27) 


and is independent of the old node value tt*. The thermal 
generalization of the overrelaxed Gauss-Seidel algorithm, 
proposed by Adler 3 in 1981, is 


/>(«»-«i)= 

ex P Kfc - <*> c k - (1 - 0 >)u*] 2 } , 


(28) 


which satisfies detailed balance with respect to the action 
5 of eq. (23). Again, u must lie in the range 0 < w < 2. 
For 1 < w < 2 , the mean u k in the distribution of eq. (28) 
isu^ = C*+(ti;— l)(Cjfe—u*) and is displaced beyond C* 
in the opposite direction from the old value u k , hence the 
name overrelaxation. In addition to displac ing the m ean, 
eq. (28) reduces the width by a factor of yju(2 — u>) < 1 
as compared with heat bath. The case w = 2 is of spe¬ 
cial interest; it corresponds to a deterministic reflection 
of u k around C k , since for = 2 the width of the dis¬ 
tribution is zero and the mean is u k — C k + C k — u k . 
The corresponding action change [eq. (25)] is AS = 0 
for u = 2, and so the u = 2 limit of eq. (28) is micro- 
canonical, or action conserving. This fact is exploited, as 
discussed below, in the Brown and Woch, Creutz SU(n) 
implementation of overrelaxation. 

3.6. Eigenvalue analysis of the iteration matrix M 
for VJ 

In the free field case, the eigenvalue spectrum of M 


Tjf 1 = e k = const x knm , all k , (31a) 

and so 

T «ll wavelengths ~ L (318) 

which implies z — 1! Note that while overrelax¬ 
ation speeds up the decorrelation of the long wavelength 
modes, it slows down the decorrelation of the shortest 
modes. As noted above, for large L the optimum or scales 
as = 2/(l + C 0 p ( /L), and the detailed analysis shows 
Copt ~ Knin/<Pf 2 . More generally, for the family scaling 
towards 2 for large L asw = 2/(1 -f C/L ), C / Copt, 
one still has z = 1, but the coefficient of proportionality 
r/{ is not optimal. Thus, in the free-field case, over¬ 
relaxation reduces z = 2 to z = 1, and goes half-way 
towards solving the critical slowing down problem. 

3.7. w-fixtng 

In the minimization problem, it can be shown 7 that 
ui = 1 maximizes the initial rate of error decay, while 
the choice u ; = maximizes the asymptotic rate of 
error decay for large computational time, but can actu¬ 
ally increase errors for some finite number of early iter¬ 
ations. Hence the optimal strategy to get the small¬ 
est error at each stage of computation is to use for 
sweep i an u value w it with u> 0 = 1, 1 < uii < Uopt 
for i > 1 , and = ujgptl under certain assump¬ 

tions about the errors, the sequence o; t - can be explic¬ 
itly constructed in terms of Chebyshev polynomials. In 
the Monte Carlo case, the noise injected at each up¬ 
date behaves as a new initial error term, and so the 
decorrelation strategy suggested by Chebyshev overre- 
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laxation is to use a “sawtoothed” sequence of u/s, 
1, 1 • ■ • ?? • • ■ ? 1? ? • ■ ■ ? ? 4 ■ m With 

u>i,l < i < N increasing towards and with the 
sequence length N chosen of order the effective decorre¬ 
lation time r. (Note that this gives an implicit, self- 
consistent specification of N.) Then taking one lat¬ 
tice configuration for measurement per sawtooth sweep 
should give the maximum number of statistically inde¬ 
pendent configurations per unit of computational work. 
The “OR n" methods discussed below will be seen to be 
analogs of the sawtooth scheme. 

3.8. Multiquadratic generalizations 

The algorithm of eq. (28) is valid for any action 
which, as a function of each individual node variable u ki 
has the form of eq. (23). This defines the class of multi¬ 
quadratic actions, which is larger than the class of Gaus¬ 
sian actions. Some interesting interacting multiquadratic 
actions are: 


4.1. Microcanonical analog algorithms 
The simplest implementation of overrelaxation for 
SU(n) gauge theories is the microcanonical analog al¬ 
gorithm proposed by Brown and Woch 10 and Creutz. 11 
Let Ui be the link being updated, and let Sw be the sum 
of terms in the Wilson action which depend on U 


pSwlUi) = const-Re Tr 

n 


Ut ^ Us 

staples 



Let U Q be the value of Ui which minimizes Sw\ in terms 
of U 0 and the old value of the link U *, we wish to con¬ 
struct an analog of the u = 2 limit of the multiquadratic 
algorithm of eq. (28), which we rewrite as 

u* k = Ck ~~ u k + Ck . (34a) 


Exponentiating eq. (34a) gives 


.t u 


h = e { c ‘ 



(346) 


• The continuum Yang-Mills action 5 = \(F£j) 2 t 
which is multiquadratic 8 because the interaction 
term in F^, is an outer product 

• Non-compact discrete Yang-Mills, for the same rea¬ 
son; 


which suggests the unitary group analog 

Ui = U 0 U 0 . (35) 

The algorithm of Brown and Woch and Creutz consists 
of using eq. (35) as a Metropolis trial selection; since the 
inversion of eq. (35), 


• The discretized <f> A and Higgs actions with <f> A point 
split® according to <f> A (x) —► <f> 2 (x)<f> 2 (x -f ap) t with 
p a unit lattice displacement. 

4. NON-MULTIQUADRATIC IMPLEMENTATIONS 
OF OVERRELAXATION 

Most non-multiquadratic implementations of overre¬ 
laxation make use of the generalized Metropolis algo¬ 
rithm, consisting of the following two steps: 


U t = U 0 (Ui)' 1 U 0 , (36) 

is just eq. (35) with Ut and U[ interchanged, the ratio 

W(U[ —► Ut)/W(Ui —► Ui) in eq. (32) reduces to unity, 
and detailed balance with the Wilson action of eq. (33) is 
satisfied by accepting the trial selection of eq. (35) with 
the conditional probability 

P A = min (l, (37) 


1. Given u, pick a trial tt' with normalized probability 
distribution W(u —► u'). 


2. Accept u ' with conditional probability 


P A = min 


{ 


1, 


W(u' -> u) 
W(u -> J) 


e-P s W)\ 
e-W”) ) 



For SU(n) with n > 3, the acceptance probability 
P A is smaller than unity. The case of SE/(2) is special, 
since the sum over staples in eq. (33) is then proportional 
to an SU( 2) group element which we call U~ x , 

£ u s = ku- 1 , 

staples 


This procedure gives an overall transition probability 
P(u —*■ u f ) which satisfies the detailed balance condi¬ 
tion of eq. (18). I will group the algorithms to be dis¬ 
cussed into two basic types, (a) microcanonical analog 
algorithms, and (b) tunable-u; algorithms. 


ps w (u t ) = const - ^ . (38) 

n 

We see from eq. (38) that indeed Ut = U 0 minimizes 
Sw(Ut), and since 
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Tr(E J'U- 1 ) = Tr(U 0 U/ 1 U a )U (j 1 = Tr (UJ/f 1 ) 


W[U t -» U[\ = jVx 


= TrfUoU/ 1 )- 1 = Tr U t U^ , 

(39) 

we have Sw(Uft = Sw(Ui)- So for SU(2) we have 
jPa = 1, and the Brown and Woch, Creutz algorithm 
is microcanonical. To get ergodicity one adds standard 
Metropolis steps, defining “OR n Metropolis” as follows: 

OR 0 Metropolis = 

Brown and Woch, Creutz algorithm 

of eqs. (35) and (37); (40) 

OR n Metropolis = 

OR 0 +n standard Metropolis steps. 

Clearly, OR n is similar in spirit to the "sawtoothing" 
procedure discussed in sect. 3.7. 

In analogy with the above SU(n) algorithm, Gupta, 
et al. ia have given an overrelaxed Metropolis algorithm 
for the XY model. The action here is 

5 = const - /3 £(i j) cos(0i - 6j) 

= const - /? k Re(SiEt) , (41) 

Si = e 1 * 1 ', k € Sj , 

where (ij) indicates the restriction of the sum to near¬ 
est neighbor pairs, eq. (41) has the same structure as 
eqs. (33) and (38), so a corresponding overrelaxed, mi¬ 
crocanonical algorithm is 

Si - Si = 2 .- Sf * 2 , . (42) 


«P [l - i Re Tr (U P U' P j\ 

-i Zpdi P [l - s Re Tr U' P ] 

£pd< /J[l-i ReTr Vp]} , 

(43a) 

with the normalization constant Af fixed by 

j d[U’\ W[U t - U[\ = 1 . (436) 

We then use W as a Metropolis preselection, and accept 
U[ with the conditional probability 

Pa =m!n{l, 

= min {1, Af'/Af} = min {1, 1 + 0(a 2 )} , 

(44) 

giving an efficient algorithm for which P A — ► 1 in the 
continuum limit. In the case of 5E/( 2 ), the normalization 
Af can be readily calculated in closed form . 13 

An alternative tunable-u algorithm, for the special 
case of 51/(2), has been given by Brown and Woch . 10 
They proceed by mapping SU(2) onto R 3 in such a way 
that the Wilson action transforms into a unit Gaussian. 
They then do an overrelaxed heat bath update using 
eq. (28), and finally remap back to SU{2). Explicitly, 
they start from eq. (38) and set 

U t = e' * f - s U a , (45) 

with B the SU(2) matrices, so that Sw becomes 

/? S w (U t ) = const — /?& cos 8 . (46) 


4.2. Tunable-w algorithms 

Adler 4 ’ 13 has given an overrelaxed algorithm for 
SU(n) incorporating a tunable u ; parameter. The al¬ 
gorithm is constructed by first doing a non-compact dis¬ 
cretization of SU{n) and using eq. (28) to write the 
corresponding overrelaxed transition probability W(u —* 
u’). One then writes the Wilson lattice gauge theory 
transcription of W, which takes the following form. Let 
l be the link being updated, with U t the old value, U[ 
the new value, and Us the staple joining with Ut in the 
plaquette P. In terms of Up = UiUs , Up = U[Us we 
have 


They then map 6 into r = f(8) defined by 

z -l J'=m x 2 e -*>l2 dx = 

(Z')- J /o sin 2 a; e 0k cot “dw , 



with Z, Z' appropriate normalization constants. Defining 
t = rr, they do an overrelaxed update 


-* r 


1 


r = (1 - w)r + -7= , 

y/2 

and then remap to get 

u[ = e' f-'W-*U 0 . 


(48) 


(49) 
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This algorithm exactly satisfies detached balance by con¬ 
struction, and so does not need a Metropolis correction 
step. 

Finally, Heller and Neuberger 14 have given a tunable- 
u; algorithm for the nonlinear cr-model, with action 


ft(x) = 1 , 

ps = p Y $( x ) • + °£) 


(50) 


Z,/i 


Defining $q to be the value of ^(x) which minimizes 5 
for fixed x, their procedure is to draw a geodesic in the 
0 (n) manifold through tf> and <f> 0 parameterized by u, 
thus defining an os-dependent mean ft, and to take ft 
distributed around ft with a distribution which approxi¬ 
mates the narrowed distribution of eq. (28). In the 0(4) 
case, this recipe can be written in terms of SU( 2 ) ma¬ 
trices as 



(51) 

U' = (U 0 U~ 1 )“U , 

(52a) 

U' = V U' , 



with V a noise matrix distributed as 

7 


P(V) = exp 


tr (V) 


}' 


(528) 


2 o ;(2 — o;) 

where 7 is read ofF from the conditional distribution of 
U as determined by its nearest neighbors, written as 


p{U) = exp 


5 7 tr (17 U-') 


(52 c) 


As before, the new value U 9 is used as a Metropolis pre¬ 
selection in a final accept/reject step which enforces de¬ 
tailed balance with respect to the action of eq. (50). 


C P {U U U t ) = — Y W* - W), (Wp - (W)) 2 , 

Up p 


Wp = - Re Tr U P , (54) 

n 

OR 0 was somewhat slower at decorrelating plaquettes 
than heat bath or 10-hit Metropolis. Creutz showed that 
the plaquette was improved by using an algorithm with a 
tunable parameter intermediate between OR 0 and stan¬ 
dard Metropolis, but he did not study OR n. 

Brown and Woch 10 assessed OR n by studying au¬ 
tocorrelation times for the average action and for the 
"Polyakov loop ” 15 defined by 


“Polyakov” = Tr 


n u t 

Lline through lattice 



Letting p(A) be the autocorrelation between sweeps i 
and t + A, they defined the truncated sum 

m 

Pm = Y P( A ) ’ ( 56 ) 

A~-m 


in terms of which the autocorrelation time r of eq. ( 2 ) 
is given by 

r = £ Poo ; (57) 

in practice they estimated p^ from p l50 - They found that 
overrelaxation gives a dramatic improvement in decorre¬ 
lation for SU( 3) for /? = 5.6 on a 4 4 lattice, with up 
to a factor of 3 improvement for interesting observables. 
The Polyakov loop was best for OR 0 , while the action 


Table 1: Correlation time r in sweeps (error ~ 20). 


5. NUMERICAL RESULTS ON OVERRELAXATION 
A number of studies of the utility of overrelaxed al¬ 
gorithms have been carried out in the last two years; we 
group them by algorithm type as in Sec. 4. 

5.1. Studies of microcanonical-analog algorithms 
Creutz 11 studied the algorithm OR 0 [cf. eq. (40)] 
for SU(3) at /? = 6.0 on a 7 x 7 x 7 x 6 lattice. For the 
non-gauge-invariant lattice correlation (n^ = number of 
links) 

C(u u Ui) = — Y Re Tr \ U 'a Va] , (53) 

he found that OR 0 outperformed 128 hit Metropolis 
(which is effectively equivalent to heat bath). For the 
gauge-invariant plaquette correlation (np = number of 
plaquettes) 



OR 1 

20 hit 
Metropolis 

Pseudo 
Heat Bath 

P 

R 

T 

R 

T 

R T 

6.5 

40 

90 

90 

200 

130 170 

7.0 

40 

60 

110 

180 


7.25 

40 

60 

100 

170 


7.5 

40 

60 

140 

210 



decorrelated best for OR 10 but was worse for OR 0 or 
OR 1 than for conventional 10-hit Metropolis, consistent 
with Creutz's results. 

Gupta et al . 16 compared the following algorithms for 

SU(3)\ 
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• 20 hit Metropolis, 

• OR n, 

• pseudo heat bath (the Cabibbo-Marinari SU{2) sub¬ 
group algorithm), 

• hybrid Monte Carlo. (58) 

They measured small Wilson loops on blocked lattices, 
blocking in five levels by >/3/level starting from a 9 4 
lattice, and measuring autocorrelations of the blocked 
loops at each level. Their results for rectangular loops 
( R ) and twisted loops (T) are in Table 1. Gupta et 
al. conclude that OR 1 is the optimal algorithm for pure 
gauge simulations over the entire range of accessible /?. 

In another, independent investigation, Gupta et al . 12 
studied the XY model using the algorithm of eq. (42), 
For an iteration consisting of 15 steps of eq, (42) followed 
by 2 standard Metropolis steps, they found a sizable re¬ 
duction in critical slowing down, measuring 

r = 0.15f 1-2 . (59a) 

For comparison, a pure standard Metropolis iteration 
gives 

t = 5f 2 . (596) 

5,2. Studies of tunable-ur algorithms 
Brown and Woch 10 studied the tunable-u; algorithm 
of eqs. (45)-(49) for the SU(2) subgroups of S[/(3), at 
/? = 5.6 on a 4 4 lattice. They found that the Polyakov 
loop decorrelated best at ui = 2 , where a factor of 3 
gain was realized, while the action decorrelated best at 
an intermediate u of 1,25, However, more recent work 
shows that on the Columbia 64-mode machine , 17 a mixed 
algorithm consisting of 9 = 2 iterations followed by 

1 conventional Cabibbo-Marinari iteration was not no¬ 
ticeably better than a non-overrelaxed algorithm on a 
24 s x 16 lattice at /3 > 6 . (It is not possible, with cur¬ 
rent software, to implement the tunable algorithm with 
ut < 2 .) 

Adler and Bhanot 18 tested Adler's algorithm of 
eqs. (43)-(44) for SU{2) at /? around 2.2. A factor of 2 
improvement in plaquette correlation was observed (with 
sawtoothing) at very small u ;, with u ~ 1.025 on 8 4 
and ~ 1.05 on 12 4 lattices. For the magnitude of the 
Polyakov loop, no distinct minimum was observed, but 
larger u ; values were found to be better than ui = 1 , con¬ 
sistent with the Polyakov results of Brown and Woch. 

Heller and Neuberger 14 studied the nonlinear <r- 
model in 1 dimension using the algorithm of eqs. (50)- 


(52). They point out that interactions can renormal¬ 
ize the starting value of u ; into an effective value u; e yy. 
They determine ut K ff computationally by studying the 
field-field autocorrelation matrix and fitting to free-field 
formulas for the overrelaxation of \ [(5* <j>) 2 -fm 2 <£ 2 ], 
with m the mass gap, getting uj e ff = — 0.77m, as 

compared with the theoretical formula u>opr(m) = 2— 2 m. 
These formulas show that there exists an ui < 2 for which 
> Uoptfrn), and hence (in d = 1 at least) a 
dynamical critical exponent of z = 1 is attainable. 

5,3. Conclusions from the numerical work 
The following conclusions can be drawn from the nu¬ 
merical work synopsized above: 

1. Overtaxation works! It ts the local algorithm of 
choice in most pure gauge system applications, and 
for many allied lattice spin and field theories as well, 

2 . z = 1 is not a free field artifact, but is attainable in 
interacting systems. 

3. Sawtoothing of some sort is desirable; in the micro- 
canonical analog versions, one should use OR 1 at 
least. 

4. The decorrelation for the plaquette is best for 1 < 

< 2 , while the Polyakov loop is most improved 
for u ~ 2, However, the systematics of (a) which 
form of overrelaxed algorithm is best, (b) how to 
pick u, and (c) which measurements are helped and 
why. is far from settled—more theoretical and em¬ 
pirical work is needed. 

6 , FURTHER QUESTIONS AND DISCUSSION 
6,1, How does overrelaxation work? 

Neuberger 19 has proposed an interesting field- 
theoretic model for overrelaxation, based on the fact 
that his "red-black” analysis of the free field case re¬ 
duces to a 2 x 2 matrix problem. Let be a free 

field, governed by an action which is the discretization of 
i +w» 2 ^ 2 ]. Let ■0 ± be the fields with support at 

the "red” and “black” sites, 

^ = \ [l ± (-l) s —] * (60) 

and let (j) 1 =■ + - 0 “ — (f> and (f> 2 = = 

(—l) 2 ***^ be respectively the sum and difference of the 
red and black site fields. Neuberger shows that the over¬ 
relaxation analysis for <f> can be mapped into a stochastic 
field model for the fields <£ 1,a , with action 
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S = J d?x C , 

C =\ + \ [m-lit 1 ) 1 + . 

(61) 

and evolving according to the stochastic differential equa¬ 
tion 


A second, related question is whether a fully tun¬ 
able or can be introduced into overrelaxed Metropolis by 
an extension of the Brown and Woch, Creutz construc¬ 
tion. An interesting possibility is to use a new Metropolis 
variant proposed by Creutz, Gausterer and Sanielevici , 20 
based on unitary link matrices Ui and auxiliary “noise” 
matrices Vi , with Ui, Vi e SU(n ), iterated according to 


r = -(1 - 7 J ) 1/J # - 7 «■* # + (1 - 7 2 ) 1 'V , 

a,/? = 1,2, 

(ij 0 (x,f) r) 0 (x',t')) n = 2 S a0 S d (x - x') S(t - f') . 

(62) 

When 7 = 0 eq. (62) is the usual Langevin equation, 
which corresponds 13 to the extreme underrelaxation limit 
ui = 0 ; for 0 < 7 < 1 eq. (62) differs from the Langevin 
equation by the addition of a “reversible mode cou¬ 
pling" term e^dS/diffl, which enters the Fokker-Planck 
equation as ( djd<j> a ){e a PdSjd<ffl ) = 0 and so leaves 
exp(—/?5) as the equilibrium distribution. According to 
Neuberger, the model of eqs. (61)-(62) maps into the 
overrelaxation analysis as follows; 

m\ - m 3 < 1 = light field mass, 
m\ = Ad = heavy field mass, (63) 

7 (i -ufiy+u*/* > 

with 0 < 7 < 1 mapping into 0 < ui < 2. His conclu¬ 
sion is that overrelaxation works by adding a reversible 
coupling between pairs of long and short wavelength 
modes. The effect of the reversible coupling is to speed 
up the stochastic evolution of the long wavelength modes 
while slowing down the evolution of the short wavelength 
modes; the fact that the heavy modes are slowed down 
presumably explains why the observed results of overre¬ 
laxation depend on the operator being measured. Based 
on his analysis, Neuberger suggests that there is a dynam¬ 
ical equivalence class of overrelaxed algorithms which will 
be tunable to z = 1 . 

6.2. Two questions about overrelaxed Metropolis 

An important issue to be answered is whether OR n, 
or its generalization OR n x n 3 , defined as n x steps of the 
Brown and Woch, Creutz algorithm followed by n 3 steps 
of standard Metropolis, can attain z = 1 for appropriate 
n or appropriate n Xi n 2 . Or do these algorithms merely 
give an approximation to a fully tunable or for small lat¬ 
tices? 


U[ = UiFi [{EE}] Vi , 

(64) 

= Fi [{EE}] Vi.Fi [{?/'}] , 

and with the primed update accepted with conditional 
probability 

Pa = min [ 1 , exp/?(- 2 F + H)] , 

(65) 

B = S{{U}) + X i S(Vi). 

Here Fi e SU(n) are arbitrary functionals of the {U}, and 
exp [-/?S(V)] is an arbitrary imposed distribution for 
V. In this generalized setting, OR 0 corresponds to the 
choices Vi =1 (no noise), -Fi[{^}] — Uf 1 Ui 0 U^ 1 Ui 0l so 
that = UftU^Uio. An interesting question 

is whether one can get an efficient tunable-u algorithm 
along the lines of the Heller-Neuberger construction of 
eqs. (51)-(52), by choosing 

fi [{U}} = [Ur'U i0 Y , 0 < u < 2 , ( 66 ) 

and making an appropriate choice of S(V), 

6.3. Can z = 0 be attained by nonlocal algorithms? 
So far we have discussed local algorithms. We now 
address the question of whether non-local algorithms can 
give a further improvement in critical slowing down from 
z = 1 to z = 0. Two methods have been discussed in 
the literature: (a) Fourier acceleration in the Langevin 
equation 21 and (b) stochastic multigrid . 2 In Fourier ac¬ 
celerated Langevin, one performs a Langevin update 

<Hx,T n+ i) = <Hx,T n ) - f*{</>,1)\ , (67) 

with the driving term / at x coupled nonlocally to the 
action variation 8S/8<f> at y , 

/* 7 = [ e x,i/ 6<t,(y, Tn ) + y/ £ z,y *Klf> r n)] > 

(68) 

= £ P e*’ p ' {x ~ v) «(p) • 

Usually one takes e(p) = e(p 2 + m a ) _1 , as motivated by 
the analysis of critical slowing down in the free field case. 
In stochastic multigrid, one performs sweeps on succes¬ 
sively coarser grids according to the following recursive 
procedure: 
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• Do mi heat bath sweeps on a L d lattice; 

• Compute the conditional action 2 on the next coarser 
(L/2) j lattice; 

• Do 7 multigrid iterations on the (Z/ 2) d lattice; and 

• Add the coarse grid result back on the L d lattice 
and do m 2 further heat bath sweeps. 

According to this definition, for each initial sweep on 
the L d lattice there are 7 on the lattice of dimension 
(L/2) d , 7 2 on the lattice of dimension (L/4) d , and so 
forth. 

Let me now briefly discuss a number of problems 
which will have to be surmounted in order to apply these 
methods as lattice gauge algorithms: 

1. For both Fourier acceleration and multigrid—the 
problem of non-smooth approximate zero modes. 22 
We have seen in sect. 3.1 that the worst errors typi¬ 
cally come from the smallest eigenvalues of L. Both 
Fourier acceleration and multigrid assume that the 
dangerous modes are localized around the origin in 
Fourier space (as they are in the free field case): 
Fourier acceleration through the explicit choice of 
£x, v , and multigrid through the blocking scheme. 
However, as is well-known, non-Abelian theories can 
have zero eigenmodes which are not localized around 
zero in Fourier space—in fact, as exemplified by in- 
stanton zero modes, they can be arbitrarily rapidly 
varying. To deal with this problem adaptive versions 
of the Fourier acceleration or multigrid algorithms 23 
will be needed. 

2. In Fourier acceleration, the choice of step size is an 
issue. If the step size is small, the algorithm is accu¬ 
rate but the stochastic evolution is slow. If a large 
step size is used to get rapid evolution, there are 
detailed balance errors, or the danger of small ac¬ 
ceptances if detailed balance is restored by a global 
Metropolis step. 

3. In multigrid, attention must be paid to the work 
estimate. If the Lagrangian is a polynomial of order 
p, the conditional action on level 2* is computable 
in 0(p) steps from the action on level 2* -1 , giving 
the work estimate 2 

W ~L-'+7(£/2) <, + 7 2 (W + ... , kq , 


the Wilson action, the conditional action on level 2 fc 
must in general be computed on the finest lattice 
and requires L d steps, giving the work estimate 2 

W ~ (l + 7 + 7 * + ... + 7 log 2 Lj jd 

~ L d log 2 L 7 = 1 (70) 

~ i d + l °g 2 't' 7 > 1 

Now according to Goodman and Sokal, 2 in the Gaus¬ 
sian case one can prove that critical slowing down 
is completely eliminated for 7 > 2. But according 
to eq. (70), this gives z K ff = log 2 7 > 1 and so 
multigrid does no better than is possible by overre- 
laxation. Thus, in order for multigrid to beat overre¬ 
laxation, one will have to either (i) eliminate slowing 
down with 7 = 1 (this is conjectured 24 and could be 
studied numerically in lower dimensional examples 
even in the absence of proofs) or (ii) find a clever 
method to reduce the work needed to compute the 
conditional lattice gauge theory action. 

6.4. Conclusion 

To conclude, overrelaxation is very simple to imple¬ 
ment, and goes half-way towards solving the critical slow¬ 
ing down problem for pure gauge theories—one can get, 
in principle, from z = 2 to z = 1. To get from z = 1 to 
z = 0 will require further good ideas! 
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INTRODUCTION AND BRIEF REVIEW OF 
VIEWING TRANSFORMATIONS OF A PLANAR 

OBJECT 


A central issue in pattern recognition is the efficient 
incorporation of invariances with respect to geometric 
viewing transformations. We focus in this article on a 
particular method for handling invariances, called “im¬ 
age normalization”, which has the capability of extract¬ 
ing all of the invariant features from an image using only 
a small amount of information about the image (such as 
a few low order moments). The great appeal of normal¬ 
ization is that it isolates the problem of finding the im¬ 
age modulo the effect of viewing transformations, from 
the higher order problem of deciding which features of 
the image are needed for a specific classification deci¬ 
sion. Intuitively, normalization is simply a systematic 
method for transforming from observer-based to image- 
based coordinates; in the former the image depends on 
the view, whereas in the latter the image is viewing trans¬ 
formation independent. From a mathematical viewpoint, 
our method consists of placing a set of constraints on 
the transformed image equal in number to the number 
of viewing transformation parameters, permitting one to 
solve either algebraically or numerically for the param¬ 
eters of a normalizing transformation. Since the con¬ 
straints are necessarily viewing transformation noninvari¬ 
ants, their construction is in general simpler than the 
direct construction of viewing transformation invariants. 

Let us begin our discussion with a quick review of the 
viewing transformations of a planar object, since these 
transformations will be used as illustrations of our gen¬ 
eral methods. Under rigid 3D motions the image /(x), 
with x = (xi, X 2 ) the two dimensional coordinate in the 
image plane, is transformed to J(x'), with x' related to 
x by the planar projective transformation 

x ' n= Jl m =l GnmXm + tn , ^ = ^ ^ _ (1) 

I + P™ Xrn 


When the depth of the object is much less than its dis¬ 
tance from the lens, then the parameter p n in Eq. (1) 
can be neglected, and Eq. (1) reduces to the linear affine 


transformation 


x 


f 

n 


J 2 G nm 


m 


X 


m 


4* t 


n 


= 1 



[An affine transformation, with G nrn replaced by G nm — 
t n Pm , also results when Eq. (1) is expanded in a power 
series in x m and second and higher order terms are ne¬ 
glected.] Additionally, when the viewed object is con¬ 
strained to lie in the plane normal to the viewing or 3 
axis, Eq. (2) specializes further to the similarity trans¬ 
formation group of scalings, rotations, and translations, 
in which G nm is simply a multiple (the scale factor) of a 
two dimensional rotation matrix. The projective trans¬ 
formations, the affine transformations, and the similarity 
transformations all form groups, and this will be the char¬ 
acterizing feature of the viewing transformations studied 
in our general analysis. 

In applications, it will be convenient to use subgroup 
factorizations, which are readily obtained from the group 
multiplication rule for the transformations of Eqs. (1) and 
(2). For example, a general planar projective transforma¬ 
tion can be written as the result of composing what we 
will term a restricted projective transformation 


n 1 / 

1 + Lm=l P™ X m 

with the general affine transformation of Eq. (2). An¬ 
other subgroup factorization expresses the general affine 
transformation of Eq. (2) as the result of the composition 
of a pure translation 




with a homogeneous affine transformation 

2 

x n = ^ ^ G nm Xm ■ (4b) 

m=l 

Yet a third subgroup factorization expresses a general ho¬ 
mogeneous affine transformation as the result of compos¬ 
ing what we will term a restricted affine transformation, 
which has vanishing upper right diagonal matrix element, 


'X'n ^ ^ Qnm %m » 


771=1 


512 = 0 , 


(5a) 
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with a pure rotation 

2 

x n = nm-Ern ? 

m=l 

fin = fi 2 2 = cos0, fii 2 = fi 2 i = sin 6 . (5b) 

A variant of Eqs. (5a,b) is obtained by requiring that the 
matrix g have unit determinant, so that it has the two- 
parameter form gn = u, g± 2 = 0, g 2 i = w, 922 = w -1 , 
and then including a scale factor A in Eq. (5b), which 
now reads 

2 

X n = A ^ ^ finm^m- (^c) 

m= 1 

GENERAL THEORY OF IMAGE 
NORMALIZATION 

We proceed now to formulate a general framework for 
image normalization, with the aim of understanding the 
common elements of the various normalization methods 
which appear in the literature and of generalizing them to 
new applications. As a preliminary to the mathematical 
discussion of Subsecs. 2A-E, we specify our notation for 
viewing transformations. Let Q = {£} be a group of 
symmetry or viewing transformations fi, which act on 
the image I(x) according to 

I(x) -> J s (f) = I(S(x)) . (6a) 

Our notational convention, that we shall adhere to 
throughout, is that x f = S(x ) is the concrete image co¬ 
ordinate mapping induced by the abstract group element 
S. [A specific example of such a transformation would be 
the planar projection transformation of Eq. (1), in which 
S would be the abstract element of the planar projec¬ 
tive group characterized by the parameters G mn , t n . Pm 
specifying the concrete coordinate mapping.] In this no¬ 
tation, the result of successive transformations with S i 
followed by S 2 is given by 

m ^i S 2Si (£) = . (6b) 

The transformation groups of interest to us are in gen¬ 
eral ones with continuous parameters, in other words, Lie 
groups. However, very little of the formal apparatus of 
Lie group theory is required in what follows; basically, 
all we use is the group closure property and the enumer¬ 
ation of the number of group parameters. In particular, 
no knowledge of the representation theory of Lie groups 
is needed. 

A. The normalization recipe. We begin by giving the gen¬ 
eral prescription for an image normalization transforma¬ 
tion. Let Ni(x) be a transformation of x which depends 


on the image /, and which is constructed so that under 
the image transformation of Eq. (6a), it behaves as 

Ni s (x)=S~ 1 (N I ( £)) , (7a) 

with S~ 1 the inverse transformation to S of Eq. (6a), 

S(S~\x))=x . (7b) 

Also, let M/(x) be an optional second transformation of 
x which depends on the image I only through invariants 
under the group of transformations G, that is, 

M[ s (x) — M/(x), all S G G • (7c) 

Then 

I(x) = I(N I (M I (x))) (8) 

is a normalized image which is invariant under all trans¬ 
formations of the group G ■ This is an immediate conse¬ 
quence of Eq. (6a) and Eqs. (7a-c), from which we have 

l s (x) = I s (Ni s (Mi s (x ))) 

= J(^(^- 1 (^ j (Mj( *))))) (9) 

= £))) = /(f) . 

B. Uniqueness. Before specifying how to actually con¬ 
struct a map Tv/ obeying Eq. (7a), let us address the 
issue of uniqueness. That is, given two maps N i/(x) and 
A 2 /(x), both of which obey Eq. (7a), how are they re¬ 
lated? By hypothesis, we have 


Niis(x) = S~\Nu(x)) , 
N 2 Is {x) = S~\N 2 I (x)) . 

(10) 

Since for any /(x) and <f(x) we have 


f{g{x))-'=g~\r\x)) , 

(11a) 

we can rewrite the first line of Eq. (10) as 


tiu a W = tiu(S(x)) ■ 

(lib) 

Let us now define a new map M/(x) by 


M/(f) = N^i^ix)) , 

(12) 


which reduces to the identity map when N u = N 2I ; then 
by Eq. (lib) and the first line of Eq. (10), we have 

M /s (f) = N^ s {N 2Is {x)) 

= Nu\S{§~\N 2I { x)))) (13a) 

= Nu\N 2I {x)) = M/(f) . 

In other words, Mj(x) depends on the image I only 
through invariants under transformations of the group G , 
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and from Eq. (12), the normalizing map N 21 is related 
to the normalizing map Nu by 

N 2 i(x) = Nii(Mi(£)) . (13b) 

This is why in writing the general normalized image cor¬ 
responding to a particular normalizing map in Eq. (8), 
we have included in the x dependence the possible ap¬ 
pearance of a map M/ which depends on the image only 
through invariants under transformation by elements of 

G- 

C. Construction of TV/ by imposing constraints, and 
demonstration that normalization yields a complete set 
of invariants. We next show that one can construct 
an image normalization transformation obeying Eq. (7a) 
by imposing a suitable set of constraints. We shall as¬ 
sume now that Q is a iV-parameter Lie group which 
is continuously connected to the identity. Let Ck[I] — 
Cfc[/(x)] , k = 1, ...,K (where x is a dummy variable) 
be a set of functionals of the image I(x) with the property 
that the K constraints 

C k [I S '] = C k [I(§'(£))\= 0, k = l,...,K (14a) 

are satisfied for a unique element S' — N{ of G , so that 
C fc [J(TV/(x))]=0, * = 1,...,*. (14b) 

Then, as we shall now show, TV/(x) is the desired nor¬ 
malizing transformation. 

We remark that the condition that Eqs. (14a,b) should 
have a unique solution can be relaxed in applications to 
the condition that there be only one solution in the range 
of relevant viewing transformation parameters. Clearly, 
either form of the uniqueness condition requires that the 
constraint functionals not be invariants under G, and thus 
their structure will in general be simpler than that of 
directly constructed viewing transformation invariants. 
In many cases, the constraints can be constructed from 
viewing transformation covariants , which have simple al¬ 
gebraic properties under the transformations of G , per¬ 
mitting closed form algebraic solution for the parameters 
of the normalizing transformation. 

To see that the construction of Eqs. (14a,b) gives a 
transformation TV/(x) that obeys Eq. (7a), let us con¬ 
sider the effect of replacing I by Is in Eqs. (14a,b). By 
hypothesis, the constraints 

C k [I s (S'(x))}= 0, k=l,...,K (15a) 

are uniquely satisfied by a group element S' = TV/ S of £?, 
so that 

C fc [7 5 (TV/ s (x))] = 0, * = 1, , (15b) 

with TV/ s (x) the proposed normalizing transformation 
corresponding to Is. But using Eq. (6a), we can also 
write Eq. (15b) as 

C k [I(S(Ni s (x)))] = 0 , k = l,...,K , (15c) 


which has the same structure as Eq. (14b). Therefore, 
by uniqueness of the solution TV/ of Eq. (14b) we must 
have 

S(Ni s (x)) = TV/(x) , (16a) 

which by Eq. (7b) is equivalent to 

N Is (x) = §~ 1 (N I (£)) , (16b) 

showing that the TV/ produced by solving the constraints 
does indeed obey Eq. (7a). Hence the imposition of con¬ 
straints gives a constructive procedure for generating im¬ 
age normalization transformations. 

We note that this construction makes the normalizing 
transformation TV/ an element of the group G , and the 
quotient M/(x) = TV^} 1 (TV 2 /(x)) of two normalizing maps 
constructed by imposing different sets of constraints will 
likewise be an element of Q ■ When both TV/ and Mj in 
Eq. (8) belong to G , we can invert Eq. (8) to express the 
original image I in terms of the invariant, normalized 
image I according to 

I{x) = I{M;\NJ\x))) . (16c) 

This equation shows that normalization leads to a com¬ 
plete set of invariants, in the sense that the information 
in the normalized image, plus the K parameters deter¬ 
mining the viewing transformation (NJ l (x)), suffice 
to completely reconstruct the original image. By way of 
contrast, representation-theoretic and integral transform 
methods, although attacking the same problem as is dis¬ 
cussed here, yield only a small fraction of the complete 
set of invariants. Moreover, normalization has the fur¬ 
ther advantage of requiring only a minimal knowledge of 
the kinematic structure of the group; the full irreducible 
representation structure is not needed, and the methods 
described here are applicable to noncompact as well as 
to compact groups. We note finally that the discussion 
of this section is slightly less general than that of Subsec¬ 
tions A,B above, where we did not require either TV/ or 
Mi to belong to G\ the most general normalizing map TV/ 
is obtained from one generated by constraints by using 
as its argument a map M/ which does not belong to G 
but that is invariant under transformations of the image 
/by£. 

D. Extension to reflections and contrast invariance. We 
consider next two simple extensions of the constraint 
method for constructing the normalizing transformation. 
The first involves relaxing the requirement that G be sim¬ 
ply connected to the identity, as is needed if G contains 
improper transformations such as reflections. Reflections 
are said to be independent if they do not differ solely by 
an element of the connected component of the group; for 
each independent discrete reflection R in G , the set of 
constraints of Eq. (14a) must be augmented by an ad¬ 
ditional constraint D[I(S'(x))] > 0, where D[I(x)] is a 
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functional of the image which changes sign under the re¬ 
flection operation R , 

D[I(R( £))] = -D[I(X) 1 • (17) 

The second extension involves incorporating invariance 
under changes of image contrast, that is, under image 
transformations of the form 


J(x) —► c/(x), c > 0 . (18a) 

To the extent that illumination is sufficiently slowly vary¬ 
ing that it can be treated as constant over a viewed ob¬ 
ject, changes in illumination level as the object is moved 
to different views take the form of changes in the con¬ 
stant c in Eq. (18a), which is why incorporating contrast 
invariance can be important. If we require that the con¬ 
straint functionals Ck [and D if needed] should be invari¬ 
ant under the change of contrast of Eq. (18a), then the 
image normalization transformation Nj(x) and the aux¬ 
iliary transformation M/(x) can be taken to be contrast 
invariant. A contrast invariant normalized image J c (x) is 
then obtained by the obvious recipe 



f d 2 xl(x) 


(18b) 


E. Use of subgroup decompositions. Suppose that for a 
general element S of the group Q , there is a subgroup 
decomposition of the form 


which if we impose the requirement of a unique solu¬ 
tion over transformations N 2 € Q 2 determines a “partial 
normalization” transformation N 2 i. Note that a suffi¬ 
cient condition for the constraints of Eq. (20a) to be 
independent of Si is for the functionals C 2 k to be 5'1- 
independent, but this is not a necessary condition; we will 
see examples in which, as S\ traverses Qi, the functionals 
are merely covariant in some simple way that guarantees 
invariance of the constraints obtained by equating all the 
functionals to zero. To see how N 21 transforms under the 
action of the group £/, we replace I by Is in Eq. (20b), 
giving 


^[/ 5 (^ 2 / s (^))]= 0 , k = l,...,K 2 ; (21a) 

again making use of Eq. (6a) this becomes 

C 2fc [/(^ 2 j s (f)))]=0, k = l,...,K 2 • (21b) 

Since the argument S(N 2 / s (x)) appearing in Eq. (21b) is 
no longer a member of the Q 2 subgroup, we cannot con¬ 
clude that it is equal to the argument 7v2/(x) appearing 
in Eq. (20b), but the arguments can differ at most by a 
transformation of x by some member S[ of the subgroup 
Qi which leaves the constraints invariant, giving 

Af 2/s (f) = 5- 1 (Af 2/ (5((f))) (22a) 


S = S 2 S 1 , (19a) 

with S 2 belonging to a subgroup Q 2 of £/, S\ belonging to 
a subgroup Q\ of Q , and with the respective parameter 
counts K,K\, and K 2 of Q, Q\, and Q 2 obeying 

K = Ki + K 2 . (19b) 

(Such subgroup compositions for a general Lie group are 
obtained by constructing a composition series for the 
group, but we will not need this formal apparatus in 
the relatively simple applications that follow.) Let us 
suppose further that we can solve the problem of image 
normalization with respect to the group Q \, and that we 
wish to extend this solution to the full invariance group 
Q. The subgroup decomposition allows this to be done by 
imposing K 2 additional constraints to deal with the Q 2 
subgroup, as follows. Let C 2 fc[/(x)], with k = 1,.„,K 2 , 
be a set of functionals of the image chosen so that the 
constraints 

C 2 k[I(N 2 i(Si (£)))] = 0 , k = 1, ...,K 2 (20a) 


as the subgroup analog of Eq. (7a). Corresponding to 
this, the partially normalized image defined by 

J(x) = I(N 2 i(x)) (22b) 

transforms under the group Q as 

m^Isi X) = I S (N 2Is (x)) 

= I(S(S-\N 2I (S[(x))))) (22c) 

= I(N2r(S[(m = HS[m, 

and thus changes only by a transformation lying in the 
Q\ subgroup. Further image normalization of I using the 
constraints appropriate to Q\ then gives a final normal¬ 
ized image 

i(x) = I(N 2I (N lI (M ! (x)))), (23) 


are independent of Si e Q\. In particular, taking Si as 
the identity transformation, Eq. (20a) simplifies to 

C 2 k[I(N 2 i(x))} = 0 , k = 1 ,..., A 2 , (20b) 


which is invariant with respect to the full group of trans¬ 
formations Q , where as before M/ is any transformation 
which is constructed solely using Q invariants of the im¬ 
age. 
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Abstract—We apply the general framework for image normalization to the problem of the similarity and 
affine normalization of partially occluded planar curves. An algorithm is given using first derivatives to give 
a similarity normalization, and first and second derivatives to give an affine normalization, dependent on a 
finite interval around a chosen point P of the curve. © 1998 Pattern Recognition Society. Published by 
Elsevier Science Ltd. All rights reserved 

Normalization Affine Similarity Partially occluded Planar curves Invariants 
Affine invariants Similarity invariants 


1. INTRODUCTION, AND NORMALIZATION WITH 
RESPECT TO TRANSLATIONS AND ROTATIONS 

The problem of efficiently incorporating invariances 
plays an important role in machine vision, and much 
work has gone into methods for extracting features 
invariant with respect to classes of geometric viewing 
transformations. An appealing way to deal with this 
problem is to use image normalization, in which the 
transformed image is reduced to a standard form 
which effectively “mods out” the viewing transforma¬ 
tion group; the normalized image and all of its fea¬ 
tures are then invariants, while the parameters of the 
normalizing transformation give the pose of the orig¬ 
inal image, so there is no loss of information. Adler (1) 
recently gave a general formal framework for image 
normalization, and illustrated it by using it to rederive 
and generalize known methods for the normalization 
of a non-occluded, isolated image. The method is not, 
however, limited to this rather special case, as we 
demonstrate here by applying it to the more realistic 
problem of the similarity and affine normalization of 
a partially occluded planar curve, such as that charac¬ 
terizing the image of the boundary of a partially 
occluded planar object. 

There has been much discussion in the literature of 
the problem of the recognition of partially occluded 
curves, using mainly the approach of constructing 
viewing transformation invariants using strictly local 
information at a point P (obtained by computing 
a finite number of derivatives; see reference (2) and 
references cited therein), or as in recent work of 
Bruckstein et aiP } using “semi-local” information 
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constructed from a finite neighborhood of P. Our 
approach is similar in spirit to this latter work, but 
instead of directly constructing invariants we instead 
use viewing transformation covariants, that is quantit¬ 
ies which are not invariant but have simple algebraic 
transformation properties under the transformations 
of interest, to construct a normalization algorithm. 
The advantage of focusing on covariants is that they 
often can be constructed in a more computationally 
robust fashion (for example, using only low-order 
derivatives or moments) than is possible for invari¬ 
ants. 

Let us assume that we are given a set of closed 
planar template curves, C 2 > , C^, with Cj speci¬ 

fied by giving its parametric form Xj(tj) = [x/f,), y/fj)] 
in which the parameter t, increases as the curve Cj is 
traversed counterclockwise. We are now given a con¬ 
vex target curve segment x target (£ targel ) = [x targe ,(l la rg e t), 

ytacget(^target)]! fOT ^target min — Itargel — ^target max> Which 

represents a fragment of the jih template as distorted 
by both an affine viewing transformation and a pos¬ 
sible reparameterization £ target = U(tj) with unknown 
function U. The problem is to identify the fragment 
with the correct corresponding segment of the correct 
template. In the discussion that follows, we shall omit 
the subscripts j and “target” from the curve para¬ 
meters, which will simply be referred to by the generic 
label f; the fact that t has a different meaning for each 
of the different curves will be taken into account by 
framing the entire algorithm in terms of reparameter¬ 
ization invariant quantities. 

We proceed by using the subgroup method (see 
reference (1), and Appendix A of this paper, where the 
methods of reference (1) are adapted to the normaliz¬ 
ation of planar curves), in which we successively solve 
the problem for translations and rotations, for the full 
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similarity group, and finally for the full affine group. 
For translations and rotations the procedure is obvi¬ 
ous. One first associates with each point P of the 
target and templates the reparameterization invariant 
unit normalized tangent vector 

f = T/\Tl (la) 

where the unnormalized tangent vector is given by 

T = (*',/), (lb) 

and where the prime denotes differentiation with re¬ 
spect to the generic parameter t. One first takes a par¬ 
ticular point P of the target curve, translates it to the 
origin, and rotates the target so that it is tangent to 
the y axis at the origin and has its tangent vector 
pointing into the upper half-plane. One next picks 
a point P ' on the first template, translates it to the 
origin, and rotates the first template so that it is also 
tangent to the y-axis at the origin and has its tangent 
vector pointing into the upper half-plane. Maintain¬ 
ing these conditions, one sweeps P f over the first 
template and looks for a match, proceeding in this 
fashion from template to template until a match is 
found. If no match is found, one then repeats the 
procedure with the tangent vectors of target and tem¬ 
plate at the origin pointing in opposite directions. The 
search implicit in this procedure is necessary because, 
without identifying landmarks on the curves, there is 
no way of knowing a priori (i) which point P ' of the 
correct template corresponds to the given point P on 
the target, and (ii) whether the parameter of the target 
runs in the same or the opposite sense to that of the 
template. Apart from the search over the possibilities 
for P ' and the relative sense of the template and target 
parameters, the algorithm consists simply of imposing 
the same translational and rotational normalization 
on the template and target curves. 

2. NORMALIZATION WITH RESPECT TO SIMILARITY 

TRANSFORMATIONS 

We turn next to the problem of normalization with 
respect to the full similarity transformation group, 
comprising translations, rotations, and scalings. Let 
us pick a specific point P of the target, a neighbor¬ 
hood of which will be used to construct the similarity 
normalization. As before, we perform a translational 
and rotational normalization by translating P to the 
origin and rotating the target so that x' = 0, y' > 0 at 
P. We begin by observing that the tangent vector to 
the target at a general point with parameter t makes 
an angle with respect to the y-axis given by 

/ 

w(t) = tan 0 = ^, (2) 

which is reparameterization invariant, and which by 
our translational and rotational normalization van¬ 
ishes at P. Let us compute w(£) for each t on the target 
segment and store its value along with x(£), y(£). Let 


now dr(w) > 0 with T(0) = 0 be a measure for integ¬ 
ration over the target curve; because w is invariant 
under the coordinate rescaling x -» Ax, y -» Ay, so is 
this measure. Also let A s > 0 be a parameter govern¬ 
ing the size of an integration interval along the target 
segment starting from P, assumed to lie entirely with¬ 
in the segment. Finally, let F D (x f y) be any non¬ 
negative function of x, y which is homogeneous of 
degree D under coordinate rescaling, that is 

F d (Ax, Xy) = X d F d {x, y). (3) 

We now use the quantities just defined to form the 
constraint (with p ^ v arbitrary real number para¬ 
meters) 

jS*dr(w)f J ,(Xx(txX3'(0y‘ = 1 ( 

^dr(w)f B (Xx(t), Iy(C)Y ’ - 1 

which using the homogeneity of F D can be solved 
algebraically for the scale parameter A to give 

R = jo sdr ( w ) f i>(x(tx y(or (S) 

Jo* dr ( W )F D (x(t),y(t)y 

Then according to the general normalization pre¬ 
scription of reference (1) and Appendix A, the nor¬ 
malized curve C with the parameterized form x(t) = 
[Ax(f), Ay(r)] is invariant with respect to scaling, as 
well as to translation and rotation, of the target seg¬ 
ment. Note that the resulting similarity normalization 
depends not only on the parameter A s , the exponents 
p, v and the chosen functions T(w) and F D (x , y), but 
also on the choice of the fiducial point P, and hence 
the normalized curves corresponding to different 
choices of P will not in general be congruent to one 
another. 

To determine the correspondence between target 
segment and the templates, we now do a search over 
the templates, over the choice of fiducial point P' on 
each template, and over the two possible senses of the 
tangent vector at P', applying the same normalization 
recipe as was applied to the target segment and look¬ 
ing for a match. The search for a match is done by 
comparing the normalized target and template using 
any convenient measure for the degree to which they 
coincide (such as, e.g. the integral over all points of the 
normalized target of the minimum distance to the 
normalized template). In some applications it may 
suffice to use a measure based on only a small number 
of features extracted from the normalized curves, in 
which case an alternative procedure would be to dis¬ 
pense with normalization and directly construct in¬ 
variant features by the methods of references (2, 3). 

3. AFFINE NORMALIZATION 

We turn finally to the problem of normalization 
with respect to the affine transformation group. Again 
let us pick a specific point P of the target, a neighbor- 
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hood of which will be used to construct the affine 
normalization. As before, we perform a translational 
and rotational normalization by translating P to the 
origin and rotating the target so that x' = 0, / > 0 at 
P. The conditions that P lie at the origin and that 
x' = 0 at P are preserved only by a subgroup of the 
full affine group, consisting of homogeneous affine 
transformations of the form 


A = 


a 0 

fi yJ' 


( 6 ) 


Since we have been implicitly dealing only with 
proper transformations (as opposed to reflections), we 
assume that the matrix of equation (6) has a positive 
determinant, and so ay > 0. 

Under the action of A, the coordinate vector 
x = (x, y) is transformed to x A = (x A , y A ) = (ax, 
fix -f yy\ Let us now follow Vaz and Cyganski (4) and 
introduce a new arc length parameter dr defined by 


dx = | x'y" — x”y f \ il3 dt. 


(7) 


which is shown in reference (4) to be reparameteriz¬ 
ation invariant. Under the action of A, dr is trans¬ 
formed to dx A = (det A ) 1/3 dr = (ay) 1/3 dr, and so 
although dx is not an affine invariant, it transforms 
linearly under affine transformations and thus is 
a convenient integration measure. Clearly, the differ¬ 
ence between the upper and lower limits of an integra¬ 
tion over t also simply rescales by (ay) 1/3 under the 
affine transformation A; henceforth, we shall choose 
the constant of integration in the integrated version of 
equation (7) so that x = 0 at P. 

We now define “center of mass” coordinates 
Xcm> ycM and central moment integrals n rs = ^ rs (T), in 
the neighborhood of P, by 


(*CM> yCAf) — 


jo dr[x(t), y(t)] 

S odr 


(8a) 


and 


Mrs(T) = 


dr(x - x C mY( y - ycM)\ (8b) 


o 


where the upper limit T will be specified through the 
normalization procedure. Under the action of the 
affine transformation with matrix A, the central mo¬ 
ments transform to 


VA;rs(T A ) = 


dx A (x - x C MX4(.y - ycM) S A , (8c) 


0 


with T A —(ay) 1/3 T. Substituting the transformed 
quantities and making a change of integration vari¬ 
able from x A to t, we get 


VA. rs (T A ) = (ay) 1/3 


dr[a(x - x CM )J 


o 


X [fi(x - X CM ) + y(y - ycM)J 


= (a7) 1/3 I 


si 


t=0 [ ' (^^O- 


«Ty (9) 


Let us now impose a set of four constraints to 
uniquely fix the affine transformation parameters 
a, /?, y and the interval of integration T = ^ 00 , 

PA;0 2(T A ) = Pa-.2c{Ta) = L PA;ll(T A ) = 0, T A =f 

( 10 ) 

with / a positive constant characterizing the normal¬ 
ization. Because the yi !s have been defined as central 
moments, these conditions are always attainable for 
/ in the range of T A . Substituting equaton (9), the first 
three conditions of equation (10) can be solved alge¬ 
braically in terms of T by solving a quadratic equa¬ 
tion. Writing 


- 7 a P 

y = - P = ~, 

a a 


(11a) 


the solution takes the form 


y = 


nioiT) 

9 1 ' 2 


P=~ 


dn (T) 
01/2 


= M20(T)H02(T) - Hu (T) 2 , (lib) 

a = ^ 20 (Tr 3/8 7- 1/8 ,(a 7 ) 1/3 = ®- 1/8 . 

The final condition of equation (10), which determines 
the integration interval T before normalization, is 
implicit and must be solved by an iterative method. 
Writing 


P(T) = ^- 1/f W)-/, 


(12a) 


the desired value of T is a solution of F(T) = 0, which 
always exists for / in the range of ^ _1/ Voo- Lor 
simplicity, we solve this equation using Newton’s 
method, for which the desired solution T is the limit 
of the iteration defined by 


T„ + i — T n 


F(T) 

F\T) 9 


(12b) 


with the prime here denoting differentiation with re¬ 
spect to T, and with the initialization of Tj specified 
below. The result of this iteration, when substituted 
into equations (11a) and (lib), is the set of parameters 
a, /?, y of a normalizing affine transformation matrix 
A with the form of equation (6). In order to get 
a normalizing transformation that behaves smoothly 
as T -»0, we take as the final normalizing transforma¬ 
tion the rescaled matrix A = A~ n \ tcircle A, with 
Anit circle the transformation obtained by applying the 
above procedure to a unit circle passing through the 
origin and tangent to the y-axis there. Corresponding 
to this choice, we parameterize / as /=/(Ax), with 
A a the arc length along a unit circle and with 


/(Ax) = 


__ ^ - 1/8 / A \ A 

unit circle v^X/^X? 


(12c) 


so that the limit of vanishing normalization interval is 
simply the limit A^ -► 0. With this choice of /, a conve¬ 
nient initialization for the Newton iteration is Tj =A^. 
(Formulas for the central moment integrals and 
£$" 1/8 computed from such a unit circle are given in 
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Fig. 1. Normalization results. Ellipses related by affine transformation (upper left) normalize to the 
same unit circle (upper right). Generic curves related by affine transformation (lower left) normalize to 
the same curve (lower right). The solid portion of each curve is the segment used for normalization. 

Typically 5-6 Newton iterations were used. 


Appendix B.) The matrix A again has the form of 
equation (6), with parameters a, /?, y, and the final 
normalized curve C has the parameterized form 

= Xx(f) = [ax(tx + yy(0J- (13) 

The transformation A has the useful property that it 
transforms any ellipse C through the origin, and tan¬ 
gent to the y-axis there, into a unit circle C, with 
A a the arc length on the unit circle of the segment used 
for normalization (see Fig. 1). Again, the normalizing 
transformation depends on the choice of the point 
P in addition to the interval size A a . 

To determine the correspondence between target 
segment and the templates, we now do a search over 
the templates, over choice of fiducial point P' on each 
template, and over the two possible senses of the 
tangent vector at F, applying the same normalization 
recipe as was applied to the target segment and look¬ 


ing for a match. The search for a match is again done 
by comparing the normalized target and template 
using an appropriate measure for the degree to which 
they coincide. 

Variants on this procedure are possible. For in¬ 
stance, we could instead (see reference (1)) use the 
affine subgroup defined by matrices of the form 


5 - (J (,4) 

which give a general affine transformation when com¬ 
bined with a similarity transformation. In this case 
one would only require ti A:02 (T A ) = ti A:20 (T A ) as a nor¬ 
malization condition on the moments \i A:02 and ti A . 2 o, 
without requiring the common value to be unity. One 
would then have to do a scaling normalization, using 
the method described in Section 2, following the par¬ 
tial normalization with respect to the affine trans- 
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formation of equation (14), and this scaling normaliz¬ 
ation would have to be placed inside the Newton 
iteration loop which determines the integration inter¬ 
val T. 

To conclude, we have shown that the normalization 
methods of reference (1) are not limited to the case 
of non-occluded images. When applied to partially 
occluded curves, they give a method for similarity 
normalization using only the tangent vector (which 
requires only first parametric derivatives), and 
a method for affine normalization using only the tan¬ 
gent vector and curvature (which requires only first 
and second parametric derivatives). 

4. SUMMARY 

We extend the general theory of image normaliz¬ 
ation developed by Adler (1) to the case of planar curve 
segments. Specifically, we show that the method can 
be used to obtain a normalization, and hence all the 
invariants, of partially occluded planar curves sub¬ 
jected to similarity and affine transformations. The 
similarity normalization uses only the tangent vector 
(which requires only first parametric derivatives), 
while the affine normalization uses only the tangent 
vector and the curvature (which requires only first and 
second parametric derivatives). Thus, our algorithm 
gives a substantial improvement over previous 
methods in the literature for constructing similarity 
and affine invariants of partially occluded planar 
curves. Our algorithm is based on the subgroup 
method/ J) in which we solve in succession the normal¬ 
ization problem for planar curve segments under 
translations and rotations, similarity transformations, 
and finally affine transformations. The similarity nor¬ 
malization is given by an explicit parametric integral 
over the curve segment; the affine normalization is 
given in terms of parametric integrals over the curve 
segment by a single, rapidly convergent Newton iter¬ 
ation. We give all the formulas needed for construct¬ 
ing the normalization algorithm, and a figure 
illustrating typical computational results. 
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APPENDIX A 

We derive here a framework for the normalization 
of planar curves, starting from the general theory of 
image normalization given in reference (1). Let C be 
a planar curve segment x = xc(0 parameterized by t, 
which in the terminology of reference (1) corresponds 
to an image intensity per unit area /(x) given by the 


reparameterization invariant expression 



V 

*max 

df|T|£ 2 (x - xc(0)- 

^min 


(Ala) 


In Equation (Ala), £ 2 (x) is the two-dimensional Dirac 
delta function, defined by £ 2 (x) = 0 for x ^ 0, and 
jj d 2 x£ 2 (x) = 1, so that £ 2 (x) is a distribution of unit 
weight with support at the origin x = 0. Also in equa¬ 
tion (Ala), T is the tangent vector defined in equation 
(lb), so that ds = df|T| is the differential of arc length, 
and therefore equation (Ala) describes a distribution 
with support on the curve segment C having uniform 
weight per unit of arc length. The total image intensity 
integrated over area, corresponding to equation 
(Ala), is given by 


t* 


d 2 x/(x) 




d 2 x5 2 (x - x c (0) 

f max 

dr|T| = J ma * - Smin, (Alb) 

^min 


dt|T| 


and so is just the total arc length of the segment. 

Let now ^ = {S} be a group of symmetry or view¬ 
ing transformations S, which act on the image /(x) 
according to 


I(x) U 7 s (x) = 7(S(x)), (A2a) 


with x -»S(x) the image coordinate mapping induced 
by the group element S. Substituting Equation (Ala) 
into equation (A2a), we have 


fs(x) = 


•t 

*max 

dt|T|<5 2 (S(x) - xc(0) 


*max 

df|T||J(x)|' 1 5 2 (x - S' 1 (x c (t))), 

^iTiin 


(A2b) 


with J(x) the Jacobian of the transformation S(x). 
For the case of similarity and affine transformations 
discussed in this paper, the Jacobian J is simply a 
constant and plays no further role; the image trans¬ 
formation of equations (A2a) and (A2b) is then equiv¬ 
alent to the replacement of the curve segment C by the 
transformed curve segment Cs described by the para¬ 
metric expression 


x Cs (t) = S~\x c (t)). (A3) 


Using equation (A3), the various results for image 
normalization given in reference (1) can be taken over 
to the planar curve case, with due attention to the fact 
that the inverse transformation appears in equation 
(A3). In particular, since 

f(g(x)) _1 =g~ 1 (f' 1 (x)), (A4) 


expressions in reference (1) which involve multiple 
transformations will have the factors reverse ordered 
when expressed as an action on the parameterized 
curve. For example, the general normalization recipe 
for the parameterized curve reads 


x c (t) = M c (Nc(xc(0)), (A5a) 
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where Nc(x) is a normalizing map constructed from 
the curve which transforms under the image trans¬ 
formation as 


N C5 (x) = N c (S(x)), (A5b) 

and where Mc(x) is an optional second transforma¬ 
tion which depends on the curve C only through 
invariants under the group of transformations that 
is, 


M Cs (x) = M c (xX all S e (A5c) 

We can easily check the validity of equation (A5a) 
directly, 


\ Cs (t) = Mc s (N Cs (x Cs (0)) 

= M c (Kc(S(S-\xcm)) (A6) 

= Mc(Nc(x c (0)) = x c (f> 

The other general statements in reference (1) are sim¬ 
ilarly converted to results for the normalization of 
parameterized curves. 

APPENDIX B 

We give here the central moments for a unit circle 
passing through the origin and tangent to the y-axis 
there. The parameterized form for the circle is 

Xunit circle^) = ( ~ 1 + COS t, SU1 t\ 0 < t < 2 7T, (Bl) 

and the origin lies at t = 0. Equation (7) for the affine 
covariant arc length reduces to dx = dt , and we 
choose the constant of integration so that t = 0 at the 
origin. Integrating over a segment of the unit circle 
0 < t = t < T, we find the following formulas for the 
center of mass coordinates and the second central 
moments, 

(x C M, ycM) =T~\ — T -f sin T, 1 - cos T), 


/rr , x T L sin 2 7 

rf )-2 rif 


1 sin 2 T 

Hoi(T) 2 ( 1 2T 


- y (sin Tf, 


4 / . T 
— — 1 sin — 
T\ 2 


(B2a) 


T ( T 2 T\ 
/in (T) = sin T sin y (cos— - ^ sin— I, 


and the corresponding small T behavior is 
(xcm, yew) ^ 




^20(T) 


Voi(T) 


1 rp 5 

* T 7 

45 1 

315 ’ 

1 7 3 

~T 5 , 

12 

40 

1 T 

4 | ^ j. 

24 1 

+ 720 1 


(B2b) 


■ 1/8 (T) = [/i20(l>02(T) - /iu(T) 2 ]- 1/8 

3 _ 


8640 1/8 T“M 1 + 


280 
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We develop the theory of the nonadiabatic geometric phase , in both the Abelian 
and non-Abelian cases , in quaternionic Hilbert space. 


1. INTRODUCTION 


The theory of geometric phases associated with cyclic evolutions of a physi¬ 
cal system is now a well-developed subject in complex Hilbert space. The 
seminal work of Berry on the adiabatic single state (Abelian) case (1) has 
been extended to the non-Abelian case of the adiabatic evolution of a set 
of degenerate states, (2) and both of these have been further extended' 3 ' 4) to 
show that there is a geometric phase associated with any cyclic but non¬ 
adiabatic evolution of a single quantum state or of a degenerate group of 
quantum states. 

In this paper we take up another direction for generalization of the 
geometric phase, from quantum mechanics in complex Hilbert space to 
quantum mechanics' 5,6) in quaternionic Hilbert space. The generalization 
of the adiabatic geometric phase to quaternionic Hilbert space was given in 
Ref. 6, where it was shown that for states of nonzero energy the adiabatic 
geometric phase is complex, as opposed to quaternionic, with a quater¬ 
nionic adiabatic geometric phase occurring only for the adiabatic cyclic 
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evolution of zero energy states. Consideration of nonadiabatic cyclic evolu¬ 
tions was also begun in Ref 6, but the discussion given there is incomplete. 
While Sec. 5.8 of Ref. 6 constructed a nonadiabatic cyclic invariant phase, 
it did not address the problem of separating this phase into a dynamical part 
determined by the quantum mechanical Hamiltonian, and a geometric part 
that depends only on the ray orbit and is independent of the Hamiltonian. 

The purpose of the present paper is to give a complete discussion of 
the nonadiabatic geometric phase in quaternionic Hilbert space. In Sec. 2 
we give a very brief survey of the properties of quantum mechanics in 
quaternionic Hilbert space that are needed in the analysis that follows. 
In Sec. 3 we consider the cyclic nonadiabatic evolution of a single quantum 
state, and show how to explictly generalize to quaternionic Hilbert space 
the construction of a nonadiabatic geometric phase given in Ref. 3. In Sec. 4 
we extend our analysis to the case of a degenerate group of states, thereby 
obtaining a quaternionic nonadiabatic non-Abelian geometric phase 
corresponding to the complex construction given in Ref. 4. A brief sum¬ 
mary and discussion of our results is given in Sec. 5. 


2. QUANTUM MECHANICS IN QUATERNIONIC HILBERT SPACE 

Only a few properties of quaternionic quantum mechanics are needed 
for the discussion that follows; the reader wishing to learn more than we 
can present here should consult Ref. 6 . In quaternionic quantum 
mechanics, the Dirac transition amplitudes are quaternion valued, 

that is, they have the form 

<iAI^> =rv + r x i + r 2 j + r 7t k ( 1 ) 

where r Q lt2t3 are real numbers and where /, j\ k are quaternion imaginary 
units obeying the associative algebra i 2 = j 2 = k 2 = — 1 and ij= —ji = k 9 
jk= —kj=i , ki= —ik = j . Because quaternion multiplication is noncom- 
mutative, two independent Dirac transition amplitudes and <at [ 17 ) 

in general do not commute with one another, unlike the situation in 
standard complex quantum mechanics, where all Dirac transition 
amplitudes are complex numbers and mutually commute. The transition 
probability corresponding to the amplitude of Eq. (1) is given by 

K*AI^>I 2 =<*AI^> <iAI^> = rl + r 2 x +r\ + r\ ( 2 ) 

where the bar denotes the quaternion conjugation operation {/, y, k } -> 
{—i, —j, —k) and where we have assumed the states |i p) and \(j>) to be 
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unit normalized. Since the quaternion norm defined by Eq. (2) has the 
multiplicative norm property 

\(h(h\ = kil \q 2 \ ( 3 ) 

the transition probability of Eq. (2) is unchanged when the state vector |</>} 
is right multiplied by a quaternion co of unit magnitude, 

l^>-H <t>) (O, |eo| = 1 (4) 

Hence as in complex quantum mechanics, physical states are associated 
with Hilbert space rays of the form {| (j>) co: |a>| = 1}, and the transition 
probability of Eq. (2) is the same for any ray representative state vectors 
\(j>} and 1 1 p} chosen from their corresponding rays. In the next section, we 
shall follow Ref. 3 in denoting quaternionic Hilbert space by and the 
projective Hilbert space of rays of by 0. 

Time evolution of the state vector |i py is described in quaternionic 
quantum mechanics by the Schrodinger equation 

<9 li by ~ 

(5a) 

with 

H=-H f (5b) 

an anti-self-adjoint Hamiltonian. From Eqs. (5a) and (5b) we see that the 
Dirac transition amplitude <i//| <j>y is time independent, 

= l)l*> + <*l|l*> 

= ^\H-H\<t>y=o ( 6 ) 

and thus the Schrodinger dynamics of state vectors preserves the inner 
product structure of Hilbert space. The dynamics of Eqs. (5) and (6) is 
evidently preserved under right linear superposition of states with quater¬ 
nionic constants, 



d(liA) qi + l<ft) q 2 ) 

^ dt 

= —H(\\p} q , + \(j >> q 2 ) 


(7) 
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Equation (7) illustrates two general features of our conventions for quater- 
nionic quantum mechanics, which are that linear operators (such as H) act 
on Hilbert space state vectors by multiplication from the left, whereas 
quaternionic numbers (the scalars of Hilbert space) act on state vectors by 
multiplication from the right. Adherence to these ordering conventions is 
essential because of the noncommutative nature of quaternionic multiplication. 


3. THE NONADIABATIC ABELIAN QUATERNIONIC GEOMETRIC 
PHASE 


Let us now consider a unit normalized quaternionic Hilbert space 
state \\p(t)} which undergoes a cyclic evolution between the times t = 0 and 
t = T, Since physical states are associated with rays, this means that 

hKr» = wo»fl, |fl| = i (8) 

and so the orbit ^ of |iin projects to a closed curve $ in the 
projective Hilbert space 

Let us now define a state that is equal to |i//(0> at t = 0, that 

differs from |i 1/(0} only by a reraying at general times, i.e., 

hA(0> = l<M0> cb(t) 

\cb(t)\ = 1 (9a) 


c£>( 0 ) = 1 


and that evolves in time by parallel transport, i.e., 


a 



(9b) 


The conditions of Eqs. (9a) and (9b) uniquely determine (b(t), and hence 
the state |$(t)>, as follows. Substituting the first line of Eq. (9a) into the 
Schrodinger equation of Eq. (5a), we get 


- h mt)y cb(o= 


— H l<M0> 
3 1 ^( 0 ) 


IV'(O) 


da{t) d |i j/(t)> 

— 7— +-5- - O >(0 


( 10 ) 


dt 


dt 


dt 
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Taking the inner product of this equation with the state <i£(t)|, and using 
the unit normalization of the state vector |itogether with the parallel 
transport condition of Eq. (9b), we get 


dcb(t) 

dt 


<kt)\ h iiA(o> m 



This differential equation can be immediately integrated to give 


dj(t)= T t e~^ dv 



where T t denotes the time-ordered product which orders later times to the 
left, and where we have used the initial condition on the third line of 
Eq. (9a). In particular, Eq. (12) gives us a formula for the value co(T) at 
the end of the cyclic evolution. We shall see that this has the interpretation 
of the dynamics-dependent part of the total phase change Q. 

To relate Eq. (12) to the total phase change, we use Eqs. (8) and (9a) 
to write 


I kT)> cb(T)=mT)) = |iA(0)> Q= |0(O)> Q (13a) 

so that taking the inner product with <i^(0)| gives 

Q = <$(Q)\ $(T)} d>(T) (13b) 

To complete the calculation, we must now evaluate the inner product 
appearing in Eq. (13b). To do this, we introduce a third state vector | \p(t)} 
which differs from |$(t)> by a change of ray representative, by writing 

hM*)> = l*(0> "(0 

1*5(01 = 1 (14a) 


£U(0)= 1 


and by requiring that ij/ should be continuous over the orbit 


\$(T)> = |#(0)> 


(14b) 


Differentiating the first line of Eq. (14a) with respect to time, we get 




dt 


dt 


dt 


(15) 
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Taking the inner product of Eq. (15) with a>(t) <i//(t)|, using the parallel 
transport condition of Eq. (9b) together with the first line of Eq. (14a), and 
abbreviating the time derivative d/dt by a dot, we obtain 

0 =<$(OI $ 1(0) <5(0+ <#(01#(0> co(t)) (16a) 

Since the second line of Eq. (14a) implies that the state \\p(t)} is unit nor¬ 
malized, Eq. (16a) simplifies to 

<5(0)=-<^(0l^(0><5(0 06b) 

which can be immediately integrated to give 

(17) 


with T { as before indicating a time-ordered product. In particular, Eq. (17) 
gives us a formula for ca(T), But from Eqs. (14a) and (14b) we have 


\$(T)} = \$(T)} cb(T)= |«A(0)> cb(T)= |«A(0)> cb(T) (18a) 

and so taking the inner product of Eq. (18a) with <$(0)| we get 

<*A(0)| \j/(T)} = d>(T) (18b) 

determining the inner product appearing in Eq. (13b). 

We thus get as our final result, 


with 


and with 


^ ^geometric ^dynamical 


^geometric 


<2dy_, =db(T) = T,e^ 


(19a) 


(19b) 


(19c) 


The dynamical part of the phase is so called because it depends explicitly 
on H , as well as on the orbit $ in the projective Hilbert space it is 
uniquely determined by the conditions of Eqs. (9a) and (9b), since these 
conditions uniquely determine the state |iThe geometric part of the 
phase is so called because, as we shall now show, it depends uniquely on 
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the projective orbit up to an overall quaternion automorphism transfor¬ 
mation. To see this, let us make the reraying 

|G)'| = 1 (20a) 

with co’(t) continuous over the orbit # so that 

cd , (T) = cd’( 0) (20b) 

Then (as shown in detail in Sec. 5.8 of Ref. 6) the properties of the time- 
ordered integral in Eq. (19b) imply that under this transformation, 

^geometric -W(r)& geometric a/( 0) (21a) 

which by the continuity condition of Eq. (20b) reduces to the quaternion 
automorphism transformation 

^geometric * ^ (0) ^ geometric^ (^) (21b) 

Since for any two quaternions q Xi q 2 we have Refl 1 fl 2 = R e ? 2 ?i> with Re 
denoting the real part, Eq. (21b) implies that 


COS }*geometric ^geometric 


( 22 ) 


is a reraying invariant, and thus y geo metric is a nonadiabatic geometric phase 
angle that is a property solely of the projective orbit The fact that the 
nonadiabatic geometric phase in quaternionic Hilbert space is only deter¬ 
mined modulo n is a reflection of the fact that e iy is changed to e~ iy by the 
quaternion automorphism transformation 


e~ iy = je iy j (23) 

Thus, to recover the result that the complex nonadiabatic geometric phase 
is determined modulo In by embedding a complex Hilbert space in a 
quaternionic one and using Eqs. (19a)-(19c), one must exclude the possi¬ 
bility of making intrinsically quaternionic automorphism transformations 
involving the quaternion units j or k , as in Eq. (23). 

In geometric terms, £2 geornetric is the holonomy transformation of the 
connection A = But since this connection is quaternion-imaginary 

valued, it is analogous to an 5(9(3) gauge potential. Therefore, the 
corresponding curvature is of the Yang-Mills type and is given by 
F= dA + A a A, 
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An alternative expression for the total phase change Q can be 
obtained * 1 ] by writing 


l<MO> = l*(0> x(t) 

*(o) = i 


(24) 


Substituting Eq. (24) into the Schrodinger equation and then taking the 
inner product with <$(01, we obtain 


dx(t) 

dt 


= -«^(0l^l^(0>+<^(0l^(O»z(0 


(25a) 


which can be integrated from 0 to T to give 

q — y e R Ii5(t')> +- 


(25b) 


This procedure and the resulting formula of Eq. (25b) are direct analogs of 
the derivation given in Ref. 3 for the complex Hilbert space case, but in 
quaternionic Hilbert space the two terms in the exponential are noncom- 
mutative, and so the exponential in Eq. (25b) cannot be immediately 
factored into dynamical and geometric phase factors. As we have seen, to 
achieve this factorization it is necessary to use a two-step procedure, 
involving the parallel transported state |i p(t)} as well as the state 
that is continuous over the cycle. 


4. THE NONADIABATIC NON-ABELIAN QUATERNIONIC 
GEOMETRIC PHASE 

We turn next to the quaternionic Hilbert space generalization of the 
complex nonadiabatic (4) non-Abelian * 2) geometric phase. We consider now 
a cyclic evolution in an /7-dimensional Hilbert subspace V„, i.e., V„(T) = 
V n (0). Let |i p a (t)}, a = 1,..., n be a complete orthonormal basis for V n> so 
that the reraying invariant projection operator for V„ is 


P n (t)= X I'M*)><><,(01 (26a) 

a — 1 

in terms of which the cyclic evolution condition takes the form 


p n {T) = p n ( 0) 


(26b) 
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Expressed in terms of the state vectors of the basis, the invariance of V n 
implies that the basis element |i p b (T)} must be a superposition of the basis 
elements |i// H (0)>, multiplied from the right by quaternionic coefficients U abi 

\UT)>= £ m0)> U ab (27) 

a = 1 

Substituting Eq. (27) into Eqs. (26a, b), we find that invariance of the 
projection operator requires 

Pn(T)= £ \UT)XUT)\ 

b = 1 

= £ |iA a (0)> u ab u cb <iA c (0)i 

a, b, c = 1 

= £ \^ a m<^ a (0)\=p n (0) (28a) 

a = 1 


which implies that 


X U ab U cb = d ac (28b) 

b= 1 

Similarly, orthonormality of the basis at t = 0 and t = T implies that 


S ah = <^ a (T) I = £ U ca <^( 0 ) I ^( 0 )> u db 

C, d = \ 

= £ U ca d cd u db = £ U da U db (28c) 

c, d= 1 d = 1 

Hence U is an nxn quaternion unitary matrix, U^U= t/t/ t = 1, which 
replaces the quaternion phase Q (which is a lxl quaternion unitary 
matrix) of the preceding section. Thus, to generalize the results of the 
preceding section to the non-Abelian case (i) one replaces the state vectors 
11A), |$>, \$) by ^-component column vectors |i p Q }, \\p a }, 

a= 1(ii) one replaces the phases by nxn quaternion unitary 

matrices acting on the state vector indices, (iii) one replaces Re in Eq. (22) 
by Re Tr, with Tr the trace over the subspace V in and (iv) as in Ref. 4, one 
generalizes the parallel transport condition of Eq. (9b) to 


1 


d\H0> 



dt 


a,b= 1,..., n 


(28d) 
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The principal difference from the complex case treated in Ref. 4 is that in 
the quaternion case, the unitary matrix factors must always be ordered to 
the right of ket state vectors, whereas in the complex case the ordering is 
irrelevant, and in fact in Ref. 4 the matrix factors are ordered to the left. 
The results of Ref. 4 can be obtained by the complex specialization of the 
results obtained in this paper. However, we have introduced here a new 

A 

technique of using parallel transported states to cleanly separate the 
non-Abelian geometric phase and the dynamical phase, which in general 
(even in the complex non-Abelian case) do not commute with each other. 


5. SUMMERY AND DISCUSSION 

To summarize, we have shown that both the complex Abelian and 
non-Abelian nonadiabatic geometric phases can be generalized to quater- 
nonic Hilbert space. These results are both of theoretical interest and of 
experimental relevance for possible tests for complex versus quaternionic 
quantum mechanics. Long ago, Peres (8) proposed testing for quaternionic 
quantum mechanical effects by looking for noncommutativity of scattering 
phase shifts. However, the result of Ref. 6 that the S-matrix in quaternionic 
quantum mechanics is always complex valued (for nonzero energy states) 
implies that there are no quaternionic scattering phase shifts, and the Peres 
test necessarily gives a null result. An alternative but related method is to 
look for interference effects in cyclic evolutions that could show the 
presence of quaternionic effects. The fact (6) that the adiabatic geometric 
phase is always complex (for nonzero energy states) is a counterpart of the 
complexity of the S-matrix, and implies that a null result will always be 
obtained for cyclic interference experiments involving adiabatic state evolu¬ 
tions. However, the results obtained here show that for cyclic evolutions 
that are nonadiabatic, one could in principle devise interference 
experiments to place meaningful bounds on postulated quaternionic com¬ 
ponents of the wave function. 
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We develop Perelomov’s coherent states formalism to include the case of a qua- 
temionic Hilbert space. We find that, because of the closure requirement, an at¬ 
tempted quaternionic generalization of the special nilpotent or Weyl group reduces 
to the normal complex case. For the case of the compact group SU(2), however, 
coherent states can be formulated using the quaternionic half-integer spin matrices 
of Finkelstein, Jauch, and Speiser, giving a nontrivial quaternionic analog of co¬ 
herent states. © 1997 American Institute of Physics. [S0022-2488(97)01005-0] 


I. INTRODUCTION 

The coherent states formalism is an important part of the apparatus of complex quantum 
mechanics, and in this framework has been given a general and elegant form through the work of 
Perelomov. 1 However, in a recent systematic study of quantum mechanics in quaternionic Hilbert 
space, 2 the issue of whether there is a quaternionic analog of coherent states was left open; filling 
this gap is the object of the present paper. In Sec. II we show that the general Perelomov con¬ 
struction readily extends to quaternionic Hilbert space, even when the subtleties arising from 
projective group representations 3 are taken into account. In Sec. Ill we demonstrate that when this 
quaternionic generalization is applied to the special nilpotent or Weyl group, the requirement of 
group closure reduces the structure of the coherent states so obtained to a quaternionic embedding 
of the standard complex construction. Hence, as suspected by Klauder, 4 there is no nontrivial 
quaternionic generalization of the standard complex coherent states based on the Weyl group. As 
an application of our formalism to a case in which the quaternionic coherent states are not simply 
embeddings of the corresponding complex ones, we discuss in Sec. IV the case of the quaternionic 
coherent states constructed by the Perelomov method based on the intrinsically quaternionic half¬ 
integer spin representations of the rotation group. 


II. GENERAL PROPERTIES OF PERELOMOV COHERENT STATES IN QUATERNIONIC 
HILBERT SPACE 

Let | if/ Q ) be a fixed state in a quaternionic Hilbert space V For some Lie group G and its 
irreducible unitary representation, {T(g):g e G}, consider the set of states {|^)}, where 

\'/'g)=T(g)\4> 0 ). 

Consider transforming from a state \i// gi ) to another \>P g )- In terms of \<p g ), 

\<!'o)=T- l (gi)\>!' gi )=T(gy)\i// gi ), 


hence, 


^Electronic mail address: adler@ias.edu 

^Electronic mail address; amillard@phoenix.princeton.edu 
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\*l'g 1 )= T (gi)T(gi l )\^g x ) == T(g 2 g l l Mg x )^ P {g2^gi *), 

where the factor of cv p (g 2 ,g ~ l ) is a quaternionic phase, inserted so that projective 
representations 3 may also be considered in this approach. Then, if 

where a> g (g 2i g j” 1 ) is another quaternionic phase, then 

or, in other words, \tp g2 ) and | ^ ) differ only by a phase factor and hence determine the same 
physical state. 

Let IT be the set of elements {h} in G such that 

T(h)\if/ 0 )=\if/ 0 )co(h). 

Then IT is a subgroup of G, being the stationary group for the ray containing |*/r 0 ). Forming the 
set of left cosets M= GIH , for each coset x e M, one representative g(x) can be selected to form 
the set of states {|^(jc))} = {!*)}• The following definition may then be made: 

Definition 1: The system of coherent states of type ( T,\if / 0 )) is the set of states {|^)}» where 
| ipg)— T(g)\if/ 0 ) and g runs over G. The coherent state \ tf / g ) is determined up to a quaternionic 
phase by the coset x=x(g), which is an element of GIH, corresponding to the element g\ that is 

I ipg)=\x>(o(g), 

where \ if/ 0 ) is henceforth abbreviated as |0). 

Consider A!, h 2 sH . A general element of G is g = g(x)h, whereg(x) is a particular element 
corresponding to a coset in GIH and g(0)= 1. From before, 

|^)=7’(g)|0)=|x)«(g), 


so hr g 1 = g(x)h 1 and g 2 = g(x)h 2 , 


T(gi)\0)=\x)a>(x,h l ) 

and 


similarly, if 


then 


T{g 2 )\Q)=\x)w{x,h 2 )\ 


gn=g(x)h 1 h 2 = g 1 h 2 , 


T(gn)\0) = \x)eo{x,h l h 2 ). 

Now consider the case where \x) = | 0); Eqs. (l)-(3) then become 

n«i)|0)=|0)a»(Aj), r(g 2 )|0)=|0)a»(A 2 ), 

T’(*i 2 )|0)=|0)a,(A 1 A 2 ). 


(1) 

( 2 ) 
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However, since g(0)=l implies thatgi = /ij and g 2 = h 2 , then gn= g}h 2 = gig 2 > allowing for 
projective representations, 

ng 1 2)|o)=ng 1 g2)|o)=ng 1 )ng2)|o)«; 1 (g.,g2)=ng.)ng2)|o)«; l (ft 1 ^2) 

= T(g 1 )\0)(o{h 2 )(o ~ 1 (h 1 ,h 2 )= \0)(o(h 1 )u{h 2 )(o~ l (h lt h 2 ). 


giving 


^(^ih 2 ) = (o(h l )co(h 2 )(o p l (/ii ,h 2 ). (4) 

If T is a true representation, as opposed to a projective representation, then the projective phases 
are unity and 


(o(h l h 2 ) = (o(h l )(o(h 2 ) i 


in correspondence with the complex phase relationship 

exp[ i a(/i i/i 2 ) ] = exp[ i <*(/i i) ]exp[ i <*(/i 2 ) ] 


given by Perelomov. 

Consider now elements g e G and h e //and the action of the corresponding operators on |0). 
Now 


TWIOHOMA) 

and 


and similarly 


then, 


ng)|0) = |x(g))«(g), 

n«*)|0)=|x(gA))w(gA); 


r(«A)|0) = r(g)r(A)|0)a,; 1 ( ij A)=|x(g))a,(g)a,(A)a,; 1 (g,A) J 

and since x(gh) = x(g), this means that 

c o(gh) = a>(g)co(h)w-'(g,h ), (5) 

which is the same phase relationship as in Eq. (4) but with one of the elements of G now not in 

H. 

Finally consider the action of an arbitrary operator, T{g r ), on an arbitrary coherent state, 
|jc). This may be written 

n* , )|x)=r(g')|x(g))=r(g')r(g)|o)a,- 1 (g)=r(g , g)|o)a, p (g' I g)a,- 1 (g) 

= \x(g 'g))oj(g' g)«> p (g' ,g)a>~ l (g) = lx(g' g))6(g' ,g), (6) 

where we have defined the new phase 

Replacing g by gh, where h is an element of //, and using Eq. (5) gives 
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9{g' ,gh)= a>{g'gh)co p (g' ,gh)co~'(gh) 

= "(#' g)<>>{h)bj~\g' g,h)co p (g' ,gh)w p (g,h)a>- 1 (h)a>~ 1 (g)-, 

from the associativity condition for projective representations, 

*>p(g' ,gh)a> p (g,h)= co p (g'g,h)w- } (h)w p (g' ,g)a>(h), 

we see that the middle five factors of the last expression for 6(g f ,gh) are simply equal to 
giving 

0(g' ,gh)= a>(g'g)a> p (g' 

Hence, changing g to gh, where h is any element of H, gives the same 0, so it may be written 
0(g\x), since it only depends on the coset jt(g) and not on g itself. 

Writing two coherent states as 

ki)=k(gi))= 7 ’(gi)|0)«“ 1 (gi)=7’(g 1 )|0)«(g 1 ), 

1 ^ 2 )= l^(s r 2)) = 1 (^ 2 )= 7’(<g 2 )|0)oJi(^ 2 ), 

their inner product is 

(xi\x 2 ) = (x(g 1 )\x(g 2 )) 

= M (gi)(0\T(g^ l )T(g 2 )\0)co(g 2 ) = uig^iomg^ g 2 )\0)u p (g^ 1 ,g 2 )co(g 2 ) 

= "(g 1 )"p(g2 1 .gi)<0|7’(g7’«2)|0)«(g 2 ); 
replacing g { by g { h, where h is an element of H , in the last line gives 

(x 1 |x 2 )=«(g 1 A)ftr p (g7 1 ,g 1 A)(o|r(A“ 1 g 1 _1 g 2 )|o)«(g2) 

= (o{g l )co{h)(o~ l (g 1 ,h)(o~ 1 {g2 1 ,g l h)co p {g2 1 g l ,A)(0|7’(A“ 1 )7’(gj; 1 g 2 )|0)ft7(g 2 ) 

= oi(g\)W(h)a) p 1 (g 1 ,h)co~ 1 (g^ 1 ,gift)« p (g7 1 gi,ft)«“ 1 (A)]<0|7’(g7 1 g 2 )|0)ftJ(g 2 ) 

= «(gi)«“ 1 (g^ 1 ,gi)<o|r(gj; 1 g 2 )|o)ft7(g 2 ) 

= "(gi)ftV(g^ 1 ,gi)(0|7’(g7 1 g 2 )|0)ftJ(g 2 ), 

where, in a very similar way to before, the second to sixth factors in the third line have been 
contracted via the projective representation associativity condition to give <o p l (g 2 l ,gi) in the 
fourth line. Thus, as implied by our notation, the inner product does not depend specifically on 
g l but just on the coset jcj = jc(^ x), and this can similarly be shown forg 2 - If all coherent states 
are operated on by the same T(g), then the inner product of two of the new states, using Eq. (6), 
is 

(x(gg l )\x(gg 2 ))=e(g,x l )(x 1 \T~ } (g)T(g)\x 2 )6(g,x 2 )=6(g,x 1 )(x 1 \x 2 )e{g,x 2 ). 

Let us now assume that the invariant measure dg on the group induces an invariant measure 
dx on the set of cosets M=G/H. Given sufficient convergence, consider the operator 
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from the definition of B and the invariance of the measure, Eq. (6) implies that 


T(g)BT l (g) = T(g)\y)(y\T{g *) dy= Wg3O>0(soO0(soO<*(g3O| dy 


= J |*)(jc| dx~B , 

so B commutes with all of the operators T(g ). By the quaternionic generalization of Schur’s 
Lemma, 5 this means that B is of the form £ 0 1 + BJ, where B Q and B x are real, 1 is the usual 
identity operator, and I is a unit anti-self-adjoint operator, 7 t= -/; however, since B is clearly 
self-adjoint, B x must vanish, so B is a multiple of the identity as in the compex case. Given a 
coherent state | y) that is normalized, (y|y) = 1, 


Bo=(y\B\y)= (y\x)(x\y) dx= \{y\x)\ 2 dx= \(0\x)\ 2 dx; 


hence, 


1 

B~o 


jc)(jc| dx=l. 


With this form of the identity, an arbitrary state may be expanded over the coherent states, 



x)ip(x) d; c, 



where 


Then, 


however, 


ip{x) = {x\4i). 


(<AI <P)~ 71 

“o 


<l'(x)(x\y)i//(y) dx dy\ 


i//(x)=(x\il/) = (x 


1 

B~o 


\y)(y\'/') dy= — 

*>0 


( x \y)*Ky) d y. 



SO, 


{ 4 >\<P)= y j I'M*)! 2 dx. 

Defining 


K (x,y)= y W?)- 

Eq. (8) implies that this is a reproducing kernel, 


J. Math. Phys., Vol. 38, No. 5, May 1997 




R62 


725 


2122 S, L Adler and A. C, Millard: Coherent states in quaternionic quantum mechanics 


K(x,z)- K(x,y)K(y,z ) dy. 


and the function 


/(*) = J K(x,y)f(y) dy 

satisfies Eq. (8), in the place of ip(x), for an arbitrary function f(x). If 10) is itself a coherent state 
|y), then, from Eq. 7, 



so the coherent states are not linearly independent, meaning that the system of coherent states is 
overcomplete. 


III. THE CASE OF THE SPECIAL NILPOTENT OR WEYL GROUP 

Having extended Perolomov’s formulation of coherent states to a quaternionic Hilbert space, 
we continue to follow his paper and consider the case of the nilpotent group. In the complex case, 
this group leads to the familiar coherent states widely used in quantum optics. The special nilpo¬ 
tent or Weyl group is generated by a set of annihilation operators, {a,}, where i runs from 1 to 
N, their conjugate creation operators, {a]}, and the identity operator, 1. The commutation rela¬ 
tions between these operators are 

[>.■ >*_/]=[>! ,a]] = [a h l] = [aj,l] = 0 

and 

Let the E A , where A runs from 1 to 3, be three quaternion imaginary operators 6 with an algebra 
isomorphic to the algebra of i, j , and k and all of which commute with the a t and the a] . Then, 
an anti-self-adjoint 7 element of the Lie algebra of the group may be written 

t +2 A-a,—2 A a], 

i i 

where t is a quaternion imaginary operator, 

t — 2 ’ 

A 


and the /3, are quaternion operators, 


A^AoI + 2 PiA^A- 

A 

For convenience, the generator is written using a shorter notation, 

t + pa — . 
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The group is then obtained from the algebra by means of the exponential mapping, so that for a 
general element g e G,T(g) may be written 

T(g) = (t,/3)=exp(t + 

In the case of a complex Hilbert space, as considered by Perelomov, group closure follows 
quickly. However, in the quaternionic case, t and the /3 ( may be noncommutative, and requiring 
group closure will impose significant restrictions. Consider, then, the product of two group ele¬ 
ments, 


(^,a)(r,/3) = exp(^+ aa — oa t )exp(r +/3a — /3a 1 ”); 
using the Baker-Campbell-Hausdorff formula to second order, 

exp X exp T=exp(X+ T+ -1-), 


the product may be written 

exp{^ + 1+ (<* + (3)a — (a + /3)a t + aa — aa^,t + pa — /3a t ]}, 

and so, to obtain a group, this requires that 

^[s+aa — aa^ ,t + fia — (3a^]=u + ya— ya (9) 
In particular, the coefficients of a^j and ajaj must vanish, which requires that 

for each i and j; hence, all and must belong to the same /) subalgebra rather than 
being free to range over any quaternion. With this constraint, the coefficients of and a]aj also 
vanish, and Eq. (9) implies that 

U=\{s,t], y,= 2 l>,A]+ 2 l>i>*]- 

However, the y- must have the same structure as the a ( and fi { and thus belong to the same 
gf( 1, /) subalgebra, which requires that s and t are simply proportional to I —consequently, u and 
the y t vanish. Therefore, for group closure, the representation can only be 1, 1) embedded in 
the quaternionic Hilbert space rather than fully quaternionic. For the case of the nilpotent group, 
then, there is no quaternionic generalization of standard complex coherent states. 


IV. THE CASE OF INTRINSICALLY QUATERNIONIC IRREDUCIBLE REPRESENTATIONS 
OF SU(2) 

We consider now the anti-self-adjoint generators of SU(2), S x , S y and S z , such that 8 

[S/ 2 e lmn^n ■ 

n 

It can readily be observed that a quaternionic realization of this algebra is 

^ y — ?/, S z = , 

which is a one-dimensional quaternionic irreducible representation of SU(2). Consider eigenstates 
of S z ; these can be chosen to be 
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such that 

S z \± j)=± j|± %)k. 

Then choosing either of these states as |0), the stationary subgroup is 

H={qx p aSj. 

Following Perelomov , 1 a coherent state based on such a |0) may be characterized by a vector n or 
by a polar angle 6 and an azimuthal angle </>. For the purposes of an example, choose |0) to be 
1 5 ) and then 


n) = exp <f>S z exp 0S y \O) = Qxp \<t>k exp \dj\ 


then 


(n'|n) = exp— j exp exp \0j 


and hence 


I <0| n) 1 2 = 1 


so that 


B 0 = J |(0|n)| 2 dn=47T. 


Correspondingly, 


^ f |n)(n| dn=l. 


as expected. 

Since, with the above definitions for the S n , 

r2 r2 r»2 r»2 3 

O — C» x Oy 0^ — 4, 

this can be seen to be a one-dimensional spin \ representation of SU(2). This is a special case of 
a general result due to Finkelstein et al . 9 which states that besides the real, integer spin 
(Frobenius-Schur class +1) and the half-integer spin (Frobenius-Schur class —1) representa¬ 
tions, there are quaternionic representations for half-integer spin of precisely half the dimension of 
the Frobenius-Schur class — 1 representations for the same spin. For instance, one choice for the 
spin \ representation is 


Sx 2 *1 ^3j 


0 


1 / 2 


1 (1 0 \ 


0 j 

’ S ‘“2*lo 3 ; 


Then, 


S 2 = -S 
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2125 


so the spin is indeed \. Consider, as before, eigenstates of S z ; these may be chosen to be 


3 

2 


0 

1 


1 

2 




'1 

or 


1 

2 


j 

or 


3 

2 


o 

t 

j 


Again, when one of these is chosen as |0), the stationary subgroup is 


ff={exp aS }, 


As an example, choose |0) to be ||) so that 


n) = exp <f>S z exp 0SJO) 


ex P 2 


1 Ik 0\ 1 / 2j 

\0 3*:J eXp 2 e -J3 


1 Ik 0 


1 


eXP 2^0 3 * 1 ^ 2 * 1-73 


1 


exp - <j> 


Ik 

0 \ 3 


i M 

j 

,0 

3 kj 4 


l 73/ 



2 j v3\ 3 

0 4 


1 \ 
J_ 

73 / 




j 3 

73“ P 2®- ,+ 


2 _' 
73 
\ 1 / 


73 

1 / 

1 

exp- - 0 / 


3 

4 



1 exp ' 

— 

3 , J 


ap 2 * 111 


3 

4 


1 

exp j <£* 




l 


l M 

m 

exp 2 ** 73 

1 

exp - - 0 / 

exp | 4>k j 



-exp - 0 j + exp - -«/ 


J_\ 
V3 


exp | #[ 3 l exp | 0 j + exp- } 0 j] / 


where the calculation has been carried out efficiently by decomposing (,) into S y eigenstates. 
From this, we find 


( n 'l n )~ u [i (-exp- I 0 'j + exp l 2 0 'j)exp \ (</>'-</>)*;(-exp § 0 / + exp- j 

+ ( 3 exp— | 0 'j + exp j 0'j) ex P f (</>-</>')*( I exp § 0 / + exp- j 0/)], 
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and hence 

|<01 n)| 2 = |+1 cos 26, 

so that 

B o= J l(0|n)| 2 dn=2n. 

Correspondingly, 


1 

277 


n)(n| dfl 


3 

877 


/ sin 2 6 


\x[0,4>) 


x(6,4>) 
j+cos 2 6 j 


d£l 


with 


1 

x( 6,<b) = — (/ sin 
V3 


2 


0 <?~ 2 **+sin 20 <?~**), 


which on doing the <f> integration becomes 


' sin 2 6 

0 ^ 


(\ 

0 ^ 

, o 

3 + cos 2 6) 

sin 6 dd= 

\0 

1 / 


3 f* 

4 Jo 


as expected. These examples can be readily extended to the general half-integral spin quaternionic 
irreducible representations of SU(2). 
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We extend the discussion of projective group representations in quaternionic Hil¬ 
bert space that was given in our recent book. The associativity condition for quater¬ 
nionic projective representations is formulated in terms of unitary operators and 
then analyzed in terms of their generator structure. The multi-centrality and cen¬ 
trality assumptions are also analyzed in generator terms, and implications of this 
analysis are discussed. © 1996 American Institute of Physics. 

[S0022-2488(96)01105-7] 


I. ASSOCIATIVITY CONDITION FOR QUATERNIONIC PROJECTIVE GROUP 
REPRESENTATIONS 

In quaternionic quantum mechanics, all symmetries of the transition probabilities are gener¬ 
ated by unitary transformations acting on the states of Hilbert space. 1 ' 3 In the simplest case, the 
unitary transformations U a ,U b ,... form a representation (or vector representation) of the symme¬ 
try group with elements a,&,..., 


U b U=U ba . (1) 

A more general possibility is that the group multiplication table is represented over the rays 

corresponding to a complete set of physical states, but not over individual state vectors chosen as 
ray representatives. This more general composition rule defines a quaternionic projective repre¬ 
sentation (or ray representation), and takes the form (Ref. 4, Sec. 4.3) 

U b U a \j) = U ha \f)co(f;b,a), \a>(f;b,a)\=l, (2) 

for one particular complete set of states |/) and a set of quaternionic phases a>(f;b,a). When we 
change ray representative from |/) to \f^)=\f)</>, with |<£|=1, the phase defining the projective 
representation is easily seen to transform as 

\b,a)= <f>a>(f\b 9 a)<f>, (3) 

with the bar denoting quaternion conjugation. Equation (3) shows clearly that the projective phase 
a) must depend on the state label / as well as on the group elements a,b\ failure to take this into 
account can lead 4 to erroneous conclusions (as in Ref. 5) concerning quaternionic projective 
representations. 

The defining relation for quaternionic projective representations given in Eq. (2) can be 
rewritten in operator form by defining a left-acting operator ft (b,a). 


n(M) = S |/M/;M)</|, 

/ 



a) Electronic mail address: adler@sns.ias.edu 
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which, using Eq. (3), is seen to be independent of the ray representative chosen for the states |/). 
Multiplying Eq. (2) from the right by (f\ and summing over the complete set of states |/), we 
obtain the operator form of the projective representation, 


U b U a =U ba £l(b,a). (4b) 

It is also immediate from the definition of Eq. (4a), and the fact that |co|— 1, that the operator 
£l(b,a) is quaternion unitary, 

£l(b,a)*£l(b,a) = {l(b,a)£l(b,a)*= 1. (5) 

Note that if we were to make the definition of a quatemionic projective representation more 
restrictive by requiring that Eq. (2) hold for all states in Hilbert space, rather than for one 
particular complete set of states, then we would require ft(fc,a) = l, since the unit operator is the 
only unitary operator which is simultaneously diagonal on all complete bases in quatemionic 
Hilbert space. Hence this more restrictive definition excludes quatemionic embeddings of complex 
projective representations, whereas these are admitted as quatemionic projective representations 
by the definition of Eq. (2). 

A nontrivial condition on the projective representation structure is obtained from the associa¬ 
tivity of multiplication in quatemionic Hilbert space, which implies 


(U c U b )U a =U c (U b U a ). 



Applying Eq. (4b) twice to the left-hand side of Eq. (6), we obtain 


( U c U b )U a =U cb £l{c,b)U a =U cb U a U- a '£l{c,b)U a =U cba £l{cb,a)U-'£l{c,b)U a , (7a) 


while applying Eq. (4b) twice to the right-hand side of Eq. (6) gives 


U c {U b U a ) = U c U h Mb,a) = U ch McM)£l{b,a). (7b) 

Upon multiplying from the left by U~^ a , Eqs. (7a) and (7b) give the operator form of the asso¬ 
ciativity condition’. 


n(c,ba)n(b,a) = n,(cb,a)uyn,(c,b)u a . ( 8 ) 

We can also express the associativity condition as a condition on the quatemionic phase 
( o(f \b,a ) introduced in Eq. (2), by applying the spectral representation of Eq. (4a) to the operator 
form of the associativity condition given in Eq. (8). From Eq. (4a) we obtain 

ft(c,fca) = 2 |/>a>(/;c,fca)(/|, (9a) 

which when multiplied from the right by Eq. (4a) gives 

ft(c,fca)ft(fc,a)=2 \f)(o(f;c,ba)co(f;b,a)(f\. (9b) 

Equation (4a) and the unitarity of ft (cb,a) also imply that 

ft(cM)“ 1 = 2 |/W/;cM)</|, (9c) 

and so the associativity condition of Eq. (8) can be rewritten as 
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U~ l £l(c,b)U a =£l(cb,a)~ l n(c,ba)n(b,a)=^ \f)c*(J;cb,a)cj(f;c,ba)(o(f;b>a)(f\. 

( 10 ) 

Hence U~ l Q(c,b)U a is diagonal in the basis spanned by the states |/). Taking matrix elements 
of Eq. (10), and using the unitarity of U a , the associativity condition gives the two relations 

co{f-,cb,a)co{f-c,ba)co(f-,b,a)=^ (f"\U a \f)co{f"-,c,b){f\ U a \f), (11) 

rll 


and, when (/'|/)=0, 


o = E (f"\U a \f)co(f"-,c,b)(f"\U a \f). (12) 

r 

We conclude this section by comparing the quaternionic Hilbert space form of the associa- 
tivity condition with the simpler form which is familiar from complex Hilbert space. ’ In a 
complex Hilbert space, the phase a)(f\b,a) introduced in Eq. (2) is a complex number, and 
commutes with the phase <j>, also now complex, which we introduced in Eq. (3) to describe a 
change of ray representative. Hence Eq. (3) implies, in the complex case, that o)(f\b,a) is inde¬ 
pendent of the ray representative chosen for the state |/), and it is then consistent to assume that 
to(f;b,a ) is independent of the state label /, so that 

(o(f\b,a)= (o{b,a) complex Hilbert space. (13a) 

Substituting Eq. (13a) into Eq. (4a), we now obtain 

0(M) = 2 \f)<*>(b,a)(f\= (o(b,a)^ \f)(f\ = (o(b,a) 1, (13b) 

/ / 

where 1 denotes the unit operator in complex Hilbert space. Since the complex phase a>(b,a) is a 
c -number in complex Hilbert space, on substituting Eq. (13b) into Eq. (4b) we learn that 

U b U a =U b Mb,a) = o>(b,a)U baf (14a) 

which is the standard definition of a projective representation in complex Hilbert space. Moreover, 
since Eq. (13b) implies that 0(6?,a) commutes with the unitary operator U a , the associativity 
condition of Eqs. (8) and (11) reduces to the familiar complex Hilbert space form 

(o(c,ba)(o(b,a)= <o(cb,a)(o(c,b). (14b) 

II. THE ASSOCIATIVITY CONDITION IN GENERATOR FORM 

Let us now assume that the symmetry group with which we are dealing is a Lie group, so that 
in the neighborhood of the identity e the unitary transformations U a ,Ub> U ba can be written in 
terms of a set of anti-self-adjoint generators G A as 

^= e *p(s ^G a ), C/i = exp|E & a G a ). C/ ia = exp|E (15a) 

with 6*4=0 and £7^=1. Then Eq. (4b) implies that 0(6?, a) must be unity when either a or b is the 
identity, and thus the generator form for this operator is 
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£l(b,a)=ex p 



e» B e c e A ^c)A^ 

c c 



, (15b) 


where the parentheses () around a set of indices indicate that the tensor in question is symmetric 
in those indices, and where we use the tilde to indicate operators which are anti-self-adjoint. The 
parameters B h £ must be functions of the parameters 0 A and 0 B , 


e b c a =i/, b c %{(? B },{6 a A })=6 b c +e a c +^'Z c BAC e b B <r A +o(d 3 ), (i5c) 

* BA 

where in making the Taylor expansion we have used the fact that U be = U b and U ea = U a , which 
fixes the linear terms in the expansion and requires the quadratic term to be bilinear. 

We proceed now to derive a number of relations by combining the generator expansions of 
Eqs. (15a)-(15c) with the formulas of Sec. I. We begin by substituting Eqs. (15a)-(15c) into Eq. 
(4b) using the Baker-Campbell-Hausdorff formula, 

exp X exp T=exp(X+T+ \\X, y]+---), (16a) 

to combine exponents arising from the factors on the left and right. From the left-hand side of Eq. 
(4b) we obtain, 


U b U a = Qxp 

while from the right-hand side of Eq. (4b) we obtain 

c/ ia a(M)=exp 2 {d b c+eP c )G c +\2 c BAC 6 b B tr A G c + l -'Z e^J^+oie 3 ) 

\ C ± CBA l BA 

Equating Eqs. (16b) and (16c) thus gives the relations 

[G B , G a \=^ C[ BA ] C G c +/[ Bj 4 ] 



0 b g 


B 


+ S 0 a A G A 


l BA 


e b B e° A [G B , g a ]+o( 6 3 ) 


(16b) 


(16c) 


(17a) 


and 



C(ba)cGc+I(ba) » 


(17b) 


where the square brackets [ ] around a set of indices indicates that the tensor in question is 
antisymmetric in these indices. We shall restrict ourselves henceforth to the case in which 
C (BA)C =0, which by Eq._(17b) implies that I(ba) = 0; making this assumption then implies that 
C BAC = C[ba]c and Iba = I[ba] ■ I n other words, we are assuming that the structure constants C BAC 
for a projective representation have the same antisymmetric form as holds for a vector represen¬ 
tation. Changing the summation index C to D in Eq. (17a), and then taking the commutator of Eq. 
(17a) with G c , we find 


[ G c> I g b> G a ]]-2 C[ba]d[G c , G d ] + [G c , I[BA]]'* 


(18a) 
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adding to this identity the two related identities obtained by cyclically permuting A,J3, C, using the 
fact that the left-hand side of the sum vanishes by the Jacobi identity for the commutator, and 
substituting Eq. (17a) for the commutators appearing on the right-hand side of the sum, we obtain 
the identity 


2 


( C[ba]dC[cd]e+ C[cb]dC[ad]e+ C[ac]dC[bd]e)Ge 


+ 2 ( G [BA]D^[CD] + G [Cfl]D^[AD] + ^[AC]D^[BD]) 


+ [ G C> *[A4]] + [ G A» 7 [cb]] + [ g b> 7 [ ac ]]- 0 . 


(18b) 


We next substitute Eqs. (15a)-(15c) into the associativity condition of Eq. (8), now keeping 
cubic terms in the exponent of the form 0^8 b B 8 c c , but dropping cubic terms, such as 0%8 a B 8 c c , that 
do not contain all three of the upper indices a,b,c. For the first factor on the left-hand side of Eq. 
(8), we find from Eqs. (15b) and (15c) that 


1 / \ 
£l(c,ba) = QX p( “ 2 ( ^B^A^fBAJ + S ^B(AC) 


= exp| \ 2 

l BA 


<£+*A+^2 C 


DE 


r&) r\a 

[de]aVdVe 


[BA] 


+ 22 Vb^Jb’ac, 


(19a) 


while for the second factor on the left-hand side of Eq. (8) we have 


n(M) = exp( \ S tfs&AhBA])- 

' l BA 1 


/ 


(19K> 


Since the exponents in Eqs. (19a) and (19b) both begin at order f? 2 , through order $ we can simply 
add exponents to get the product on the left-hand side of Eq. (8). Proceeding similarly for the first 
factor on the right-hand side of Eq. (8), we obtain 


I 1 ^ 

fl(cb,a) = exp( - Z, 

l BA 


®B ^A-^BAj+S fie!!} &C ^A*^(BC)A 

0 


= expi l - E 

l BA 


1 


2 2 C[DE]B&D&E &a![BA] 


+ 2^) @b&c0aJ\bc)a 


(20a) 


while for the second factor on the right-hand side of Eq. (8), use of the Baker-Campbell- 
Hausdorff formula gives 


U a l Sl(c,b)U a = ex p 



*a g 


1 ^ h ~ 

A|eXp(-2, ^C^BhcB] ) ex P 

l CB ' 



*A G 
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= g e c c 6 b B i [CBr \ g g o a A e c c G b B [G A , / [Cfl] ]j. (20b) 

Since the exponents in Eqs. (20a) and (20b) begin at order it again suffices to simply add the 
exponents to form the product appearing on the right-hand side of Eq. (8). Thus, to the requisite 
order, the content of Eq. (8) is obtained by equating the sum of the exponents in Eqs. (19a) and 
(19b) to the corresponding sum of exponents in Eqs. (20a) and (20b). The quadratic terms in G are 
immediately seen to be identical on left and right, while the cubic term proportional to 0 A 0 B G L C 
gives (after some relabeling of dummy summation indices) the nontrivial identity 


712) 

J C{BA) 


1 ^ 

+ t2 c, 

4 D 


[BA]D^[CD]“^(CB)A + 4 2) C[CB]dI[DA ] 2 [Ga > I[CB]\- 


f Ec 


( 21 ) 


On totally antisymmetrizing with respect to the indices A,J3,C, the terms in Eq. (21) involving 
ji^. 2 ) djqp outj and We aj-g j e f t identity 



D 


(C[ba]dI[Cd]+ C[cb]dI[ad]+ C[ac]dJ[bd]) + [Gc> I[ba]] + [Ga ^[cb]] + [^b»^[ac]] — 0- 


(22a) 


In other words, associativity implies that the sum of the second and third lines of Eq. (18b) 
vanishes separately, hence the first line of Eq. (18b) must also vanish, and since the generators G E 
are linearly independent this gives the Jacobi identity for the structure constants, 



(C[ba]dC\cd]e+ C[cb~\dC[ad~\e+ C[ac~\dC[bd~\e) — ®’ 


(22b) 


In the complex case, in which fl(a,fc) = a>(a,b) 1 is a c-number, the tensor I[ab] * s a c-number 
“central charge” and the commutator terms in Eqs. (18b) and (22a) vanish identically. Therefore, 
in the complex case, Eq. (18b) implies both Eq. (22b) and the identity 


2 (C[ba]d^[cd]+ C'[ CB ] D /[ j 4 D ] + C[ j 4 C ] D /[ BD ]) — 0 complex case, (23) 

and so one obtains the entire content of the associativity condition from the simpler analysis 
leading to Eq. (18b), without having to perform the third-order expansion needed to obtain Eq. 
(22a). 


III. GENERAL, MULTI-CENTRAL, AND CENTRAL QUATERNIONIC PROJECTIVE 
REPRESENTATIONS 

The analysis of Sec. II applies to the general case (apart from the restriction C (Bj4)C =0) of a 
quaternionic projective representation; in order to obtain more detailed results it is necessary to 
introduce further structural assumptions. In Ref. 4 two special classes of quaternionic projective 
representations are defined. A quaternionic projective representation is defined to be multi-central 
if 


[ft(M), f/ a ]=[n(Z?,a), U b ] = 0, all a,b , 
while it is defined to be central if 

[ft(fc,a), £/ c ] = 0, all a,b,c. 


(24a) 


(24b) 
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Expressed in terms of the generators introduced in Eqs. (15a)-(15b), the multi-centrality condition 
takes the form 


2 Qa®b®c\-Gc ■> I[BA\\ 2 

ABC ABC 


&a@b@c\-Gc » ^[ba]]“'0» 


while the centrality condition becomes 


2 

ABC 



all a,b,c. 


(25a) 


(25b) 


Making the definition 


A[ab~\c~ [^c» (25c) 

we see from Eq. (25a) that multi-centrality requires that A [j4B j C be antisymmetric in A,C and in 
5,C as well as in A,B; thus in the multi-central case A is totally antisymmetric, which we will 
indicate by writing it as A [j4BC j . From Eq. (25b), we see that centrality requires that \ AB]C must 
vanish. 

Using the generator formulation, we proceed now to discuss successively the general, multi¬ 
central, and central cases in the light of the associativity analysis of Sec. II. 

(1) The general case. An example given in Eqs. (13.54g) and (14.23a) of Ref. 4 shows that 
one can have a quatemionic projective representation which is neither central nor multi-central. 
The example is constructed from n independent fermion creation and annihilation operators b/ , 
b/> / = l,...,n, which commute with a. left algebra quaternion basis E 0 = l,E l ~I,E 2 = J>E 3 = K. 
Consider the set of three generators G A defined by 

G a =-\E a N , A =1,2,3, (26a) 


with N the number operator 


n 


b)?bf. 


(26b) 


The commutator algebra of these generators has the form of a projective representation of SU(2), 


3 

[Gtf, G a ]=~ 2 f[BAC]Gc + ^ 

C= 1 


[BA] 


(26c) 


^ 1 

J[ba] — € [bac] 2 £cA r (A r_ 1)> 


with e the usual three-index antisymmetric tensor. A simple calculation now shows that 

[G a , T[BC]]=-m-l)(S AB G c -S AC G B ), 


(27a) 


which is not antisymmetric in either the index pair A, C or the pair A,B, and so the multi-centrality 
condition is not satisfied. Another simple calculation shows that 


2 ( € [bad]I[cd] + € [cbd]I[ad]+ € [acd]I[bd]) — 0. 


(27b) 
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by virtue of the Jacobi identity for the structure constant e, and also 


[ g cJ[ba]] + [GaJ[cb-\\ + [ g bJ[ac]]-®- (27 c) 

Hence the associativity condition of Eq. (22a) is satisfied, with the first and second lines each 
vanishing separately. 

(2) The multi-central case . Let us now consider the multi-central case, in which A [AB j C 
defined in Eq. (25c) is totally antisymmetric in A,5,C, as indicated by the notation A [ABC] . The 
associativity condition of Eq. (22a) then simplifies to 

2 (C'[ba]d7[cd]+ C'[ CB ] D /[ j4D ]+ C'[ j 4c]d7[bd]) + 3A[ j4BC ] = 0. (28a) 

A further equation involving A is obtained from the Jacobi identity 

[G d , [G c , 7 [ Bj4 ]]]-[G c ,[G d , I[ba]]] = U[ba] > [G c » G D ]], (28b) 

which on substituting Eqs. (17a) and (25c) becomes 


[G D , A [ab]c! [G c , A[ AB ] D ]— 2 G[ CD ] £ A[ AB ] £ +[/[ BA ], /[cd]]» (28c) 

E 

an equation which holds even in the general case in which A is not totally antisymmetric. Spe¬ 
cializing Eq. (28c) to the multi-central case and contracting it with S AC S BD , the left-hand side 
vanishes because of the antisymmetry of A, while the commutator term on the right-hand side 
becomes X ab U[ba j leaving the identity (after relabeling the dummy index E as C) 

2 C[ab]c&[abc] = 0- (29) 

Thus in order for a multi-central projective representation to exist which has A^O and so is not 
also central, there must be a three-index antisymmetric tensor A [ABC j which vanishes when all 
three indices are contracted with the structure constant C [AB j C . This condition is not easy to 
satisfy and so we pose the question, which we have not been able to answer: Can one construct an 
example of a multi-central quaternionic projective representation which is not central, or can one 
prove (in general, or with a restriction, e.g., to simple or semi-simple groups) that a multi-central 
quaternionic projective representation must always be central? The application of multi-centrality 
in Ref. 4 sheds no light on this issue; multi-centrality was used there (e.g., in Sec. 12.3) to show 
that quaternionic Poincare group projective representations outside the zero energy sector can 
always be transformed to complex Poincare group projective representations, which in the sector 
continuously connected to the identity are known 8 to be transformable to vector representations. 

(3) The central case. Let us finally consider the central case in which A=0, which by Eqs. 
(25c) and (28c) implies that I [BA] commutes with both G c and / [CD] for arbitrary values of the 
indices. Thus I [B a] behaves as a central charge, justifying the name “central” for this case. The 
various results obtained in Bargmann 6 can be immediately generalized to the quaternionic central 
case; for example, the analysis of Ref. 6 can be easily extended to show that the central charges 
associated with a quaternionic central projective representation of a semi-simple Lie group can 
always be removed by redefinition of the generators; and again, the nontrivial illustration 6 of a 
complex projective representation, constructed in terms of the phase space translation generators 
in nonrelativistic quantum mechanics, can be embedded 4 in quaternionic quantum mechanics as a 
central projective representation. 
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In a series of papers published in this journal, a discussion was started on the 
significance of a new definition of projective representations in quaternionic Hilbert 
spaces. In the present paper we give what we believe is a resolution of the semantic 
differences that had apparently tended to obscure the issues. © 1997 American 
Institute of Physics. [S0022-2488(97)01709-X] 


I. WIGNER’S THEOREM REVISITED 

We must first harmonize the notations in papers 1-4 that were written more than 30 years apart, 
and for different audiences. Let be a quaternionic Hilbert space. In order to facilitate the 
transcription to Dirac’s bra-ket notation, we write the multiplication by scalars on the right, with 
the scalar product defined to be linear in its second term: 


in conformity with \<l>q)=\<l>)q . 

Under the initial assumptions of Wigner, 5 reformulated by Bargmann, 6 or the assumptions of 
Emch and Piron, a symmetry (l is defined as a map that preserves transition probabilities between 
rays, or equivalently as an automorphism of the orthocomplemented lattice the elements 

of which are the closed subspaces (i.e., the projectors) of the Hilbert space . 

The theorem known as Wigner’s theorem (by physicists), and as the infinite-dimensional 
version of the fundamental theorem 8 of projective geometry (by mathematicians) asserts that every 
symmetry is implemented by a counitary operator U , satisfying 

P e W e )^>il[P] = U*PU , (2) 

with 

U(if/q) = (Uif/)au[q\y if/BJ&H and qeJ$, (3a) 

and 

a ulq] = (t) u^ a) u^ for some with (o*(t) U =\\ (3b) 

i.e., a v is an automorphism of the field of quaternions. The counitarity of U means that 

U*U=UU*=I, so that (U$,U4>) = ad(*l',4>)]> (3c) 

which reflects the fact that for a colinear operator A the adjoint is defined by 

(A*iA,</>) = a^ 1 [(iA,A</))]. (4) 

Conversely, every counitary operator implements a symmetry. 
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Finally, a symmetry determines the counitary operator that implements it, uniquely up to a 
“phase;” specifically the quaternionic form of Schur’s lemma 1 implies that two counitary opera¬ 
tors U i and U 2 implement the same symmetry if and only if there exists a unit quaternion a>, such 
that U 2 = U i C a} , where C w is the counitary operator defined by 


tfro). (5) 

Indeed, 

Pb^^CZPC^P. ( 6 ) 

Hence, for every symmetry separately, one can choose a unitary operator to implement this 
symmetry; and this unitary operator is unique up to a sign. 

So far, and as long as each symmetry is treated separately, the above approach covers the 
premises of both Adler 2 and Emch. 1 


II. STRONG AND WEAK PROJECTIVE REPRESENTATIONS 

When an abstract group G is represented as a group of symmetries, i.e., when a symmetry 
ft(g) is assigned to every g e G in such a manner that 

H‘(g\)H‘(g2) = H‘(g\gl), V (gl.g2) eGXG . ( 7a ) 

i.e., 

/ , E^^)M>^(g 1 )[>(g 2 )[/ , ]]=Mgig 2 )[ 7, ]> V(gj ,g 2 )eGxG, (7b) 

one can repeat the above procedure for each g separately, and obtain a lifting by unitary operators 
U(g ), satisfying 


U(gi)U(g 2 )=±U(g lg2 ). (8) 

When G is a Lie group, and (l is a continuous representation, the brutal lifting just described 
may, however, not lead to a continuous unitary representation. As physics needs continuity to 
define the observables corresponding to the generators of the unitary representation, it is reassur¬ 
ing to know that continuity is obtained, nevertheless, 1 as a result of the following procedure. 

First, one shows that there always exists a continuous local lifting by counitary operators, thus 
satisfying the condition 


U(g\)U(g 2 ) — U(g\g 2 )Cw(g ly g 2 ) ■ (9) 

In this expression C w is a counitary operator, defined as in (5), where now u)= is a 

continuous function of each of its arguments, takes its values in the unit quaternions, and satisfies, 
besides the trivial conditions o>(g,e) = u)(e,g), the 2-cocycle condition: 

*>(g i ^ 82 g 3 )^(g 2 ^g 3 ) = (o(gig 2 t g 3 )a u -i[a>(g l ,g 2 )]; (10a) 

^3 

for the purpose of ulterior comparison with (13), we rewrite (10a) as 

> 8283 )^ w(s 2 .s 3 )“^w( 5 1 « 2 .« 3 )^ 3 ^(10b) 

Second, one shows that such a lifting is always equivalent to a continuous, unitary, local , but 
true representation (i.e., no a), not even a ± sign, ambiguity). 
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Third, whenever the Lie group G is simply connected, this can be extended to a continuous, 
unitary representation of the whole group G. In cases where the group is doubly connected (e.g., 
the rotation group in three dimensions), one only obtains the above result for its covering group; 
it is when one has to consider the group itself that the ± ambiguity of (8) can possibly manifest 
itself. As the latter amendment (covering multiply connected groups) is not germane to the issue 
on which we want to concentrate in this paper, we will not pursue that part of the discussion here. 

The straightforward generalization we just sketched, extending to quatemionic Hilbert spaces 
the analysis familiar from the complex Hilbert spaces situation presents one remarkable feature: 
the “phase reduction’’ is always locally trivial. Mathematically, this can be understood 1 from the 
fact that the local phase reduction amounts to finding, up to equivalence, all the extensions 9 of the 
Lie algebra of G by the Lie algebra of the group of automorphisms of the field of quaternions; as 
the latter happens to be the semisimple Lie algebra su(2,C), all such extensions are trivial. 10 In 
this respect the complex case is much more involved, as shown by Bargmann. 11 In particular, the 
phase reduction is not locally trivial for the Galilei group, a fact that is interpreted as viewing the 
mass as parametrizing the sectors of a superselection rule. Two attitudes are possible in this 
juncture. The first, which was chosen by Emch, 1 was to accept that Galilean QM is different in its 
quatemionic realization from what it is in its complex realization. The second is to pursue the 
issue, and to generalize the definition of a projective representation; this was recently proposed by 
Adler. 2 

Translated in the notation of this paper, Adler’s proposal 2 is to replace condition (9) by the 
weakened condition, 


U(gi)U(g2)-U(g i g 2 )L n ( giyg2) , 


where L^ g yg ^ is the linear operator, 



L ci( gl ,g 2 )<P='L ( 12 ) 

k 

with o) k (gi ,g 2 )*<°k(gi ,gi) = 1 and $ = {<f> k \k= 1,2,...} is a complete orthonormal basis in 
the same for all pairs (gi,g2) elements of G. Note that 

^n{g 1 ^n(g 1 g 2l g 3 )^ 3 ^n(g 1 ,g 2 )^g 3 * 0^) 


III. DISCUSSION 

While {11,13} look somewhat similar to {9,10fc}, there are major differences between these 
two formulations; our purpose in this paper is to delineate sharply the scope and reach of these 
variations. 

First, (9) is a direct consequence of the condition (7). Hence one should expect condition (7) 
to be violated by (11). This is indeed the case: see (16) below. Recall that (7) is the defining 
condition for the usual definition of a projective representation, as is the projective space 

associated to the vector space It is, in fact, equivalent to (9), and it is the condition Adler 2 
refers to as the defining property of a strong projective representation, in opposition to (11), which 
is equivalent to (16), and which he introduces as the definition of a weak projective representation. 

Second, (9) is a relation among essentially counitary operators. It is true, as we just men¬ 
tioned, that a powerful theorem 1 allows us to reduce the phases and thus to obtain a locally trivial 
continuous unitary representation, so that (9) becomes ultimately a relation between linear opera¬ 
tors. Nevertheless, this reduction is not instructive in the present juncture since it is (9) itself [not 
(8)] that serves as a motivation for the extension (11). By contrast, (11) is in its very essence a 
relation between unitary operators; in particular, L is a linear operator (in fact, a unitary operator) 
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that involves the choice of a complete orthonormal basis <& = {<p k \k= 1,2,...}; i.e., the focusing on 
one complete set of commuting observables, or more precisely, on a discrete, maximal Abelian, 
real subalgebra, 



A: ips ip — $k a k($k »00 ^ 

k 



the minimal projectors of which are the projectors P* on the one-dimensional rays corresponding 
to each element <p k of the chosen basis d>. We denote by the Boolean sublattice of 

generated by these projectors. 

Third, as a consequence of the above remark, whereas the colinear operators 111 W 

implement the trivial symmetry [see (6)]—and are, in particular, independent of any choice of a 
Hilbert space basis—that is not the case for the symmetry implemented by the linear operators 
^o,(g 1 ,g 2 )' Indeed, we ^ ave genetically only 


P*&i^*)~LZ (gi<g2) PL 0{t 2) = P. 



Hence, the symmetry implemented by U(g l g 2 ) coincides with the symmetry implemented by 
U(gi)U(g 2 ) only for the elements of the distinguished maximal Abelian algebra chosen to 
define the linear operators ; 

Pe5 z (^ 4) )H^ / i(g 1 )[ / i(g 2 )[/ J ]] = / i(g 2 g 2 )[/ J ], V(g!,g 2 ) eGxG. (16) 


This, compared to (7), is the major difference between the conditions defining weak versus strong 
projective representations. While both require, for each symmetry separately , that pu{g ) be an 
automorphism of the whole system (a condition necessary to support the use of Wigner’s theo¬ 
rem), the difference appears when it comes to the representation of a group of symmetries: the 
strong definition requires (7b), i.e., that pu is a representation on the full ^J^), whereas the 
weak definition requires only (16), i.e., that this condition hold on 

This is the price one must be prepared to pay for the relaxing from the ‘ ‘strong’ ’ condition (9) 
to the “weak” condition (11)—which is the generalization proposed by Adler. 2 At this price, it 
has become possible * * to classify the irreducible weakly projective representations of con¬ 
nected Lie groups; to embed complex projective representations into weakly projective quater¬ 
nionic representations (even when the Bargmann complex phase reduction is not locally trivial); to 
construct quaternionic coherent states (including the weakly projective case); and to discuss how, 
in the complex case, the weak condition (11) already implies the stronger condition of (9). 

After comparing their original motivations, the authors realized how they both had hoped to 
take advantage of the SU(2) symmetry of the quaternions: Emch 1 was interested in finding some 
natural coupling between the inhomogeneous Lorentz group of special relativity and the internal 
symmetries then known in elementary particle theory; Adler 2 was similarly interested in finding a 
source in the ray structure of Hilbert space for the color symmetry. It seems fair to say that, even 
with the generalization proposed by Adler, the structure of the current quaternionic models for 
quantum theories is not (yet) rich enough to accommodate dreams that extend beyond the complex 
Hilbert space formalism. 
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